NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|755514106|ref|XP_011239133|]
View 

nuclear receptor corepressor 2 isoform X32 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
GPS2_interact pfam15784
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ...
143-229 1.32e-40

G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.


:

Pssm-ID: 464868 [Multi-domain]  Cd Length: 89  Bit Score: 145.39  E-value: 1.32e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   143 GKLEPVSPPSPPHADPELELAPSRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIESK 222
Cdd:pfam15784    3 PQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSESK 82

                   ....*..
gi 755514106   223 HRSLVQI 229
Cdd:pfam15784   83 HRSLAQI 89
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
610-653 7.65e-13

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


:

Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 64.83  E-value: 7.65e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 755514106   610 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 653
Cdd:pfam00249    3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
SANT super family cl21498
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
432-475 1.63e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


The actual alignment was detected with superfamily member cd11661:

Pssm-ID: 473887 [Multi-domain]  Cd Length: 46  Bit Score: 46.84  E-value: 1.63e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 755514106  432 WSEQERDTFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 475
Cdd:cd11661     2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1715-2224 4.12e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 4.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1715 VPPTPGTPATAIDRLAYLPTAPPPFSSRHSSSPLSPGGPTHLAKPTATSSSERERERERERDKSiltsTTTVEHAPIWRP 1794
Cdd:PHA03247 2591 APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRD----DPAPGRVSRPRR 2666
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1795 GTEQSSGAGGSSRPASHTHQHSPISPRTQDALQQRPSVLHNTSMKGVVTSVEPGTPTVLRWARSTSTSSPVRPAATFPPA 1874
Cdd:PHA03247 2667 ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1875 ThcplggtleGVYPTLMEPVLLPKETSRVARPERPRVDAGHAFLTKPPAREPASSPSKSSEPRSLAPPSSSHTAIARTPA 1954
Cdd:PHA03247 2747 G---------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAA 2817
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1955 KNLAPHHASPDPPAPTSASdlhREKTQSKPFSIQELELrslgyhsGAGYSPDGVEPISPVSSPSLTHDKGLSKPLEElek 2034
Cdd:PHA03247 2818 LPPAASPAGPLPPPTSAQP---TAPPPPPGPPPPSLPL-------GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR--- 2884
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2035 shlegelRHKQPGPMKlSAEAAHLPHLRPLPESQPSSSPLLQTAPGIKGHQRVVTLAQHISEVITQDYTRHHPQQLSGPL 2114
Cdd:PHA03247 2885 -------RLARPAVSR-STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2115 PAPLYSFPGASCP----VLDLRRPPSDLYLPPPDHGTPARgsphseggKRSPEPSKTSVLGSSEDAIEPVSPPEGMTEPG 2190
Cdd:PHA03247 2957 GAVPQPWLGALVPgrvaVPRFRVPQPAPSREAPASSTPPL--------TGHSLSRVSSWASSLALHEETDPPPVSLKQTL 3028
                         490       500       510
                  ....*....|....*....|....*....|....*....
gi 755514106 2191 HA-----RSTAYPLLYRDGEQGEPRMGSKSPGNTSQPPA 2224
Cdd:PHA03247 3029 WPpddteDSDADSLFDSDSERSDLEALDPLPPEPHDPFA 3067
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
161-411 1.40e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.36  E-value: 1.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   161 ELAPSRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKppepekpvspppIESKHRSLVQIIYDENRKKAEA 240
Cdd:TIGR02168  240 ELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE------------LQKELYALANEISRLEQQKQIL 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   241 AHRIleglgpqvelplynqpsdtRQYHENIKINQAMRKKLilyFKRRNHARK---QWEQRFCQ---RYDQLMEAWEKKVE 314
Cdd:TIGR02168  308 RERL-------------------ANLERQLEELEAQLEEL---ESKLDELAEelaELEEKLEElkeELESLEAELEELEA 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   315 RIENNPRRRAKESKVREYYEKQFPEIRKQRELQERMQSRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEnLEKQMRQLAV 394
Cdd:TIGR02168  366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE-LKELQAELEE 444
                          250
                   ....*....|....*..
gi 755514106   395 IPPMLYDADQQRIKFIN 411
Cdd:TIGR02168  445 LEEELEELQEELERLEE 461
RSC8 super family cl34960
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ...
535-645 2.93e-04

RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];


The actual alignment was detected with superfamily member COG5259:

Pssm-ID: 227584 [Multi-domain]  Cd Length: 531  Bit Score: 46.03  E-value: 2.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  535 ENEKEELSKEKTDDTSGEDNDEKEAVASKGRKTANS----QGRRKGRITrSMANEANHEETatPQQSSELASMEMNESSR 610
Cdd:COG5259   202 LKSPKKESQGKVDELKDHSEKHPSSCSCCGNKSFNTryhnLRAEKYNSC-SECYDQGRFPS--EFTSSDFKPVTISLLIR 278
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 755514106  611 ---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 645
Cdd:COG5259   279 dknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
PHA03247 super family cl33720
large tegument protein UL36; Provisional
946-1203 9.03e-03

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 9.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  946 PAGDPRASTSPQK-PLDLKQLKQRAAAIPPIVTKVHEPPREDTVPPKPVPPVPPPTQHLQPEGDVSQQSGGSPRGKSRSP 1024
Cdd:PHA03247 2604 DRGDPRGPAPPSPlPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP 2683
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1025 VPPAEKEAEKPAFFPAFPTEGPKLPTEPPR-WSSGLPFPIPPREVIKTSPHAADPSAFSYTPPGHPLPLGLHDSARPVLP 1103
Cdd:PHA03247 2684 RRRAARPTVGSLTSLADPPPPPPTPEPAPHaLVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT 2763
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1104 RPPISNPPPLISSAKHPGVLERQLGAISQQGMSVQLRVPHSEHAKAPMGPLTMGLPLAVDPkklgtALGSATSGSITKGL 1183
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP-----AGPLPPPTSAQPTA 2838
                         250       260
                  ....*....|....*....|...
gi 755514106 1184 PSTRAADGPSYR---GSITHGTP 1203
Cdd:PHA03247 2839 PPPPPGPPPPSLplgGSVAPGGD 2861
 
Name Accession Description Interval E-value
GPS2_interact pfam15784
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ...
143-229 1.32e-40

G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.


Pssm-ID: 464868 [Multi-domain]  Cd Length: 89  Bit Score: 145.39  E-value: 1.32e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   143 GKLEPVSPPSPPHADPELELAPSRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIESK 222
Cdd:pfam15784    3 PQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSESK 82

                   ....*..
gi 755514106   223 HRSLVQI 229
Cdd:pfam15784   83 HRSLAQI 89
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
610-653 7.65e-13

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 64.83  E-value: 7.65e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 755514106   610 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 653
Cdd:pfam00249    3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
610-655 1.33e-10

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 58.39  E-value: 1.33e-10
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 755514106    610 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYKKR 655
Cdd:smart00717    3 EWTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLKP 49
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
610-653 1.88e-10

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 57.97  E-value: 1.88e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 755514106  610 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYK 653
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
432-475 1.63e-06

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 46.84  E-value: 1.63e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 755514106  432 WSEQERDTFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 475
Cdd:cd11661     2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
PHA03247 PHA03247
large tegument protein UL36; Provisional
1715-2224 4.12e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 4.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1715 VPPTPGTPATAIDRLAYLPTAPPPFSSRHSSSPLSPGGPTHLAKPTATSSSERERERERERDKSiltsTTTVEHAPIWRP 1794
Cdd:PHA03247 2591 APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRD----DPAPGRVSRPRR 2666
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1795 GTEQSSGAGGSSRPASHTHQHSPISPRTQDALQQRPSVLHNTSMKGVVTSVEPGTPTVLRWARSTSTSSPVRPAATFPPA 1874
Cdd:PHA03247 2667 ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1875 ThcplggtleGVYPTLMEPVLLPKETSRVARPERPRVDAGHAFLTKPPAREPASSPSKSSEPRSLAPPSSSHTAIARTPA 1954
Cdd:PHA03247 2747 G---------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAA 2817
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1955 KNLAPHHASPDPPAPTSASdlhREKTQSKPFSIQELELrslgyhsGAGYSPDGVEPISPVSSPSLTHDKGLSKPLEElek 2034
Cdd:PHA03247 2818 LPPAASPAGPLPPPTSAQP---TAPPPPPGPPPPSLPL-------GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR--- 2884
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2035 shlegelRHKQPGPMKlSAEAAHLPHLRPLPESQPSSSPLLQTAPGIKGHQRVVTLAQHISEVITQDYTRHHPQQLSGPL 2114
Cdd:PHA03247 2885 -------RLARPAVSR-STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2115 PAPLYSFPGASCP----VLDLRRPPSDLYLPPPDHGTPARgsphseggKRSPEPSKTSVLGSSEDAIEPVSPPEGMTEPG 2190
Cdd:PHA03247 2957 GAVPQPWLGALVPgrvaVPRFRVPQPAPSREAPASSTPPL--------TGHSLSRVSSWASSLALHEETDPPPVSLKQTL 3028
                         490       500       510
                  ....*....|....*....|....*....|....*....
gi 755514106 2191 HA-----RSTAYPLLYRDGEQGEPRMGSKSPGNTSQPPA 2224
Cdd:PHA03247 3029 WPpddteDSDADSLFDSDSERSDLEALDPLPPEPHDPFA 3067
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
432-471 8.88e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 44.80  E-value: 8.88e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 755514106   432 WSEQERDTFREKFMQHPKNFGLIASFLERKTVAECVLYYY 471
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQ 43
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
161-411 1.40e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.36  E-value: 1.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   161 ELAPSRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKppepekpvspppIESKHRSLVQIIYDENRKKAEA 240
Cdd:TIGR02168  240 ELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE------------LQKELYALANEISRLEQQKQIL 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   241 AHRIleglgpqvelplynqpsdtRQYHENIKINQAMRKKLilyFKRRNHARK---QWEQRFCQ---RYDQLMEAWEKKVE 314
Cdd:TIGR02168  308 RERL-------------------ANLERQLEELEAQLEEL---ESKLDELAEelaELEEKLEElkeELESLEAELEELEA 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   315 RIENNPRRRAKESKVREYYEKQFPEIRKQRELQERMQSRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEnLEKQMRQLAV 394
Cdd:TIGR02168  366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE-LKELQAELEE 444
                          250
                   ....*....|....*..
gi 755514106   395 IPPMLYDADQQRIKFIN 411
Cdd:TIGR02168  445 LEEELEELQEELERLEE 461
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
432-475 2.08e-04

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 41.06  E-value: 2.08e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 755514106    432 WSEQERDTFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTKK 475
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLK 48
RSC8 COG5259
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ...
535-645 2.93e-04

RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];


Pssm-ID: 227584 [Multi-domain]  Cd Length: 531  Bit Score: 46.03  E-value: 2.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  535 ENEKEELSKEKTDDTSGEDNDEKEAVASKGRKTANS----QGRRKGRITrSMANEANHEETatPQQSSELASMEMNESSR 610
Cdd:COG5259   202 LKSPKKESQGKVDELKDHSEKHPSSCSCCGNKSFNTryhnLRAEKYNSC-SECYDQGRFPS--EFTSSDFKPVTISLLIR 278
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 755514106  611 ---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 645
Cdd:COG5259   279 dknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
PTZ00121 PTZ00121
MAEBL; Provisional
169-627 2.96e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 46.67  E-value: 2.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  169 KEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEA-AKPPEPEKPVSPPPIESKHRSLVQIIYDENRKKAEAAHRILEG 247
Cdd:PTZ00121 1310 KAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAeAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  248 LGPQVELPlyNQPSDTRQYHENIKINQAMRKKLILYFKRRNHARKQWEQRFCQRYDQLMEAWEKKVE--RIENNPRRRAK 325
Cdd:PTZ00121 1390 KKKADEAK--KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEeaKKAEEAKKKAE 1467
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  326 ESKVREYYEKQFPEIRKQRELQERMQSrvGQRGSGLSMSAARSEHEVSEIIDGLSEQENLEKQMRQLAVIPPMLYDADQQ 405
Cdd:PTZ00121 1468 EAKKADEAKKKAEEAKKADEAKKKAEE--AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  406 RikfinmnglmdDPMKVYKDRQVTNmwSEQERDTFREKFMQHPKNFGL----IASFLERKTVAECVLYYYLTKKNENYKS 481
Cdd:PTZ00121 1546 K-----------KADELKKAEELKK--AEEKKKAEEAKKAEEDKNMALrkaeEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  482 LVRRSYRRRGKSQQQQQQQQQQQQQQMARSSQEEKEEKEKEKEADKEEEKQDAENEKEELSKEKTDDTSGEDNDEKEAVA 561
Cdd:PTZ00121 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755514106  562 SKGRKTanSQGRRKGRITRSMANEANHEETAtpQQSSELASMEMNESSRWTEEEMETAKKGLLEHG 627
Cdd:PTZ00121 1693 ALKKEA--EEAKKAEELKKKEAEEKKKAEEL--KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEE 1754
PHA03247 PHA03247
large tegument protein UL36; Provisional
946-1203 9.03e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 9.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  946 PAGDPRASTSPQK-PLDLKQLKQRAAAIPPIVTKVHEPPREDTVPPKPVPPVPPPTQHLQPEGDVSQQSGGSPRGKSRSP 1024
Cdd:PHA03247 2604 DRGDPRGPAPPSPlPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP 2683
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1025 VPPAEKEAEKPAFFPAFPTEGPKLPTEPPR-WSSGLPFPIPPREVIKTSPHAADPSAFSYTPPGHPLPLGLHDSARPVLP 1103
Cdd:PHA03247 2684 RRRAARPTVGSLTSLADPPPPPPTPEPAPHaLVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT 2763
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1104 RPPISNPPPLISSAKHPGVLERQLGAISQQGMSVQLRVPHSEHAKAPMGPLTMGLPLAVDPkklgtALGSATSGSITKGL 1183
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP-----AGPLPPPTSAQPTA 2838
                         250       260
                  ....*....|....*....|...
gi 755514106 1184 PSTRAADGPSYR---GSITHGTP 1203
Cdd:PHA03247 2839 PPPPPGPPPPSLplgGSVAPGGD 2861
 
Name Accession Description Interval E-value
GPS2_interact pfam15784
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ...
143-229 1.32e-40

G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.


Pssm-ID: 464868 [Multi-domain]  Cd Length: 89  Bit Score: 145.39  E-value: 1.32e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   143 GKLEPVSPPSPPHADPELELAPSRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIESK 222
Cdd:pfam15784    3 PQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSESK 82

                   ....*..
gi 755514106   223 HRSLVQI 229
Cdd:pfam15784   83 HRSLAQI 89
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
610-653 7.65e-13

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 64.83  E-value: 7.65e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 755514106   610 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 653
Cdd:pfam00249    3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
610-655 1.33e-10

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 58.39  E-value: 1.33e-10
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 755514106    610 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYKKR 655
Cdd:smart00717    3 EWTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLKP 49
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
610-653 1.88e-10

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 57.97  E-value: 1.88e-10
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 755514106  610 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYK 653
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
432-475 1.63e-06

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 46.84  E-value: 1.63e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 755514106  432 WSEQERDTFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 475
Cdd:cd11661     2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
PHA03247 PHA03247
large tegument protein UL36; Provisional
1715-2224 4.12e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 4.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1715 VPPTPGTPATAIDRLAYLPTAPPPFSSRHSSSPLSPGGPTHLAKPTATSSSERERERERERDKSiltsTTTVEHAPIWRP 1794
Cdd:PHA03247 2591 APPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRD----DPAPGRVSRPRR 2666
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1795 GTEQSSGAGGSSRPASHTHQHSPISPRTQDALQQRPSVLHNTSMKGVVTSVEPGTPTVLRWARSTSTSSPVRPAATFPPA 1874
Cdd:PHA03247 2667 ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1875 ThcplggtleGVYPTLMEPVLLPKETSRVARPERPRVDAGHAFLTKPPAREPASSPSKSSEPRSLAPPSSSHTAIARTPA 1954
Cdd:PHA03247 2747 G---------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAA 2817
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1955 KNLAPHHASPDPPAPTSASdlhREKTQSKPFSIQELELrslgyhsGAGYSPDGVEPISPVSSPSLTHDKGLSKPLEElek 2034
Cdd:PHA03247 2818 LPPAASPAGPLPPPTSAQP---TAPPPPPGPPPPSLPL-------GGSVAPGGDVRRRPPSRSPAAKPAAPARPPVR--- 2884
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2035 shlegelRHKQPGPMKlSAEAAHLPHLRPLPESQPSSSPLLQTAPGIKGHQRVVTLAQHISEVITQDYTRHHPQQLSGPL 2114
Cdd:PHA03247 2885 -------RLARPAVSR-STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2115 PAPLYSFPGASCP----VLDLRRPPSDLYLPPPDHGTPARgsphseggKRSPEPSKTSVLGSSEDAIEPVSPPEGMTEPG 2190
Cdd:PHA03247 2957 GAVPQPWLGALVPgrvaVPRFRVPQPAPSREAPASSTPPL--------TGHSLSRVSSWASSLALHEETDPPPVSLKQTL 3028
                         490       500       510
                  ....*....|....*....|....*....|....*....
gi 755514106 2191 HA-----RSTAYPLLYRDGEQGEPRMGSKSPGNTSQPPA 2224
Cdd:PHA03247 3029 WPpddteDSDADSLFDSDSERSDLEALDPLPPEPHDPFA 3067
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
432-471 8.88e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 44.80  E-value: 8.88e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 755514106   432 WSEQERDTFREKFMQHPKNFGLIASFLERKTVAECVLYYY 471
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQ 43
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
432-474 7.01e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 42.18  E-value: 7.01e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 755514106  432 WSEQERDTFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTK 474
Cdd:cd00167     2 WTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
161-411 1.40e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.36  E-value: 1.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   161 ELAPSRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKppepekpvspppIESKHRSLVQIIYDENRKKAEA 240
Cdd:TIGR02168  240 ELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE------------LQKELYALANEISRLEQQKQIL 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   241 AHRIleglgpqvelplynqpsdtRQYHENIKINQAMRKKLilyFKRRNHARK---QWEQRFCQ---RYDQLMEAWEKKVE 314
Cdd:TIGR02168  308 RERL-------------------ANLERQLEELEAQLEEL---ESKLDELAEelaELEEKLEElkeELESLEAELEELEA 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   315 RIENNPRRRAKESKVREYYEKQFPEIRKQRELQERMQSRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEnLEKQMRQLAV 394
Cdd:TIGR02168  366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE-LKELQAELEE 444
                          250
                   ....*....|....*..
gi 755514106   395 IPPMLYDADQQRIKFIN 411
Cdd:TIGR02168  445 LEEELEELQEELERLEE 461
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
432-475 2.08e-04

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 41.06  E-value: 2.08e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 755514106    432 WSEQERDTFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTKK 475
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLK 48
RSC8 COG5259
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ...
535-645 2.93e-04

RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];


Pssm-ID: 227584 [Multi-domain]  Cd Length: 531  Bit Score: 46.03  E-value: 2.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  535 ENEKEELSKEKTDDTSGEDNDEKEAVASKGRKTANS----QGRRKGRITrSMANEANHEETatPQQSSELASMEMNESSR 610
Cdd:COG5259   202 LKSPKKESQGKVDELKDHSEKHPSSCSCCGNKSFNTryhnLRAEKYNSC-SECYDQGRFPS--EFTSSDFKPVTISLLIR 278
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 755514106  611 ---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 645
Cdd:COG5259   279 dknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
PTZ00121 PTZ00121
MAEBL; Provisional
169-627 2.96e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 46.67  E-value: 2.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  169 KEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEA-AKPPEPEKPVSPPPIESKHRSLVQIIYDENRKKAEAAHRILEG 247
Cdd:PTZ00121 1310 KAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAeAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  248 LGPQVELPlyNQPSDTRQYHENIKINQAMRKKLILYFKRRNHARKQWEQRFCQRYDQLMEAWEKKVE--RIENNPRRRAK 325
Cdd:PTZ00121 1390 KKKADEAK--KKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEeaKKAEEAKKKAE 1467
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  326 ESKVREYYEKQFPEIRKQRELQERMQSrvGQRGSGLSMSAARSEHEVSEIIDGLSEQENLEKQMRQLAVIPPMLYDADQQ 405
Cdd:PTZ00121 1468 EAKKADEAKKKAEEAKKADEAKKKAEE--AKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEK 1545
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  406 RikfinmnglmdDPMKVYKDRQVTNmwSEQERDTFREKFMQHPKNFGL----IASFLERKTVAECVLYYYLTKKNENYKS 481
Cdd:PTZ00121 1546 K-----------KADELKKAEELKK--AEEKKKAEEAKKAEEDKNMALrkaeEAKKAEEARIEEVMKLYEEEKKMKAEEA 1612
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  482 LVRRSYRRRGKSQQQQQQQQQQQQQQMARSSQEEKEEKEKEKEADKEEEKQDAENEKEELSKEKTDDTSGEDNDEKEAVA 561
Cdd:PTZ00121 1613 KKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAE 1692
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755514106  562 SKGRKTanSQGRRKGRITRSMANEANHEETAtpQQSSELASMEMNESSRWTEEEMETAKKGLLEHG 627
Cdd:PTZ00121 1693 ALKKEA--EEAKKAEELKKKEAEEKKKAEEL--KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEE 1754
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
168-458 4.23e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.83  E-value: 4.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   168 SKEELIQNMDRV---DREITMVEQQISKLKKKQQQLEEEAAKPPE----PEKPVSPPPIESKHRSLVQIiyDENRKKAEA 240
Cdd:TIGR02169  735 LKERLEELEEDLsslEQEIENVKSELKELEARIEELEEDLHKLEEalndLEARLSHSRIPEIQAELSKL--EEEVSRIEA 812
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   241 AHRILEGlgpqvelplynqpsdtrqyheniKINQAMRKKLILYFKRRNHARKQ--WEQR---FCQRYDQLMEAWEKKVER 315
Cdd:TIGR02169  813 RLREIEQ-----------------------KLNRLTLEKEYLEKEIQELQEQRidLKEQiksIEKEIENLNGKKEELEEE 869
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   316 IENnprrraKESKVREyYEKQFPEIRKQR-ELQERM---QSRVGQrgsgLSMSAARSEHEVSEIIDGLSEQEN----LEK 387
Cdd:TIGR02169  870 LEE------LEAALRD-LESRLGDLKKERdELEAQLrelERKIEE----LEAQIEKKRKRLSELKAKLEALEEelseIED 938
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106   388 QMRQLAVIPPMLYDADQ------------QRIKFINM-------------NGLMDDPMKVYKDR----QVTNMWSEQERD 438
Cdd:TIGR02169  939 PKGEDEEIPEEELSLEDvqaelqrveeeiRALEPVNMlaiqeyeevlkrlDELKEKRAKLEEERkailERIEEYEKKKRE 1018
                          330       340
                   ....*....|....*....|
gi 755514106   439 TFREKFMQHPKNFGLIASFL 458
Cdd:TIGR02169 1019 VFMEAFEAINENFNEIFAEL 1038
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
611-652 5.55e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 39.99  E-value: 5.55e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 755514106   611 WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNY 652
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYGNDWKQIAKELGRRTPKQCFDRWRRK 42
PHA03247 PHA03247
large tegument protein UL36; Provisional
1733-2164 5.91e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.70  E-value: 5.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1733 PTAPPPFSSRHSSSPLSPGGPTHLAKPTATSSSERERERERERDKSILTSTTtvehAPIWRPGTEQSSGAGGSSRPASHT 1812
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPA----APAPPAVPAGPATPGGPARPARPP 2761
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1813 HQHSPISPRT--------QDALQQRPSVLHNTSMKGVVTSVEPGTPTVLRWARSTSTSSPVRPAATFPPathcplggtle 1884
Cdd:PHA03247 2762 TTAGPPAPAPpaapaagpPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPP----------- 2830
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1885 gvyPTLMEPVLLPKETSRVARPERP--RVDAGHAFLTKPPAREPASSPSKSSEP--RSLAPPssshtAIARTPAKNLAPh 1960
Cdd:PHA03247 2831 ---PTSAQPTAPPPPPGPPPPSLPLggSVAPGGDVRRRPPSRSPAAKPAAPARPpvRRLARP-----AVSRSTESFALP- 2901
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1961 hasPDPPAPTSASDLHREKTQSKPFSIQElelrslgyhsgagySPDGVEPISPVSSPSLTHDKGlSKPLEELEKSHLEGE 2040
Cdd:PHA03247 2902 ---PDQPERPPQPQAPPPPQPQPQPPPPP--------------QPQPPPPPPPRPQPPLAPTTD-PAGAGEPSGAVPQPW 2963
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2041 LRHKQPGpmKLSAEAAHLPHLRPLPESQPSSSPLLQTAPGIKGHQRVVTLAQHISEV---ITQDYTRHHPQQLSGP-LPA 2116
Cdd:PHA03247 2964 LGALVPG--RVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDpppVSLKQTLWPPDDTEDSdADS 3041
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 755514106 2117 PLYSFPGAS-CPVLD-LRRPPSDLYLPPPDHGTPARGSPHSEGGKRSPEP 2164
Cdd:PHA03247 3042 LFDSDSERSdLEALDpLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPP 3091
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1716-1974 3.48e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 3.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1716 PPTPGTPATAIDRLAYLPTAPPPFSSRHSSSPLSPGGPTHLAKPTATSSSERERERERERDKSILTSTTTVEHAPIWRPG 1795
Cdd:PHA03307  111 PSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPP 190
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1796 TEQSSGAGGSSRPASHTHQHSPISPRTQDAlQQRPSVLHNTSMKGVVTSVEPGTPTVLRWARSTSTSSPVRPAATFPPAT 1875
Cdd:PHA03307  191 AEPPPSTPPAAASPRPPRRSSPISASASSP-APAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRI 269
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1876 HCPLGGTLEGVYPTLMEPVLLPKETSRVARPERPRvdaGHAFLTKPPAREPASSPSKSSEPRSLAPPSSSHTAIARTPAk 1955
Cdd:PHA03307  270 WEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPG---SGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGP- 345
                         250
                  ....*....|....*....
gi 755514106 1956 nlaPHHASPDPPAPTSASD 1974
Cdd:PHA03307  346 ---SPSRSPSPSRPPPPAD 361
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1830-2205 4.52e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 4.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1830 PSVLHNTSMKGVVTSVEPGTPTVLRWARSTSTSSPVRPAATFPPATHCPlGGTLEGVYPTLMEPVLLPKETSRVARPERP 1909
Cdd:PHA03307   75 PGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP-ASPPPSPAPDLSEMLRPVGSPGPPPAASPP 153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1910 RVDAGHAFLTKPPAREP-ASSPSKSSEPRSLAPPSSSHTAIARTPaknlaPHHASPDPPAPTSASDLHREKTQSKPFSIQ 1988
Cdd:PHA03307  154 AAGASPAAVASDAASSRqAALPLSSPEETARAPSSPPAEPPPSTP-----PAAASPRPPRRSSPISASASSPAPAPGRSA 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1989 ELELR---SLGYHSGAGYSPDGVEPISPVSSPSLTHDKGLskpleelEKSHLEGELRHKQPGPmklSAEAAHLPHLRPLP 2065
Cdd:PHA03307  229 ADDAGassSDSSSSESSGCGWGPENECPLPRPAPITLPTR-------IWEASGWNGPSSRPGP---ASSSSSPRERSPSP 298
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 2066 ESQPSSSPLLQTAPGIKGHQrvvtlaqhisevitqdyTRHHPQQLSGPLPAPLYSFPGASCPVLDLRRPPSDLYLPPPDH 2145
Cdd:PHA03307  299 SPSSPGSGPAPSSPRASSSS-----------------SSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPAD 361
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 755514106 2146 GTPARGSPHSEGGKRSPEPSKTSVL------------------GSSEDAIEPVSPPEGMTEPGhARSTAYPLLYRDGE 2205
Cdd:PHA03307  362 PSSPRKRPRPSRAPSSPAASAGRPTrrraraavagrarrrdatGRFPAGRPRPSPLDAGAASG-AFYARYPLLTPSGE 438
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1696-1971 8.15e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 8.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1696 AAGPRGIIDLSQVPHLPVLVPPTPG-TPATAIDRLAYLPTAPPPFSS-RHSSSPLSPGGPTHLAKPTATSSSERERERER 1773
Cdd:PHA03307  166 AASSRQAALPLSSPEETARAPSSPPaEPPPSTPPAAASPRPPRRSSPiSASASSPAPAPGRSAADDAGASSSDSSSSESS 245
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1774 ERDKSILTSTTTVEHAPIWRPGTE-QSSGAGGSSRPASHTHQHSPISPRTQDALQQRPSVLHNTSMKGVVTSVEPGTPTV 1852
Cdd:PHA03307  246 GCGWGPENECPLPRPAPITLPTRIwEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS 325
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1853 LrwARSTSTSSPVRPAATFPPATHCPLGGTLEGvyptlmepvllPKETSRVARPERPRVDAGHAFLTKPPAREPASSPSK 1932
Cdd:PHA03307  326 S--SSTSSSSESSRGAAVSPGPSPSRSPSPSRP-----------PPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARA 392
                         250       260       270
                  ....*....|....*....|....*....|....*....
gi 755514106 1933 SSEPRSLAPPSSSHTAIARTPAKNLAPHHASPDPPAPTS 1971
Cdd:PHA03307  393 AVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYP 431
PHA03247 PHA03247
large tegument protein UL36; Provisional
946-1203 9.03e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 9.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106  946 PAGDPRASTSPQK-PLDLKQLKQRAAAIPPIVTKVHEPPREDTVPPKPVPPVPPPTQHLQPEGDVSQQSGGSPRGKSRSP 1024
Cdd:PHA03247 2604 DRGDPRGPAPPSPlPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRP 2683
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1025 VPPAEKEAEKPAFFPAFPTEGPKLPTEPPR-WSSGLPFPIPPREVIKTSPHAADPSAFSYTPPGHPLPLGLHDSARPVLP 1103
Cdd:PHA03247 2684 RRRAARPTVGSLTSLADPPPPPPTPEPAPHaLVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTT 2763
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755514106 1104 RPPISNPPPLISSAKHPGVLERQLGAISQQGMSVQLRVPHSEHAKAPMGPLTMGLPLAVDPkklgtALGSATSGSITKGL 1183
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP-----AGPLPPPTSAQPTA 2838
                         250       260
                  ....*....|....*....|...
gi 755514106 1184 PSTRAADGPSYR---GSITHGTP 1203
Cdd:PHA03247 2839 PPPPPGPPPPSLplgGSVAPGGD 2861
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH