NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1622847431|ref|XP_028685955|]
View 

nuclear receptor corepressor 2 isoform X24 [Macaca mulatta]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
GPS2_interact pfam15784
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ...
66-154 2.90e-41

G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.


:

Pssm-ID: 464868 [Multi-domain]  Cd Length: 89  Bit Score: 147.31  E-value: 2.90e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   66 LTGKLEPVSPPSPPHTDPELELVPPRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIE 145
Cdd:pfam15784    1 YYPQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSE 80

                   ....*....
gi 1622847431  146 SKHRSLVQI 154
Cdd:pfam15784   81 SKHRSLAQI 89
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
531-574 7.56e-13

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


:

Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 64.83  E-value: 7.56e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1622847431  531 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 574
Cdd:pfam00249    3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
SANT super family cl21498
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
357-400 1.07e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


The actual alignment was detected with superfamily member cd11661:

Pssm-ID: 473887 [Multi-domain]  Cd Length: 46  Bit Score: 47.22  E-value: 1.07e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1622847431  357 WSEQEKETFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 400
Cdd:cd11661      2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
PHA03247 super family cl33720
large tegument protein UL36; Provisional
828-1151 1.09e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  828 APQDSDSSATCSADEVDEPEggDKNRLLSPRpsllTPTGDPRANASPQ------KPLDLKQLKQRAAAIPPIQVTKVHEP 901
Cdd:PHA03247  2574 APRPSEPAVTSRARRPDAPP--QSARPRAPV----DDRGDPRGPAPPSplppdtHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  902 PREDAAPTKPAPPAPPPPQHLQPESdaPQQPGSSPRGksrsPAPAADKEAEKPVFFPAFAAEAQKLPGD-PPCWTSGLPF 980
Cdd:PHA03247  2648 PPERPRDDPAPGRVSRPRRARRLGR--AAQASSPPQR----PRRRAARPTVGSLTSLADPPPPPPTPEPaPHALVSATPL 2721
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  981 PVPPREVIKASPHAPDPSAFSYAPPGHPLPLGLHDTARPVLPRPPTISNPPPLISSAKHPSVLERQIGAISQGMSVQLHV 1060
Cdd:PHA03247  2722 PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1061 PYSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQeQLSPRGQAGPPESlgvPTAQEASVlrgtalgsVPGGSITKGIPST 1140
Cdd:PHA03247  2802 WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPP---SLPLGGSV--------APGGDVRRRPPSR 2869
                          330
                   ....*....|.
gi 1622847431 1141 RVPSDSAITYR 1151
Cdd:PHA03247  2870 SPAAKPAAPAR 2880
SMC_N super family cl47134
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
25-383 8.49e-06

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


The actual alignment was detected with superfamily member TIGR02169:

Pssm-ID: 481474 [Multi-domain]  Cd Length: 1164  Bit Score: 51.61  E-value: 8.49e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   25 MEFIESKRP-----RLELLPDPLLRPSPLLAAGQPAGSEDLTKDRSLTGKLEPVSppsppHTDPELELVPPRLSKE--EL 97
Cdd:TIGR02169  626 VEDIEAARRlmgkyRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEPAELQRLR-----ERLEGLKRELSSLQSElrRI 700
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   98 IQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPvspppIESKHRSLVQIIYDENRKKAEAAHRI--LEGLGP 175
Cdd:TIGR02169  701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE-----LEEDLSSLEQEIENVKSELKELEARIeeLEEDLH 775
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  176 QVELPLyNQPSDtRQYHENIKINQAMRKKLILYFKR-------------RNHARKQWEQKFCQRYDQLMEAWEKKV---- 238
Cdd:TIGR02169  776 KLEEAL-NDLEA-RLSHSRIPEIQAELSKLEEEVSRiearlreieqklnRLTLEKEYLEKEIQELQEQRIDLKEQIksie 853
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  239 ERIEN-NPRRRAKESKVREY------YEKQFPEIRKQR-ELQERM---QSRVGQrgsgLSMSAARSEHEVSEIIDGLSEQ 307
Cdd:TIGR02169  854 KEIENlNGKKEELEEELEELeaalrdLESRLGDLKKERdELEAQLrelERKIEE----LEAQIEKKRKRLSELKAKLEAL 929
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  308 EN----LEKQMRQLAVIPPMLYDADQ------------QRIKFINM-------------NGLMADPMKVYKDR----QVM 354
Cdd:TIGR02169  930 EEelseIEDPKGEDEEIPEEELSLEDvqaelqrveeeiRALEPVNMlaiqeyeevlkrlDELKEKRAKLEEERkailERI 1009
                          410       420
                   ....*....|....*....|....*....
gi 1622847431  355 NMWSEQEKETFREKFMQHPKNFGLIASFL 383
Cdd:TIGR02169 1010 EEYEKKKREVFMEAFEAINENFNEIFAEL 1038
RSC8 super family cl34960
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ...
458-566 6.23e-04

RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];


The actual alignment was detected with superfamily member COG5259:

Pssm-ID: 227584 [Multi-domain]  Cd Length: 531  Bit Score: 44.88  E-value: 6.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  458 DKEDLLKEKTDDTSGEDNDEKEAVASKGRKTANSQGRrKGRITRSMANEANS--------EEAITPQ--QSAELASMELN 527
Cdd:COG5259    196 ENYSPSLKSPKKESQGKVDELKDHSEKHPSSCSCCGN-KSFNTRYHNLRAEKynscsecyDQGRFPSefTSSDFKPVTIS 274
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1622847431  528 ESSR---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 566
Cdd:COG5259    275 LLIRdknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1673-2190 7.44e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 7.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1673 PGTPATAMDRLAYLPTAPQPFSSRHSSSPLSPGGPTHLTKPTTTsssererdrdrerdrdrereksILTSSTTVEHAP-- 1750
Cdd:PHA03247  2475 PGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPA----------------------ILPDEPVGEPVHpr 2532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1751 --IWRPGTEQ-SSGSSGSSGGGGGSSSRPASHSHAHQHSPISPRTQD----ALQQRPSVLHNTGMKGiiTAVEPSTPTVL 1823
Cdd:PHA03247  2533 mlTWIRGLEElASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEpavtSRARRPDAPPQSARPR--APVDDRGDPRG 2610
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1824 RSTSTSSPVRPAATFPPA-THCPLGGTLDGVYPTLMEPVLLPKEAPRVARPERPRAdtghaflAKPPARSGlePASSPSK 1902
Cdd:PHA03247  2611 PAPPSPLPPDTHAPDPPPpSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-------ARRLGRAA--QASSPPQ 2681
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1903 GSEPRPLAPPVSGHATIARTPAKNLAPHHASPDPPAPPASASDPHREKTQSKPFSIQEL-------------ELRSLGYH 1969
Cdd:PHA03247  2682 RPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAppavpagpatpggPARPARPP 2761
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1970 GSSYSPEGVEPVSPVSSPSLTHDKGLPKHLEELDKShLEGELRPKQPGPVKLGGEAAHLPHLRPLPENQPSSSPlLQTAP 2049
Cdd:PHA03247  2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA-QPTAP 2839
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 2050 GVKGHQRVVTLAQHISEVITQDYTRHHPQQLSAPLPAPLYSFPGASCPVLDLRRPPSDLYLPP--------PDHGAPARG 2121
Cdd:PHA03247  2840 PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPdqperppqPQAPPPPQP 2919
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622847431 2122 SPNSEGGKRSPEPSKTSvlGGGEDGIEPVSPPEGVTEP---------GHSRSAVYPLLYRDGEQTEPSRMGSKSPGNT 2190
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPP--PRPQPPLAPTTDPAGAGEPsgavpqpwlGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
 
Name Accession Description Interval E-value
GPS2_interact pfam15784
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ...
66-154 2.90e-41

G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.


Pssm-ID: 464868 [Multi-domain]  Cd Length: 89  Bit Score: 147.31  E-value: 2.90e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   66 LTGKLEPVSPPSPPHTDPELELVPPRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIE 145
Cdd:pfam15784    1 YYPQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSE 80

                   ....*....
gi 1622847431  146 SKHRSLVQI 154
Cdd:pfam15784   81 SKHRSLAQI 89
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
531-574 7.56e-13

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 64.83  E-value: 7.56e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1622847431  531 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 574
Cdd:pfam00249    3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
531-576 1.32e-10

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 58.39  E-value: 1.32e-10
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1622847431   531 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYKKR 576
Cdd:smart00717    3 EWTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLKP 49
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
531-574 1.86e-10

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 57.97  E-value: 1.86e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1622847431  531 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYK 574
Cdd:cd00167      1 PWTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
357-400 1.07e-06

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 47.22  E-value: 1.07e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1622847431  357 WSEQEKETFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 400
Cdd:cd11661      2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
PHA03247 PHA03247
large tegument protein UL36; Provisional
828-1151 1.09e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  828 APQDSDSSATCSADEVDEPEggDKNRLLSPRpsllTPTGDPRANASPQ------KPLDLKQLKQRAAAIPPIQVTKVHEP 901
Cdd:PHA03247  2574 APRPSEPAVTSRARRPDAPP--QSARPRAPV----DDRGDPRGPAPPSplppdtHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  902 PREDAAPTKPAPPAPPPPQHLQPESdaPQQPGSSPRGksrsPAPAADKEAEKPVFFPAFAAEAQKLPGD-PPCWTSGLPF 980
Cdd:PHA03247  2648 PPERPRDDPAPGRVSRPRRARRLGR--AAQASSPPQR----PRRRAARPTVGSLTSLADPPPPPPTPEPaPHALVSATPL 2721
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  981 PVPPREVIKASPHAPDPSAFSYAPPGHPLPLGLHDTARPVLPRPPTISNPPPLISSAKHPSVLERQIGAISQGMSVQLHV 1060
Cdd:PHA03247  2722 PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1061 PYSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQeQLSPRGQAGPPESlgvPTAQEASVlrgtalgsVPGGSITKGIPST 1140
Cdd:PHA03247  2802 WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPP---SLPLGGSV--------APGGDVRRRPPSR 2869
                          330
                   ....*....|.
gi 1622847431 1141 RVPSDSAITYR 1151
Cdd:PHA03247  2870 SPAAKPAAPAR 2880
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
357-396 2.75e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 45.96  E-value: 2.75e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1622847431  357 WSEQEKETFREKFMQHPKNFGLIASFLERKTVAECVLYYY 396
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQ 43
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
25-383 8.49e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 51.61  E-value: 8.49e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   25 MEFIESKRP-----RLELLPDPLLRPSPLLAAGQPAGSEDLTKDRSLTGKLEPVSppsppHTDPELELVPPRLSKE--EL 97
Cdd:TIGR02169  626 VEDIEAARRlmgkyRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEPAELQRLR-----ERLEGLKRELSSLQSElrRI 700
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   98 IQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPvspppIESKHRSLVQIIYDENRKKAEAAHRI--LEGLGP 175
Cdd:TIGR02169  701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE-----LEEDLSSLEQEIENVKSELKELEARIeeLEEDLH 775
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  176 QVELPLyNQPSDtRQYHENIKINQAMRKKLILYFKR-------------RNHARKQWEQKFCQRYDQLMEAWEKKV---- 238
Cdd:TIGR02169  776 KLEEAL-NDLEA-RLSHSRIPEIQAELSKLEEEVSRiearlreieqklnRLTLEKEYLEKEIQELQEQRIDLKEQIksie 853
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  239 ERIEN-NPRRRAKESKVREY------YEKQFPEIRKQR-ELQERM---QSRVGQrgsgLSMSAARSEHEVSEIIDGLSEQ 307
Cdd:TIGR02169  854 KEIENlNGKKEELEEELEELeaalrdLESRLGDLKKERdELEAQLrelERKIEE----LEAQIEKKRKRLSELKAKLEAL 929
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  308 EN----LEKQMRQLAVIPPMLYDADQ------------QRIKFINM-------------NGLMADPMKVYKDR----QVM 354
Cdd:TIGR02169  930 EEelseIEDPKGEDEEIPEEELSLEDvqaelqrveeeiRALEPVNMlaiqeyeevlkrlDELKEKRAKLEEERkailERI 1009
                          410       420
                   ....*....|....*....|....*....
gi 1622847431  355 NMWSEQEKETFREKFMQHPKNFGLIASFL 383
Cdd:TIGR02169 1010 EEYEKKKREVFMEAFEAINENFNEIFAEL 1038
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
357-400 7.92e-05

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 42.21  E-value: 7.92e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 1622847431   357 WSEQEKETFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTKK 400
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLK 48
RSC8 COG5259
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ...
458-566 6.23e-04

RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];


Pssm-ID: 227584 [Multi-domain]  Cd Length: 531  Bit Score: 44.88  E-value: 6.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  458 DKEDLLKEKTDDTSGEDNDEKEAVASKGRKTANSQGRrKGRITRSMANEANS--------EEAITPQ--QSAELASMELN 527
Cdd:COG5259    196 ENYSPSLKSPKKESQGKVDELKDHSEKHPSSCSCCGN-KSFNTRYHNLRAEKynscsecyDQGRFPSefTSSDFKPVTIS 274
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1622847431  528 ESSR---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 566
Cdd:COG5259    275 LLIRdknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
PHA03247 PHA03247
large tegument protein UL36; Provisional
1673-2190 7.44e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 7.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1673 PGTPATAMDRLAYLPTAPQPFSSRHSSSPLSPGGPTHLTKPTTTsssererdrdrerdrdrereksILTSSTTVEHAP-- 1750
Cdd:PHA03247  2475 PGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPA----------------------ILPDEPVGEPVHpr 2532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1751 --IWRPGTEQ-SSGSSGSSGGGGGSSSRPASHSHAHQHSPISPRTQD----ALQQRPSVLHNTGMKGiiTAVEPSTPTVL 1823
Cdd:PHA03247  2533 mlTWIRGLEElASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEpavtSRARRPDAPPQSARPR--APVDDRGDPRG 2610
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1824 RSTSTSSPVRPAATFPPA-THCPLGGTLDGVYPTLMEPVLLPKEAPRVARPERPRAdtghaflAKPPARSGlePASSPSK 1902
Cdd:PHA03247  2611 PAPPSPLPPDTHAPDPPPpSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-------ARRLGRAA--QASSPPQ 2681
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1903 GSEPRPLAPPVSGHATIARTPAKNLAPHHASPDPPAPPASASDPHREKTQSKPFSIQEL-------------ELRSLGYH 1969
Cdd:PHA03247  2682 RPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAppavpagpatpggPARPARPP 2761
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1970 GSSYSPEGVEPVSPVSSPSLTHDKGLPKHLEELDKShLEGELRPKQPGPVKLGGEAAHLPHLRPLPENQPSSSPlLQTAP 2049
Cdd:PHA03247  2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA-QPTAP 2839
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 2050 GVKGHQRVVTLAQHISEVITQDYTRHHPQQLSAPLPAPLYSFPGASCPVLDLRRPPSDLYLPP--------PDHGAPARG 2121
Cdd:PHA03247  2840 PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPdqperppqPQAPPPPQP 2919
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622847431 2122 SPNSEGGKRSPEPSKTSvlGGGEDGIEPVSPPEGVTEP---------GHSRSAVYPLLYRDGEQTEPSRMGSKSPGNT 2190
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPP--PRPQPPLAPTTDPAGAGEPsgavpqpwlGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
 
Name Accession Description Interval E-value
GPS2_interact pfam15784
G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain ...
66-154 2.90e-41

G-protein pathway suppressor 2-interacting domain; GPS2_interact is the more N-terminal domain of two co-repressor protein-families found in vertebrates. The domain is found in NCoR and SMRT proteins; N-CoR (nuclear receptor co-repressor) and SMRT (silencing mediator for retinoid and thyroid receptors) are related corepressors that mediate transcriptional repression by unliganded nuclear receptors and other classes of transcriptional repressors. GPS2 is a stoichiometric subunit of the N-CoR-HDAC3 complex. GPS2 links the complex to membrane receptor-related intracellular JNK (c-Jun amino-terminal kinase) signalling pathways.


Pssm-ID: 464868 [Multi-domain]  Cd Length: 89  Bit Score: 147.31  E-value: 2.90e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   66 LTGKLEPVSPPSPPHTDPELELVPPRLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPVSPPPIE 145
Cdd:pfam15784    1 YYPQVEAISPTLPSPEGQDQELSPFRSSKDELLQNIDKVDREIAKVEQQISKLKKKQQQLEEEAAKPPEPEEPVSPPPSE 80

                   ....*....
gi 1622847431  146 SKHRSLVQI 154
Cdd:pfam15784   81 SKHRSLAQI 89
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
531-574 7.56e-13

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 64.83  E-value: 7.56e-13
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1622847431  531 RWTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNYK 574
Cdd:pfam00249    3 PWTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQNYL 46
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
531-576 1.32e-10

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 58.39  E-value: 1.32e-10
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1622847431   531 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYKKR 576
Cdd:smart00717    3 EWTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLKP 49
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
531-574 1.86e-10

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 57.97  E-value: 1.86e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1622847431  531 RWTEEEMETAKKGLLEHG-RNWSAIARMVGSKTVSQCKNFYFNYK 574
Cdd:cd00167      1 PWTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
357-400 1.07e-06

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 47.22  E-value: 1.07e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1622847431  357 WSEQEKETFREKFMQHPKNFGLI-ASFLERKTVAECVLYYYLTKK 400
Cdd:cd11661      2 WSESEAKLFEEGLRKYGKDFHDIrQDFLPWKSVGELVEFYYMWKK 46
PHA03247 PHA03247
large tegument protein UL36; Provisional
828-1151 1.09e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  828 APQDSDSSATCSADEVDEPEggDKNRLLSPRpsllTPTGDPRANASPQ------KPLDLKQLKQRAAAIPPIQVTKVHEP 901
Cdd:PHA03247  2574 APRPSEPAVTSRARRPDAPP--QSARPRAPV----DDRGDPRGPAPPSplppdtHAPDPPPPSPSPAANEPDPHPPPTVP 2647
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  902 PREDAAPTKPAPPAPPPPQHLQPESdaPQQPGSSPRGksrsPAPAADKEAEKPVFFPAFAAEAQKLPGD-PPCWTSGLPF 980
Cdd:PHA03247  2648 PPERPRDDPAPGRVSRPRRARRLGR--AAQASSPPQR----PRRRAARPTVGSLTSLADPPPPPPTPEPaPHALVSATPL 2721
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  981 PVPPREVIKASPHAPDPSAFSYAPPGHPLPLGLHDTARPVLPRPPTISNPPPLISSAKHPSVLERQIGAISQGMSVQLHV 1060
Cdd:PHA03247  2722 PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1061 PYSEHAKAPVGPVTMGLPLPMDPKKLAPFSGVKQeQLSPRGQAGPPESlgvPTAQEASVlrgtalgsVPGGSITKGIPST 1140
Cdd:PHA03247  2802 WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ-PTAPPPPPGPPPP---SLPLGGSV--------APGGDVRRRPPSR 2869
                          330
                   ....*....|.
gi 1622847431 1141 RVPSDSAITYR 1151
Cdd:PHA03247  2870 SPAAKPAAPAR 2880
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
357-396 2.75e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 45.96  E-value: 2.75e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 1622847431  357 WSEQEKETFREKFMQHPKNFGLIASFLERKTVAECVLYYY 396
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKIAKLLPGRTDNQCKNRWQ 43
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
778-1052 5.28e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 52.00  E-value: 5.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  778 PIKSECTEQaEEGPAKGKDAEASEATAEGalkaeKKEGGSGRATTAKGSGAPQDSDSSATCSADEVDEPEGGDKNRLLSP 857
Cdd:PTZ00449   498 PIEEEDSDK-HDEPPEGPEASGLPPKAPG-----DKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSK 571
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  858 RPSLLTPTGDPRANASPQKPLDLKQLKQRAAAIPPIQVTKVHEPPREDAAPTKPAPPAPPPPQHLQPesdaPQQPGSSPR 937
Cdd:PTZ00449   572 IPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPP----PQRPSSPER 647
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  938 GKS-RSPAPAADKEAEKPVFFPAFAAE--------AQKLPGDPPCWTSGLPFPVPPREVIKASPHAPDPSAfsyappgHP 1008
Cdd:PTZ00449   648 PEGpKIIKSPKPPKSPKPPFDPKFKEKfyddyldaAAKSKETKTTVVLDESFESILKETLPETPGTPFTTP-------RP 720
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 1622847431 1009 LPlglhdtarPVLPRPPTISNPPPLISSAKHPSVLERQIGAISQ 1052
Cdd:PTZ00449   721 LP--------PKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEE 756
PHA03247 PHA03247
large tegument protein UL36; Provisional
675-1144 7.24e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 7.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  675 AAKDTGqnGPQPPAtqSTDGPPPEPPTPPPEDIPAPTEPTPASEATGPPTPPPAPPSPSVPPPVVPKEEKEEEAAAVSPV 754
Cdd:PHA03247  2544 ASDDAG--DPPPPL--PPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPP 2619
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  755 EEGEEQKPPAAEELAVDTGKAEEPIKSECTEQAEEGPAKGKDAEASEATAEGalkaekkeggsgRATTAkgSGAPQDSDS 834
Cdd:PHA03247  2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG------------RAAQA--SSPPQRPRR 2685
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  835 SATcsadevdEPEGGDKNRLLSPRPSLLTPTGDPRAnASPQKPLDLKQLKQRAAAiPPIQVTKVHEPPREDAAPTKPAPP 914
Cdd:PHA03247  2686 RAA-------RPTVGSLTSLADPPPPPPTPEPAPHA-LVSATPLPPGPAAARQAS-PALPAAPAPPAVPAGPATPGGPAR 2756
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  915 APPPPQHLQPESDAPqqpgssPRGKSRSPAPAADKEAEKPVFFPAFAAEAQKLPGDPPCWTSGlPFPVPPREVIKASPHA 994
Cdd:PHA03247  2757 PARPPTTAGPPAPAP------PAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLA-PAAALPPAASPAGPLP 2829
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  995 PDPSAFSYAPPGHPLPL-------GLHDTARPVLPRPPTISnPPPLISSAKHPsvlerqigaisqgmsvqlhvPYSEHAK 1067
Cdd:PHA03247  2830 PPTSAQPTAPPPPPGPPppslplgGSVAPGGDVRRRPPSRS-PAAKPAAPARP--------------------PVRRLAR 2888
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1068 APVGPVTMGLPLPMD-------------PKKLAPFSGVKQEQLSPRGQAGPPESLGvPTAQEASVlrGTALGSVPG---G 1131
Cdd:PHA03247  2889 PAVSRSTESFALPPDqperppqpqapppPQPQPQPPPPPQPQPPPPPPPRPQPPLA-PTTDPAGA--GEPSGAVPQpwlG 2965
                          490
                   ....*....|....*.
gi 1622847431 1132 SITKG---IPSTRVPS 1144
Cdd:PHA03247  2966 ALVPGrvaVPRFRVPQ 2981
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
25-383 8.49e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 51.61  E-value: 8.49e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   25 MEFIESKRP-----RLELLPDPLLRPSPLLAAGQPAGSEDLTKDRSLTGKLEPVSppsppHTDPELELVPPRLSKE--EL 97
Cdd:TIGR02169  626 VEDIEAARRlmgkyRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEPAELQRLR-----ERLEGLKRELSSLQSElrRI 700
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   98 IQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKPPEPEKPvspppIESKHRSLVQIIYDENRKKAEAAHRI--LEGLGP 175
Cdd:TIGR02169  701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE-----LEEDLSSLEQEIENVKSELKELEARIeeLEEDLH 775
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  176 QVELPLyNQPSDtRQYHENIKINQAMRKKLILYFKR-------------RNHARKQWEQKFCQRYDQLMEAWEKKV---- 238
Cdd:TIGR02169  776 KLEEAL-NDLEA-RLSHSRIPEIQAELSKLEEEVSRiearlreieqklnRLTLEKEYLEKEIQELQEQRIDLKEQIksie 853
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  239 ERIEN-NPRRRAKESKVREY------YEKQFPEIRKQR-ELQERM---QSRVGQrgsgLSMSAARSEHEVSEIIDGLSEQ 307
Cdd:TIGR02169  854 KEIENlNGKKEELEEELEELeaalrdLESRLGDLKKERdELEAQLrelERKIEE----LEAQIEKKRKRLSELKAKLEAL 929
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  308 EN----LEKQMRQLAVIPPMLYDADQ------------QRIKFINM-------------NGLMADPMKVYKDR----QVM 354
Cdd:TIGR02169  930 EEelseIEDPKGEDEEIPEEELSLEDvqaelqrveeeiRALEPVNMlaiqeyeevlkrlDELKEKRAKLEEERkailERI 1009
                          410       420
                   ....*....|....*....|....*....
gi 1622847431  355 NMWSEQEKETFREKFMQHPKNFGLIASFL 383
Cdd:TIGR02169 1010 EEYEKKKREVFMEAFEAINENFNEIFAEL 1038
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
357-399 2.40e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 43.33  E-value: 2.40e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1622847431  357 WSEQEKETFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTK 399
Cdd:cd00167      2 WTEEEDELLLEAVKKYGkNNWEKIAKELPGRTPKQCRERWRNLL 45
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
357-400 7.92e-05

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 42.21  E-value: 7.92e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 1622847431   357 WSEQEKETFREKFMQHP-KNFGLIASFLERKTVAECVLYYYLTKK 400
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKELPGRTAEQCRERWRNLLK 48
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
532-573 5.48e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 39.99  E-value: 5.48e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1622847431  532 WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQCKNFYFNY 573
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYGNDWKQIAKELGRRTPKQCFDRWRRK 42
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
91-336 5.62e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 45.43  E-value: 5.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431   91 RLSKEELIQNMDRVDREITMVEQQISKLKKKQQQLEEEAAKppepekpvspppIESKHRSLVQIIYDENRKKAEAAHRIl 170
Cdd:TIGR02168  245 QEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE------------LQKELYALANEISRLEQQKQILRERL- 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  171 eglgpqvelplynqpsdtRQYHENIKINQAMRKKLilyFKRRNHARK---QWEQKFCQ---RYDQLMEAWEKKVERIENN 244
Cdd:TIGR02168  312 ------------------ANLERQLEELEAQLEEL---ESKLDELAEelaELEEKLEElkeELESLEAELEELEAELEEL 370
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  245 PRRRAKESKVREYYEKQFPEIRKQRELQERMQSRVGQRGSGLSMSAARSEHEVSEIIDGLSEQEnLEKQMRQLAVIPPML 324
Cdd:TIGR02168  371 ESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAE-LKELQAELEELEEEL 449
                          250
                   ....*....|..
gi 1622847431  325 YDADQQRIKFIN 336
Cdd:TIGR02168  450 EELQEELERLEE 461
RSC8 COG5259
RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / ...
458-566 6.23e-04

RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription];


Pssm-ID: 227584 [Multi-domain]  Cd Length: 531  Bit Score: 44.88  E-value: 6.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  458 DKEDLLKEKTDDTSGEDNDEKEAVASKGRKTANSQGRrKGRITRSMANEANS--------EEAITPQ--QSAELASMELN 527
Cdd:COG5259    196 ENYSPSLKSPKKESQGKVDELKDHSEKHPSSCSCCGN-KSFNTRYHNLRAEKynscsecyDQGRFPSefTSSDFKPVTIS 274
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1622847431  528 ESSR---WTEEEMETAKKGLLEHGRNWSAIARMVGSKTVSQC 566
Cdd:COG5259    275 LLIRdknWSRQELLLLLEGIEMYGDDWDKVARHVGTKTKEQC 316
PHA03247 PHA03247
large tegument protein UL36; Provisional
1673-2190 7.44e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 7.44e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1673 PGTPATAMDRLAYLPTAPQPFSSRHSSSPLSPGGPTHLTKPTTTsssererdrdrerdrdrereksILTSSTTVEHAP-- 1750
Cdd:PHA03247  2475 PGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPA----------------------ILPDEPVGEPVHpr 2532
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1751 --IWRPGTEQ-SSGSSGSSGGGGGSSSRPASHSHAHQHSPISPRTQD----ALQQRPSVLHNTGMKGiiTAVEPSTPTVL 1823
Cdd:PHA03247  2533 mlTWIRGLEElASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEpavtSRARRPDAPPQSARPR--APVDDRGDPRG 2610
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1824 RSTSTSSPVRPAATFPPA-THCPLGGTLDGVYPTLMEPVLLPKEAPRVARPERPRAdtghaflAKPPARSGlePASSPSK 1902
Cdd:PHA03247  2611 PAPPSPLPPDTHAPDPPPpSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-------ARRLGRAA--QASSPPQ 2681
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1903 GSEPRPLAPPVSGHATIARTPAKNLAPHHASPDPPAPPASASDPHREKTQSKPFSIQEL-------------ELRSLGYH 1969
Cdd:PHA03247  2682 RPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAppavpagpatpggPARPARPP 2761
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1970 GSSYSPEGVEPVSPVSSPSLTHDKGLPKHLEELDKShLEGELRPKQPGPVKLGGEAAHLPHLRPLPENQPSSSPlLQTAP 2049
Cdd:PHA03247  2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES-LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA-QPTAP 2839
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 2050 GVKGHQRVVTLAQHISEVITQDYTRHHPQQLSAPLPAPLYSFPGASCPVLDLRRPPSDLYLPP--------PDHGAPARG 2121
Cdd:PHA03247  2840 PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPdqperppqPQAPPPPQP 2919
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622847431 2122 SPNSEGGKRSPEPSKTSvlGGGEDGIEPVSPPEGVTEP---------GHSRSAVYPLLYRDGEQTEPSRMGSKSPGNT 2190
Cdd:PHA03247  2920 QPQPPPPPQPQPPPPPP--PRPQPPLAPTTDPAGAGEPsgavpqpwlGALVPGRVAVPRFRVPQPAPSREAPASSTPP 2995
PHA03247 PHA03247
large tegument protein UL36; Provisional
657-1106 2.78e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 2.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  657 ATVNNSSDTESIPSPRTEAAKDTGQNGPQPPATQSTDGPPPEPPTPPPEDIPAPTEPTPASEATgpPTPPPAPPSPSVPP 736
Cdd:PHA03247  2693 GSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPAR--PARPPTTAGPPAPA 2770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  737 PVVPKEEKEEEAAAvspveegeeqkPPAAEELAVDTGKAEEPIKSECTEQAEEGPAKGKDAEASEAtaegalkaekkeGG 816
Cdd:PHA03247  2771 PPAAPAAGPPRRLT-----------RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA------------GP 2827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  817 SGRATTAKGSGAPQDSDSSATCSADEVDEPEGGDKNRLLSPRPSLLTPTGDPRANAS----PQKPLDLKQLKQRAAAIPP 892
Cdd:PHA03247  2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRrlarPAVSRSTESFALPPDQPER 2907
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  893 IQVTKVHEPPREdaaptkpappappppqhlQPESDAPQQPGSSPRGKSRSPAPAAdkeaekPVFFPAFAAEAQklPGDPP 972
Cdd:PHA03247  2908 PPQPQAPPPPQP------------------QPQPPPPPQPQPPPPPPPRPQPPLA------PTTDPAGAGEPS--GAVPQ 2961
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431  973 CWTSGL---PFPVPPREVIKASPHAPDPSAFSYAPPGHPLP--------LGLHDTARPvlprpptisNPPPLISSAKHPS 1041
Cdd:PHA03247  2962 PWLGALvpgRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSrvsswassLALHEETDP---------PPVSLKQTLWPPD 3032
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622847431 1042 VLERqigaiSQGMSVQLHVPYSEHAKAPvGPVTmglPLPMDPKKLAPFSGVKQ--EQLSPRGQAGPP 1106
Cdd:PHA03247  3033 DTED-----SDADSLFDSDSERSDLEAL-DPLP---PEPHDPFAHEPDPATPEagARESPSSQFGPP 3090
PHA03247 PHA03247
large tegument protein UL36; Provisional
1663-1929 2.85e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 2.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1663 PHLPVLVPPTPGTPATAMDRLAYLPTAPQPFSSRHSSSPLSPGGPTHLTKPTTTSSSERERDRDRERDRDREreksilts 1742
Cdd:PHA03247  2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPR-------- 2781
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1743 STTVEHAPIWRPGTEQSSGSSGSSGGGGGSSSRPASHSHAHQHSPISPRTQDALQQRPS-----VLHNTGMKGiitAVEP 1817
Cdd:PHA03247  2782 RLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPpppgpPPPSLPLGG---SVAP 2858
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622847431 1818 STPTVLRSTSTSSPVRPAA-TFPPATHCPLGGTldgvyPTLMEPVLLPKEAPrvARPERPRADTGhaflAKPPARSGLEP 1896
Cdd:PHA03247  2859 GGDVRRRPPSRSPAAKPAApARPPVRRLARPAV-----SRSTESFALPPDQP--ERPPQPQAPPP----PQPQPQPPPPP 2927
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1622847431 1897 ASSPSKGSEPRPLAPPVSGHATIARTPAKNLAP 1929
Cdd:PHA03247  2928 QPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH