NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568931819|ref|XP_006539202|]
View 

arginine-glutamic acid dipeptide repeats protein isoform X3 [Mus musculus]

Protein Classification

arginine-glutamic acid dipeptide repeats protein( domain architecture ID 11562211)

arginine-glutamic acid dipeptide repeats protein (RERE) plays a role as a transcriptional repressor during development

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
568-1566 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1206.90  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   568 GKHSMRTRRSRGSMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKST 647
Cdd:pfam03154    1 GKHSMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   648 KRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 727
Cdd:pfam03154   81 KRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   728 SAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPTLH 807
Cdd:pfam03154  161 SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPTLH 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   808 PPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPAAA 884
Cdd:pfam03154  239 PQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAP 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   885 HP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKP 963
Cdd:pfam03154  319 GQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKP 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   964 LSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPITPP 1042
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPITPP 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1043 SCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHASQS 1122
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHASQS 556
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1123 ARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqKAS 1202
Cdd:pfam03154  557 ARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-KAS 635
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1203 SSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAYHM 1282
Cdd:pfam03154  636 SSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAYHM 715
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1283 PGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASFHP 1362
Cdd:pfam03154  716 PGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASFHP 795
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1363 GLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSA 1442
Cdd:pfam03154  796 GLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSG 875
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1443 GPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQAMH 1522
Cdd:pfam03154  876 GPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQAMH 947
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....
gi 568931819  1523 AQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1566
Cdd:pfam03154  948 AQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
BAH_MTA cd04709
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ...
102-307 3.74e-81

BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


:

Pssm-ID: 240060  Cd Length: 164  Bit Score: 263.87  E-value: 3.74e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709     1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709    38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 568931819  261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709   118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
395-440 1.33e-22

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


:

Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 91.91  E-value: 1.33e-22
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 568931819  395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661     1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
503-552 2.72e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


:

Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.72e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 568931819    503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ELM2 pfam01448
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ...
286-336 1.63e-12

ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.


:

Pssm-ID: 460214  Cd Length: 53  Bit Score: 63.40  E-value: 1.63e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 568931819   286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448    1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
568-1566 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1206.90  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   568 GKHSMRTRRSRGSMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKST 647
Cdd:pfam03154    1 GKHSMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   648 KRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 727
Cdd:pfam03154   81 KRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   728 SAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPTLH 807
Cdd:pfam03154  161 SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPTLH 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   808 PPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPAAA 884
Cdd:pfam03154  239 PQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAP 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   885 HP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKP 963
Cdd:pfam03154  319 GQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKP 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   964 LSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPITPP 1042
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPITPP 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1043 SCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHASQS 1122
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHASQS 556
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1123 ARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqKAS 1202
Cdd:pfam03154  557 ARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-KAS 635
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1203 SSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAYHM 1282
Cdd:pfam03154  636 SSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAYHM 715
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1283 PGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASFHP 1362
Cdd:pfam03154  716 PGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASFHP 795
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1363 GLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSA 1442
Cdd:pfam03154  796 GLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSG 875
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1443 GPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQAMH 1522
Cdd:pfam03154  876 GPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQAMH 947
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....
gi 568931819  1523 AQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1566
Cdd:pfam03154  948 AQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
BAH_MTA cd04709
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ...
102-307 3.74e-81

BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


Pssm-ID: 240060  Cd Length: 164  Bit Score: 263.87  E-value: 3.74e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709     1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709    38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 568931819  261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709   118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
395-440 1.33e-22

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 91.91  E-value: 1.33e-22
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 568931819  395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661     1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
503-552 2.72e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.72e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 568931819    503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ZnF_GATA cd00202
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ...
506-560 3.29e-15

Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C


Pssm-ID: 238123 [Multi-domain]  Cd Length: 54  Bit Score: 71.25  E-value: 3.29e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 568931819  506 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 560
Cdd:cd00202     1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
ELM2 pfam01448
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ...
286-336 1.63e-12

ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.


Pssm-ID: 460214  Cd Length: 53  Bit Score: 63.40  E-value: 1.63e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 568931819   286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448    1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
GATA pfam00320
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ...
507-542 1.25e-11

GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.


Pssm-ID: 425605 [Multi-domain]  Cd Length: 36  Bit Score: 60.41  E-value: 1.25e-11
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 568931819   507 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 542
Cdd:pfam00320    1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
BAH pfam01426
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been ...
103-281 2.71e-10

BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.


Pssm-ID: 460207  Cd Length: 120  Bit Score: 59.24  E-value: 2.71e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   103 VVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPPACSLPVapqppqhlseagrgpggskrdhllm 182
Cdd:pfam01426    1 ETYSVGDFVLVEPDDADEPYYVARIEEL----------------FEDTKNGKKMV------------------------- 39
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   183 NVKWYYRQSEVPdsvyqHLVQDRHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK-A 261
Cdd:pfam01426   40 RVQWFYRPEETV-----HRAGKAFNK----------------DELFLSDEEDDVPLSAIIGKCSVLHKSDLESLDPYKiK 98
                          170       180
                   ....*....|....*....|
gi 568931819   262 RVDSFFYILGYNPETRRLNS 281
Cdd:pfam01426   99 EPDDFFCELLYDPKTKSFKK 118
PHA03247 PHA03247
large tegument protein UL36; Provisional
540-1025 9.34e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 9.34e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  540 GELPPIEKPVDPPPFMFKPVKEeddglsgkhSMRTRRSRGSMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAA 617
Cdd:PHA03247 2549 GDPPPPLPPAAPPAAPDRSVPP---------PRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPP 2619
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  618 STSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEG 697
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLT 2696
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  698 SS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATA 772
Cdd:PHA03247 2697 SLadpppPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  773 VPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTG 852
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRES------LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  853 PLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPHSTIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQ 930
Cdd:PHA03247 2848 PSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPVRRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  931 LPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAA 1010
Cdd:PHA03247 2928 QPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSS 3006
                         490
                  ....*....|....*
gi 568931819 1011 SHPTTGLHQVPSQSP 1025
Cdd:PHA03247 3007 WASSLALHEETDPPP 3021
BAH smart00439
Bromo adjacent homology domain;
105-281 1.24e-09

Bromo adjacent homology domain;


Pssm-ID: 214664 [Multi-domain]  Cd Length: 121  Bit Score: 57.69  E-value: 1.24e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819    105 YRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpGGSKRDHLlmNV 184
Cdd:smart00439    2 ISVGDFVLVEPDDADEPYYIGRIEEI----------------FETK----------------------KNSESKMV--RV 41
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819    185 KWYYRQSEVPdsvyqHLVQDRHNENdsgrelvitdpviknrELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFKAR 262
Cdd:smart00439   42 RWFYRPEETV-----LEKAALFDKN----------------EVFLSDEYDTVPLSDIIGKCNVLYKSDYPglRPEGSIGE 100
                           170
                    ....*....|....*....
gi 568931819    263 VDSFFYILGYNPETRRLNS 281
Cdd:smart00439  101 PDVFFCESAYDPEKGSFKK 119
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
545-729 2.17e-08

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 59.15  E-value: 2.17e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  545 IEKPVDPP----PFMFKPVKEEDDGLSGKHSMRTRRSR------GSMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 614
Cdd:NF033609  539 IDKPVVPEqpdePGEIEPIPEDSDSDPGSDSGSDSSNSdsgsdsGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  615 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 692
Cdd:NF033609  619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 568931819  693 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 729
Cdd:NF033609  699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
600-760 1.51e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  600 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 678
Cdd:NF033609  716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  679 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 756
Cdd:NF033609  796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875

                  ....
gi 568931819  757 GTPQ 760
Cdd:NF033609  876 NSPK 879
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
396-441 6.69e-05

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 41.83  E-value: 6.69e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 568931819    396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWKKT 441
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKE-LPGRTAEQCRERWRNLLKP 49
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
580-729 1.85e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.44  E-value: 1.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  580 SMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE 659
Cdd:NF033609  630 SASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 709
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568931819  660 -DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 729
Cdd:NF033609  710 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
396-439 1.91e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 40.56  E-value: 1.91e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 568931819   396 WTEDEVKRFVKGLRQYGKNFFRIrKELLPSKETGELITFYYYWK 439
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKI-AKLLPGRTDNQCKNRWQNYL 46
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
580-729 3.29e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.29  E-value: 3.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  580 SMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTe 659
Cdd:NF033609  654 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS- 732
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568931819  660 DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 729
Cdd:NF033609  733 DSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
650-782 6.24e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 40.65  E-value: 6.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   650 QREKVASDTEDTDRITSKKTKTQ-EISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDN---------RSTSPSIPSPQ 719
Cdd:TIGR00601    9 QQQKFKIDMEPDETVKELKEKIEaEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDfvvvmvskpKTGTGKVAPPA 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568931819   720 DNESDSDSSAqqqmlqAQPPALQAPSGAASAPSTAPPGTP---QLPTQGPTPSATAVPPQGSPATS 782
Cdd:TIGR00601   89 ATPTSAPTPT------PSPPASPASGMSAAPASAVEEKSPseeSATATAPESPSTSVPSSGSDAAS 148
 
Name Accession Description Interval E-value
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
568-1566 0e+00

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 1206.90  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   568 GKHSMRTRRSRGSMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKST 647
Cdd:pfam03154    1 GKHSMRTRRSRGSMSTLRSGRKKQTASPDGRASPTNEDLRSSGRNSPSAASTSSNDSKAESMKKSSKKIKEEAPSPLKSA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   648 KRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 727
Cdd:pfam03154   81 KRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDS 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   728 SAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPaaHTHIQQAPTLH 807
Cdd:pfam03154  161 SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAP--HTLIQQTPTLH 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   808 PPRLPSPHPPLQPMT--APPSQSSAQPHPQPSLHSQGPPGPHSLQTGP-LLQHPGPPQPFGLPSQPSQGQGPLGPSPAAA 884
Cdd:pfam03154  239 PQRLPSPHPPLQPMTqpPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPsHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAP 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   885 HP-HSTIQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNANLPPPPALKP 963
Cdd:pfam03154  319 GQsQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKP 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   964 LSSLSTHHPPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTT-GLHQVPSQSPFPQHPFVPGGPPPITPP 1042
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTsGLHQVPSQSPFPQHPFVPGGPPPITPP 478
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1043 SCPPTSTPPAGPSSSSQPPcsAAVSSGGSVPGAPSCPLPAVQIKEEALDEAEEPESPPPPPRSPSPEPTVVDTPSHASQS 1122
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSS--ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHASQS 556
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1123 ARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKAREEREREKEKEKEREREREREREAERAAqKAS 1202
Cdd:pfam03154  557 ARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEALEKAKREAEQKAREEKEREKEKEKEREREREREREAERAA-KAS 635
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1203 SSAHEGRLSDPQLSGPGHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAYHM 1282
Cdd:pfam03154  636 SSSHEGRMGDPQLAGPAHMRPSFEPPPTTIAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFFVPLNPTDPLLAYHM 715
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1283 PGLYNVDPTIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHSALTIPPAAGPHPFASFHP 1362
Cdd:pfam03154  716 PGLYNVDPAIRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPATNPMEHFARHGALTLPPMAGPHPFASFHP 795
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1363 GLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSA 1442
Cdd:pfam03154  796 GLNPLERERLALAGPQLRPEMSYPDRLAAERLHAERMASLTNDPLARLQMFNVTPHHHQHSHIHSHLHLHQQDPLHQGSG 875
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  1443 GPVHPLVDPLTAGPHLARFPYPPGTLPNPLLGQPPHEHEMLRHPVFaepvlrlaGTPYPRDLPGAIPPPMSAAHQLQAMH 1522
Cdd:pfam03154  876 GPVHPLVDPLAAGPHLARFPYPPGAIPNPLLGQPPHEHEMLRHPVF--------GTPYPRDLPGGLPPPMSAAHQLQAMH 947
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....
gi 568931819  1523 AQSAELQRLAMEQQWLHGHPHMHGGHLPSQEDYYSRLKKEGDKQ 1566
Cdd:pfam03154  948 AQSAELQRLAMEQQWLHGHPHMHGGHLPGQEDYYSRLKKESDKQ 991
BAH_MTA cd04709
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The ...
102-307 3.74e-81

BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


Pssm-ID: 240060  Cd Length: 164  Bit Score: 263.87  E-value: 3.74e-81
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  102 DVVYRPGDCVYIESRrPNTPYFICSIQDFKLvhssqaccrspapafcdppacslpvapqppqhlseagrgpggSKRDHLL 181
Cdd:cd04709     1 ANMYRVGDYVYFESS-PNNPYLIRRIEELNK------------------------------------------TARGHVE 37
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  182 MNVKWYYRQSEVPDSVYQHLVQDRHNEND-SGRELVITDPVIKNRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04709    38 AKVVCYYRRRDIPDSLYQLADQHRRELEEkSDDLTPKQRHQLRHRELFLSRQVETLPATHIRGKCSVTLLNDTESARSYL 117
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 568931819  261 ARVDSFFYILGYNPETRRLNSTQGEIRVGPSHQAKLPDLQPFPSPDG 307
Cdd:cd04709   118 AREDTFFYSLVYDPEQKTLLADQGEIRVGPSYQAKLPDLQPFPSPDG 164
SANT_MTA3_like cd11661
Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family ...
395-440 1.33e-22

Myb-Like Dna-Binding Domain of MTA3 and related proteins; Members in this SANT/myb family include domains found in mouse metastasis-associated protein 3 (MTA3) proteins and arginine-glutamic dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2, N-CoR and TFIIIB) DNA-binding domains are a diverse set of proteins that share a common 3 alpha-helix bundle. MTA3 has been shown to interact with nucleosome remodeling and deacetylase (NuRD) proteins CHD4 and HDAC1, and the core cohesin complex protein RAD21 in the ovary, and regulate G2/M progression in proliferating granulosa cells. RERE belongs to the atrophin family and has been identified as a nuclear receptor corepressor; altered expression levels of RERE are associated with cancer in humans while mutations of Rere in mice cause failure in closing the anterior neural tube and fusion of the telencephalic and optic vesicles during embryogenesis.


Pssm-ID: 212559 [Multi-domain]  Cd Length: 46  Bit Score: 91.91  E-value: 1.33e-22
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 568931819  395 CWTEDEVKRFVKGLRQYGKNFFRIRKELLPSKETGELITFYYYWKK 440
Cdd:cd11661     1 EWSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
ZnF_GATA smart00401
zinc finger binding to DNA consensus sequence [AT]GATA[AG];
503-552 2.72e-15

zinc finger binding to DNA consensus sequence [AT]GATA[AG];


Pssm-ID: 214648 [Multi-domain]  Cd Length: 52  Bit Score: 71.30  E-value: 2.72e-15
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|.
gi 568931819    503 KGYACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL-PPIEKPVDPP 552
Cdd:smart00401    2 SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLkRPLSLKKDGI 52
ZnF_GATA cd00202
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] ...
506-560 3.29e-15

Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C


Pssm-ID: 238123 [Multi-domain]  Cd Length: 54  Bit Score: 71.25  E-value: 3.29e-15
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 568931819  506 ACRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGELPPIEKPvDPPPFMFKPVK 560
Cdd:cd00202     1 ACSNCGTTTTPLWRRGPSGGSTLCNACGLYWKKHGVMRPLSKR-KKDQIKRRNRK 54
ELM2 pfam01448
ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown ...
286-336 1.63e-12

ELM2 domain; The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in Swiss:O82364. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.


Pssm-ID: 460214  Cd Length: 53  Bit Score: 63.40  E-value: 1.63e-12
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 568931819   286 IRVGPSHQAKLPDLQPFPSPDGDTVTQHEELVWMP--GVSDCDLLMYLRAARS 336
Cdd:pfam01448    1 IRVGPRYQAEIPELLPPSEEEDRYEEEDELLVWDPnhNLPDRKLDEYLVVARS 53
GATA pfam00320
GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This ...
507-542 1.25e-11

GATA zinc finger; This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contain a single copy of the domain.


Pssm-ID: 425605 [Multi-domain]  Cd Length: 36  Bit Score: 60.41  E-value: 1.25e-11
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 568931819   507 CRHCFTTTSKDWHHGGRENILLCTDCRIHFKKYGEL 542
Cdd:pfam00320    1 CSNCGTTKTPLWRRGPNGNRTLCNACGLYYKKKGLK 36
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
732-898 1.40e-10

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 66.21  E-value: 1.40e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   732 QM-LQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspatsQPPNQTQSTVAPAAHTHIQQAPtlhppr 810
Cdd:pfam09770  202 AMrAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQ------QQPQQPQQHPGQGHPVTILQRP------ 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   811 lpsphpplQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQtgpLLQHP-------------GPPQPFGLPSQPSQGQGPL 877
Cdd:pfam09770  270 --------QSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQ---ILQNPnrlsaarvgypqnPQPGVQPAPAHQAHRQQGS 338
                          170       180
                   ....*....|....*....|...
gi 568931819   878 --GPSPAAAHPHSTIQLPASQSA 898
Cdd:pfam09770  339 fgRQAPIITHPQQLAQLSEEEKA 361
BAH pfam01426
BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been ...
103-281 2.71e-10

BAH domain; This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.


Pssm-ID: 460207  Cd Length: 120  Bit Score: 59.24  E-value: 2.71e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   103 VVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPPACSLPVapqppqhlseagrgpggskrdhllm 182
Cdd:pfam01426    1 ETYSVGDFVLVEPDDADEPYYVARIEEL----------------FEDTKNGKKMV------------------------- 39
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   183 NVKWYYRQSEVPdsvyqHLVQDRHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK-A 261
Cdd:pfam01426   40 RVQWFYRPEETV-----HRAGKAFNK----------------DELFLSDEEDDVPLSAIIGKCSVLHKSDLESLDPYKiK 98
                          170       180
                   ....*....|....*....|
gi 568931819   262 RVDSFFYILGYNPETRRLNS 281
Cdd:pfam01426   99 EPDDFFCELLYDPKTKSFKK 118
PHA03247 PHA03247
large tegument protein UL36; Provisional
540-1025 9.34e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 9.34e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  540 GELPPIEKPVDPPPFMFKPVKEeddglsgkhSMRTRRSRGSMSTLRSGRKKQPTSPDGRASPINE--DIRSSGRNSPSAA 617
Cdd:PHA03247 2549 GDPPPPLPPAAPPAAPDRSVPP---------PRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSPLPP 2619
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  618 STSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTqeiSRPNSPSEGEGESSDSRSVNDEG 697
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA---AQASSPPQRPRRRAARPTVGSLT 2696
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  698 SS-----DPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqaPSGAASAPSTAPPGTPQLPTQGPTPSATA 772
Cdd:PHA03247 2697 SLadpppPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAV---PAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  773 VPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTG 852
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRES------LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP 2847
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  853 PLLQHPGPPQPFG-LPSQPSQGQGPLGPSPAAAHPHSTIQLPA-SQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQ 930
Cdd:PHA03247 2848 PSLPLGGSVAPGGdVRRRPPSRSPAAKPAAPARPPVRRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  931 LPAPQAHKHPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQLmPQSQPLPSSPAQPPGLTQSQSLPPPAA 1010
Cdd:PHA03247 2928 QPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRV-PQPAPSREAPASSTPPLTGHSLSRVSS 3006
                         490
                  ....*....|....*
gi 568931819 1011 SHPTTGLHQVPSQSP 1025
Cdd:PHA03247 3007 WASSLALHEETDPPP 3021
BAH smart00439
Bromo adjacent homology domain;
105-281 1.24e-09

Bromo adjacent homology domain;


Pssm-ID: 214664 [Multi-domain]  Cd Length: 121  Bit Score: 57.69  E-value: 1.24e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819    105 YRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpGGSKRDHLlmNV 184
Cdd:smart00439    2 ISVGDFVLVEPDDADEPYYIGRIEEI----------------FETK----------------------KNSESKMV--RV 41
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819    185 KWYYRQSEVPdsvyqHLVQDRHNENdsgrelvitdpviknrELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFKAR 262
Cdd:smart00439   42 RWFYRPEETV-----LEKAALFDKN----------------EVFLSDEYDTVPLSDIIGKCNVLYKSDYPglRPEGSIGE 100
                           170
                    ....*....|....*....
gi 568931819    263 VDSFFYILGYNPETRRLNS 281
Cdd:smart00439  101 PDVFFCESAYDPEKGSFKK 119
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
545-729 2.17e-08

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 59.15  E-value: 2.17e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  545 IEKPVDPP----PFMFKPVKEEDDGLSGKHSMRTRRSR------GSMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSP 614
Cdd:NF033609  539 IDKPVVPEqpdePGEIEPIPEDSDSDPGSDSGSDSSNSdsgsdsGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDS 618
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  615 SAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRS 692
Cdd:NF033609  619 ASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDS 698
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 568931819  693 VNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 729
Cdd:NF033609  699 DSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 734
BAH_fungalPHD cd04710
BAH, or Bromo Adjacent Homology domain, as present in fungal proteins containing PHD domains. ...
101-278 5.28e-08

BAH, or Bromo Adjacent Homology domain, as present in fungal proteins containing PHD domains. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.


Pssm-ID: 240061  Cd Length: 135  Bit Score: 53.14  E-value: 5.28e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  101 DDVVYRPGDCVYIESRRPNTPYFICSIQDFklvhssqaccrspapafcdppacsLPVAPQPPQHLSEAGRGPGGSKRdhl 180
Cdd:cd04710     8 NGELLKVNDHIYMSSEPPGEPYYIGRIMEF------------------------VPKHEFPSGIHARVFPASYFQVR--- 60
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  181 lMNvkWYYRQSEVpdsvyqhlvqDRHNENDSgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIFAAREFK 260
Cdd:cd04710    61 -LN--WYYRPRDI----------SRRVVADS-------------RLLYASMHSDICPIGSVRGKCTVRHRDQIPDLEEYK 114
                         170
                  ....*....|....*...
gi 568931819  261 ARVDSFFYILGYNPETRR 278
Cdd:cd04710   115 KRPNHFYFDQLFDRYILR 132
BAH cd04370
BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). ...
105-277 6.39e-08

BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). BAH domains have first been described as domains found in the polybromo protein and Yeast Rsc1/Rsc2 (Remodeling of the Structure of Chromatin). They also occur in mammalian DNA methyltransferases and the MTA1 subunits of histone deacetylase complexes. A BAH domain is also found in Yeast Sir3p and in the origin receptor complex protein 1 (Orc1p), where it was found to interact with the N-terminal lobe of the silence information regulator 1 protein (Sir1p), confirming the initial hypothesis that BAH plays a role in protein-protein interactions.


Pssm-ID: 239835 [Multi-domain]  Cd Length: 123  Bit Score: 52.78  E-value: 6.39e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  105 YRPGDCVYIE--SRRPNTPYFICSIQDFklvhssqaccrspapaFCDPpacslpvapqppqhlseagrgpggskRDHLLM 182
Cdd:cd04370     4 YEVGDSVYVEpdDSIKSDPPYIARIEEL----------------WEDT--------------------------NGSKQV 41
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  183 NVKWYYRQSEVPDSVYQHlvqdrHNEndsgrelvitdpviknRELFISDYVDTYHAAALRGKCNISHFSDIF--AAREFK 260
Cdd:cd04370    42 KVRWFYRPEETPKGLSPF-----ALR----------------RELFLSDHLDEIPVESIIGKCKVLFVSEFEglKQRPNK 100
                         170
                  ....*....|....*..
gi 568931819  261 ARVDSFFYILGYNPETR 277
Cdd:cd04370   101 IDTDDFFCRLAYDPTTK 117
PHA03247 PHA03247
large tegument protein UL36; Provisional
664-1030 2.27e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 2.27e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  664 ITSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSAQQQMLQAQPPALQA 743
Cdd:PHA03247 2582 VTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS-PAANEPDPHPPPTVPPPERPRDDPAPGR 2660
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  744 PSGAASAPSTAPPGTPQLPTQGPTPSAtAVPPQGSPATSQPPNQTQSTVAPAahthiqqaPTLHPPRLPsphpplqpmTA 823
Cdd:PHA03247 2661 VSRPRRARRLGRAAQASSPPQRPRRRA-ARPTVGSLTSLADPPPPPPTPEPA--------PHALVSATP---------LP 2722
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  824 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSALQPQQ 903
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW 2802
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  904 PPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPSPFSLNA------------NLPPPPALKPLSSLSTHH 971
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvapggdvrrRPPSRSPAAKPAAPARPP 2882
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  972 PPSAHPPPLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLHQVPSQSP-FPQHP 1030
Cdd:PHA03247 2883 VRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPpRPQPP 2942
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
734-886 3.45e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 3.45e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  734 LQAQPPALQAPSGAASAPSTAPPGTPQL-------PTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTL 806
Cdd:PRK07764  580 GDWQVEAVVGPAPGAAGGEGPPAPASSGppeeaarPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  807 HPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGP-PQPFGLPSQPSQGQGPLGPSPAAAH 885
Cdd:PRK07764  660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQaDDPAAQPPQAAQGASAPSPAADDPV 739

                  .
gi 568931819  886 P 886
Cdd:PRK07764  740 P 740
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
716-886 3.72e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 3.72e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  716 PSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA 795
Cdd:PRK07764  597 GEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGA 676
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  796 AHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLH------SQGPPGPHSLQTGPLLQHPG-PPQPFGLPS 868
Cdd:PRK07764  677 APAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQppqaaqGASAPSPAADDPVPLPPEPDdPPDPAGAPA 756
                         170
                  ....*....|....*...
gi 568931819  869 QPSQGQGPLGPSPAAAHP 886
Cdd:PRK07764  757 QPPPPPAPAPAAAPAAAP 774
PHA03247 PHA03247
large tegument protein UL36; Provisional
708-1137 6.32e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.56  E-value: 6.32e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  708 NRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSG------AASAPSTAPPGTPQLPTQGPTPSATAV-PPQGSPA 780
Cdd:PHA03247 2565 DRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDdrgdprGPAPPSPLPPDTHAPDPPPPSPSPAANePDPHPPP 2644
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  781 TSQPPNQTQSTVAPAA-----HTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHS---LQTG 852
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRvsrprRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSatpLPPG 2724
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  853 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHStiqlPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLP 932
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARP----PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  933 APQAHKHPPHLSGPSPFSLNANLPPPPALKPLSSLSTHHPPSAHPPPLQL--------------MPQSQPLPSSPAQPPG 998
Cdd:PHA03247 2801 PWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLplggsvapggdvrrRPPSRSPAAKPAAPAR 2880
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  999 lTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvPGGPPPITPPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSC 1078
Cdd:PHA03247 2881 -PPVRRLARPAVSRSTESFALPPDQPERPPQP--QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568931819 1079 PLPAVQikeeaLDEAEEPESPPPPPRSPSPEPTVvdtPSHASQSARFYKHLDRGYNSCA 1137
Cdd:PHA03247 2958 AVPQPW-----LGALVPGRVAVPRFRVPQPAPSR---EAPASSTPPLTGHSLSRVSSWA 3008
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
723-878 8.62e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.84  E-value: 8.62e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  723 SDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPAtsqPPNQTQSTVAPAAhthiQQ 802
Cdd:PRK07764  367 ASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPA---PAAAPQPAPAPAP----AP 439
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568931819  803 APTlhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPslhsQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLG 878
Cdd:PRK07764  440 APP---SPAGNAPAGGAPSPPPAAAPSAQPAPAP----AAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
PHA03247 PHA03247
large tegument protein UL36; Provisional
542-953 4.26e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 4.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  542 LPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSRGSMSTLRSGRKKQPTSPD--GRASPINEDIRSSGRNSPSAAST 619
Cdd:PHA03247 2617 LPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGraAQASSPPQRPRRRAARPTVGSLT 2696
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  620 SSNDSKAETVKKSAKKVKEEAASPL---KSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGESSDSRSVNDE 696
Cdd:PHA03247 2697 SLADPPPPPPTPEPAPHALVSATPLppgPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPA 2776
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  697 GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTA---PPGTPQLPTQGPTPSATAV 773
Cdd:PHA03247 2777 AGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAqptAPPPPPGPPPPSLPLGGSV 2856
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  774 PPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH-SLQTG 852
Cdd:PHA03247 2857 APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQpPPPPP 2936
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  853 PLLQHPGPPQPFGLPSQPSQGQGP-----------------LGPSPAAA-------------HPHSTIQLPASQSALQPQ 902
Cdd:PHA03247 2937 PRPQPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrvavprfRVPQPAPSreapasstppltgHSLSRVSSWASSLALHEE 3016
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  903 QPP-----------------REQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLS------------GPSPFSLNA 953
Cdd:PHA03247 3017 TDPppvslkqtlwppddtedSDADSLFDSDSERSDLEALDPLPPEPHDPFAHEPDPATPeagarespssqfGPPPLSANA 3096
PRK10263 PRK10263
DNA translocase FtsK; Provisional
547-892 6.58e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 51.24  E-value: 6.58e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  547 KPVDPPPFMFKPVKE----EDDGLSGKHS-----MRTRRSRGSMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAA 617
Cdd:PRK10263  500 KPARPPLYYFEEVEEkrarEREQLAAWYQpipepVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLATGAAA 579
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  618 STSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTED------TDRITSKKTKTQEISRPNSPSEGEGESSDSR 691
Cdd:PRK10263  580 TVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASygiklpSQRAAEEKAREAQRNQYDSGDQYNDDEIDAM 659
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  692 SvNDE-----GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGA------------------- 747
Cdd:PRK10263  660 Q-QDElarqfAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGAnpfslddfefspmkalldd 738
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  748 -ASAPSTAPPGTP-QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPP 825
Cdd:PRK10263  739 gPHEPLFTPIVEPvQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQ 818
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 568931819  826 S-QSSAQPHPQPSLHSQGP-PGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQL 892
Cdd:PRK10263  819 QpQQPVAPQPQYQQPQQPVaPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFAL 887
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
736-854 1.01e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 50.10  E-value: 1.01e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  736 AQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqqAPTLHPPRLPSPH 815
Cdd:PRK14951  382 ARPEA--AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA------APAAVALAPAPPA 453
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 568931819  816 PPLQPMTAPPSQSSAQPH-PQPSLHSQGPPGPHSLQTGPL 854
Cdd:PRK14951  454 QAAPETVAIPVRVAPEPAvASAAPAPAAAPAAARLTPTEE 493
PLN02967 PLN02967
kinase
558-687 1.42e-05

kinase


Pssm-ID: 215521 [Multi-domain]  Cd Length: 581  Bit Score: 49.66  E-value: 1.42e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  558 PVKEEDDGLSGKHSMRTRRSRgsmstlRSGRKKQPTSPDGRASPINEDIRssgrNSPSAASTSSNDSKAETVKKSA---K 634
Cdd:PLN02967   57 AVDEEPDENGAVSKKKPTRSV------KRATKKTVVEISEPLEEGSELVV----NEDAALDKESKKTPRRTRRKAAaasS 126
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 568931819  635 KVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEGES 687
Cdd:PLN02967  127 DVEEEKTEKKVRKRRKVKKMDEDVEDQGSESEVSDVEESEFVTSLENESEEEL 179
PRK10856 PRK10856
cytoskeleton protein RodZ;
717-798 1.48e-05

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 48.87  E-value: 1.48e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  717 SPQDNES---DSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVA 793
Cdd:PRK10856  155 SQNSGQSvplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234

                  ....*
gi 568931819  794 PAAHT 798
Cdd:PRK10856  235 APLPT 239
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
600-760 1.51e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  600 SPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE-DTDRITSKKTKTQEISRPN 678
Cdd:NF033609  716 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDsDSDSDSDSDSDSDSDSDSD 795
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  679 SPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPS-IPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPP 756
Cdd:NF033609  796 SDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP 875

                  ....
gi 568931819  757 GTPQ 760
Cdd:NF033609  876 NSPK 879
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
709-850 1.73e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.65  E-value: 1.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   709 RSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTP-------SATAVPPQGSPAT 781
Cdd:pfam09770  204 RAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVtilqrpqSPQPDPAQPSIQP 283
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568931819   782 SQPPNQTQSTVAPAAHTHIQQAPTLhpPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPP---GPHSLQ 850
Cdd:pfam09770  284 QAQQFHQQPPPVPVQPTQILQNPNR--LSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGRQAPiitHPQQLA 353
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
665-846 2.44e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.21  E-value: 2.44e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  665 TSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSA--QQQMLQAQPPALQ 742
Cdd:PRK07764  600 PPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDggDGWPAKAGGAAPA 679
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  743 APSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMT 822
Cdd:PRK07764  680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
                         170       180
                  ....*....|....*....|....
gi 568931819  823 APPSQSSAQPHPQPSLHSQGPPGP 846
Cdd:PRK07764  760 PPPAPAPAAAPAAAPPPSPPSEEE 783
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
396-441 6.69e-05

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 41.83  E-value: 6.69e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 568931819    396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWKKT 441
Cdd:smart00717    4 WTEEEDELLIELVKKYGkNNWEKIAKE-LPGRTAEQCRERWRNLLKP 49
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
396-439 6.78e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 41.79  E-value: 6.78e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 568931819  396 WTEDEVKRFVKGLRQYG-KNFFRIRKElLPSKETGELITFYYYWK 439
Cdd:cd00167     2 WTEEEDELLLEAVKKYGkNNWEKIAKE-LPGRTPKQCRERWRNLL 45
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
736-947 7.55e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.56  E-value: 7.55e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  736 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPnqtqstvAPAAHTHIQQAPTLHPPrlpsph 815
Cdd:PRK12323  379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP-------APEALAAARQASARGPG------ 445
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  816 pplqPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPllqhPGPPQPFGLPSQPSQGQGP---LGPSPAAAHPHSTIQL 892
Cdd:PRK12323  446 ----GAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA----PARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAA 517
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 568931819  893 PASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGPS 947
Cdd:PRK12323  518 PAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG 572
PRK10927 PRK10927
cell division protein FtsN;
637-863 1.06e-04

cell division protein FtsN;


Pssm-ID: 236797 [Multi-domain]  Cd Length: 319  Bit Score: 46.21  E-value: 1.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  637 KEEAASPLKSTKRQREKVASDTEDTDR-ITSKKTKTQEISRPNSPSEGeGESSDSRSVNDEGSSDPKDIDQDNRSTSPSI 715
Cdd:PRK10927   58 KKEESETLQSQKVTGNGLPPKPEERWRyIKELESRQPGVRAPTEPSAG-GEVKTPEQLTPEQRQLLEQMQADMRQQPTQL 136
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  716 PSPQDNESDSDSsaQQQMLQAQPpalQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPatsQPPNQTQStvapa 795
Cdd:PRK10927  137 VEVPWNEQTPEQ--RQQTLQRQR---QAQQLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQP---RQSKPAST----- 203
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568931819  796 ahthiqQAPtlhpprlpsphppLQPMTAPPSQSSAQPHPQpslhsqgppgphslQTGPLLQHPGPPQP 863
Cdd:PRK10927  204 ------QQP-------------YQDLLQTPAHTTAQSKPQ--------------QAAPVTRAADAPKP 238
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
691-796 1.10e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 47.19  E-value: 1.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  691 RSVNDEGSSDPK--DIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPgTPQLPTQGPTP 768
Cdd:PRK12270   17 QYLADPNSVDPSwrEFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-KPAAAAAAAAA 95
                          90       100       110
                  ....*....|....*....|....*....|.
gi 568931819  769 SATAVPPQGSPATSQPPNQTQSTV---APAA 796
Cdd:PRK12270   96 PAAPPAAAAAAAPAAAAVEDEVTPlrgAAAA 126
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
623-793 1.67e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 46.26  E-value: 1.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  623 DSKAETVKKSAKKVKEEAAsPLKSTKRQREKVASDTEdtdriTSKKTKTQEISRPNSPSEGEGESSDSRSVNdEGSSDPK 702
Cdd:PRK14949  630 SPKEGDGKKSSADRKPKTP-PSRAPPASLSKPASSPD-----ASQTSASFDLDPDFELATHQSVPEAALASG-SAPAPPP 702
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  703 DIDQDNRstsPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATS 782
Cdd:PRK14949  703 VPDPYDR---PPWEEAPEVASANDGPNNAAEGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQ 779
                         170
                  ....*....|.
gi 568931819  783 QPPNQTQSTVA 793
Cdd:PRK14949  780 DTELNLVLLSS 790
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
580-729 1.85e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.44  E-value: 1.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  580 SMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTE 659
Cdd:NF033609  630 SASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 709
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568931819  660 -DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 729
Cdd:NF033609  710 sDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 780
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
396-439 1.91e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 40.56  E-value: 1.91e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 568931819   396 WTEDEVKRFVKGLRQYGKNFFRIrKELLPSKETGELITFYYYWK 439
Cdd:pfam00249    4 WTPEEDELLLEAVEKLGNRWKKI-AKLLPGRTDNQCKNRWQNYL 46
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
735-855 2.63e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.48  E-value: 2.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  735 QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPT---PSATAVPPQGSPATSQPPNqtQSTVAPAAHTHIQQAPTLHPPRL 811
Cdd:PRK14951  387 AAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAaapPAPVAAPAAAAPAAAPAAA--PAAVALAPAPPAQAAPETVAIPV 464
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 568931819  812 PSPHPPLQPMTAPPSqssaQPHPQPSLHSQGPPGPHSLQTGPLL 855
Cdd:PRK14951  465 RVAPEPAVASAAPAP----AAAPAAARLTPTEEGDVWHATVQQL 504
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
580-729 3.29e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 45.29  E-value: 3.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  580 SMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTe 659
Cdd:NF033609  654 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS- 732
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568931819  660 DTDRITSKKTKTQEISRPNSPSEGEGES-SDSRSVNDEGSSDPKDIDQDNRSTSPSiPSPQDNESDSDSSA 729
Cdd:NF033609  733 DSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDS 802
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
537-847 3.97e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 45.07  E-value: 3.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  537 KKYGELPPIEK---------PVDPPPFMFKPVKEEDDGLSGKHSmrtRRSRGSMSTLRSGRKKQPTSPDGRASPinediR 607
Cdd:PTZ00449  491 KSKKKLAPIEEedsdkhdepPEGPEASGLPPKAPGDKEGEEGEH---EDSKESDEPKEGGKPGETKEGEVGKKP-----G 562
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  608 SSGRNSPSAASTSSNDSKAETVKKSAKKVKEEaasplKSTKRQRekvaSDTEDTDRITSKKTKTQEI----SRPNSPSEG 683
Cdd:PTZ00449  563 PAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEP-----KKPKRPR----SAQRPTRPKSPKLPELLDIpkspKRPESPKSP 633
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  684 EGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIP-SPQDNES--DSDSSAQQQMLQAQPPALQAPSGAASAPSTAP--PGT 758
Cdd:PTZ00449  634 KRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPfDPKFKEKfyDDYLDAAAKSKETKTTVVLDESFESILKETLPetPGT 713
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  759 P-----QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQP----MTAPPSQSS 829
Cdd:PTZ00449  714 PfttprPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEdihaETGEPDEAM 793
                         330
                  ....*....|....*...
gi 568931819  830 AQPHpQPSLHSQGPPGPH 847
Cdd:PTZ00449  794 KRPD-SPSEHEDKPPGDH 810
dnaA PRK14086
chromosomal replication initiator protein DnaA;
709-880 4.93e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.82  E-value: 4.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  709 RSTSPSIPSPQDNESDSDSSAQQQMLQAQP--PALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavpPQGSPATSQPPn 786
Cdd:PRK14086  115 RRPYEGYGGPRADDRPPGLPRQDQLPTARPayPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA----PYASPASYAPE- 189
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  787 qtQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmtaPPSQSSAQPHPQPS---LHSQGPPGPHSLQTGPLLQHPGPPQP 863
Cdd:PRK14086  190 --QERDREPYDAGRPEYDQRRRDYDHPRPDWDR----PRRDRTDRPEPPPGaghVHRGGPGPPERDDAPVVPIRPSAPGP 263
                         170
                  ....*....|....*..
gi 568931819  864 FglPSQPSQGQGPLGPS 880
Cdd:PRK14086  264 L--AAQPAPAPGPGEPT 278
PRK10905 PRK10905
cell division protein DamX; Validated
679-895 5.01e-04

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 44.16  E-value: 5.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  679 SPSEGEGESSDSRSVNDEGSSDpkdiDQDNRSTspsiPSP-QDNESDSDSSAQQQMlqAQPPALQAPSGAASAPstAPPG 757
Cdd:PRK10905   24 STSSSDQTASGEKSIDLAGNAT----DQANGVQ----PAPgTTSAEQTAGNTQQDV--SLPPISSTPTQGQTPV--ATDG 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  758 TPQLPTQG------------------------PTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPS 813
Cdd:PRK10905   92 QQRVEVQGdlnnaltqpqnqqqlnnvavnstlPTEPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQAT 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  814 PHPPLQPMTAPP--SQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQ 891
Cdd:PRK10905  172 AKTEPKPVAQTPkrTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQ 251

                  ....
gi 568931819  892 LPAS 895
Cdd:PRK10905  252 LSSS 255
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
725-1013 5.75e-04

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 44.64  E-value: 5.75e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   725 SDSSAQQQML-QAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATA------------------------VPPQGSP 779
Cdd:pfam09770   92 SDAIEEEQVRfNRQQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyekykepepipdlqvdaslwgVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   780 ATSQPPnqtqsTVAPAAHTHIQQAPTLHPPRLPSPHPPLQpmTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPG 859
Cdd:pfam09770  172 APAPAP-----QPAAQPASLPAPSRKMMSLEEVEAAMRAQ--AKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ 244
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   860 PPQPFGLPSQPSQGQGplgpspaaaHPHSTIQLPASQSAlqpqqppreqplppaplamphikPPPTTPIPQLPAPQAHKH 939
Cdd:pfam09770  245 QPQQQPQQPQQHPGQG---------HPVTILQRPQSPQP-----------------------DPAQPSIQPQAQQFHQQP 292
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568931819   940 PPHLSGPSPFSLNANLPPPPALKPLSslsthhppsahppplQLMPQSQPLPSSPAQPPGltQSQSLPPPAASHP 1013
Cdd:pfam09770  293 PPVPVQPTQILQNPNRLSAARVGYPQ---------------NPQPGVQPAPAHQAHRQQ--GSFGRQAPIITHP 349
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
713-876 6.32e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.47  E-value: 6.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  713 PSIPSPQDNESD----SDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQgspATSQPPNQT 788
Cdd:PRK07994  361 PAAPLPEPEVPPqsaaPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQ---LQRAQGATK 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  789 QSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPS 868
Cdd:PRK07994  438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAAKLA 517

                  ....*...
gi 568931819  869 QPSQGQGP 876
Cdd:PRK07994  518 AEAIERDP 525
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
691-803 9.25e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.00  E-value: 9.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  691 RSVNDEGSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAlqAPSGAASAPSTAPPGTPQLPTQGPTPSA 770
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPS--APQSATQPAGTPPTVSVDPPAAVPVNPP 440
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 568931819  771 TAVPPQGSPATSQPPNQ----TQSTVAPAAHTHIQQA 803
Cdd:PRK14971  441 STAPQAVRPAQFKEEKKipvsKVSSLGPSTLRPIQEK 477
PHA03378 PHA03378
EBNA-3B; Provisional
701-869 9.89e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 9.89e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  701 PKDIDQDNRSTSPSIPSPQDNESDSDSSAQQqMLQAQPPALQAPSGAAS--APSTAPPGTPQLPTQGPTPsatAVPPQGS 778
Cdd:PHA03378  655 PQVEITPYKPTWTQIGHIPYQPSPTGANTML-PIQWAPGTMQPPPRAPTpmRPPAAPPGRAQRPAAATGR---ARPPAAA 730
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  779 PATSQPPNQTQSTVAP--AAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQ 856
Cdd:PHA03378  731 PGRARPPAAAPGRARPpaAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLM 810
                         170
                  ....*....|...
gi 568931819  857 HPGPPQPFGLPSQ 869
Cdd:PHA03378  811 PRAAPGQQGPTKQ 823
PHA03247 PHA03247
large tegument protein UL36; Provisional
739-1084 1.01e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  739 PALQAPSGAASaPStAPPGTPQlPTQGPTPSATAVPPQGSPATSQPPNQTQ-STVAPAAHTHIQQAPTLHPPRLPSPHPP 817
Cdd:PHA03247 2478 PVYRRPAEARF-PF-AAGAAPD-PGGGGPPDPDAPPAPSRLAPAILPDEPVgEPVHPRMLTWIRGLEELASDDAGDPPPP 2554
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  818 LQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAH--------PHST 889
Cdd:PHA03247 2555 LPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHapdppppsPSPA 2634
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  890 IQLPASQSALQPQQPPREQPLPPAPLAMPHIKPPPTTPIPQLPAPQAHKHPPHLSGP-SPFSLNANLPPPPALKPLSSLS 968
Cdd:PHA03247 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTvGSLTSLADPPPPPPTPEPAPHA 2714
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  969 THHPPSAHPPPLQLMPQSQPLPSSPAQP--------PGLTQSQSLPPPAASHPTTGLHQVPSQSPFPQHPfvpGGPPPIT 1040
Cdd:PHA03247 2715 LVSATPLPPGPAAARQASPALPAAPAPPavpagpatPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLT---RPAVASL 2791
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....
gi 568931819 1041 PPSCPPTSTPPAGPSSSSQPPCSAAVSSGGSVPGAPSCPLPAVQ 1084
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQ 2835
PRK11901 PRK11901
hypothetical protein; Reviewed
698-897 1.19e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 42.75  E-value: 1.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  698 SSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQ---LPTQGPTPSATAVP 774
Cdd:PRK11901   55 GSALKSPTEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQdisAPPISPTPTQAAPP 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  775 PQgsPATSQ----PPN------QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSlhSQGPP 844
Cdd:PRK11901  135 QT--PNGQQrielPGNisdalsQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPAT--KKPAV 210
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 568931819  845 GPHSlqTGPLLQHPGPpqpfglPSQPSQGQGPLGPSPAAAHPHSTIQLP-ASQS 897
Cdd:PRK11901  211 NHHK--TATVAVPPAT------SGKPKSGAASARALSSAPASHYTLQLSsASRS 256
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
711-893 1.32e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 1.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  711 TSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQ----GSPATSQPPN 786
Cdd:PRK12323  400 AAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAaagpRPVAAAAAAA 479
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  787 QTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQT---GPLLQHPGPPQP 863
Cdd:PRK12323  480 PARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPApaaAPAPRAAAATEP 559
                         170       180       190
                  ....*....|....*....|....*....|
gi 568931819  864 FGLPSQPSQGQGPLGPSPAAAHPHSTIQLP 893
Cdd:PRK12323  560 VVAPRPPRASASGLPDMFDGDWPALAARLP 589
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
710-787 1.39e-03

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 42.70  E-value: 1.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  710 STSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPSGAASAPSTAPPGT----PQLPTQGPTPSATAVPPQGSPATSQPP 785
Cdd:PRK13042   17 TTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTkveaPQQTPNATTPSSTKVETPQSPTTKQVP 96

                  ..
gi 568931819  786 NQ 787
Cdd:PRK13042   97 TE 98
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
733-863 1.44e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.16  E-value: 1.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  733 MLQAQPP-ALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAhthiqQAPTLHPPRL 811
Cdd:PRK14951  361 LLAFKPAaAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAP-----VAAPAAAAPA 435
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 568931819  812 PSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQP 863
Cdd:PRK14951  436 AAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPA--PAAAPAA 485
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
605-899 1.88e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 43.18  E-value: 1.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  605 DIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKEEAASPLKSTKRQREK-VASDTEDTDRITSKKTKTQEISRPNSPSEG 683
Cdd:PRK14949  482 NSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDtLESNGLDEGDYAQDSAPLDAYQDDYVAFSS 561
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  684 EGESSDSRSVNDEGSSDPKdidQDNRSTSPSIPSPQDNE---SDSDSSAQQQMLQA----------QPPALQAPSGAASA 750
Cdd:PRK14949  562 ESYNALSDDEQHSANVQSA---QSAAEAQPSSQSLSPISavtTAAASLADDDILDAvlaardsllsDLDALSPKEGDGKK 638
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  751 PSTA--PPGTPQLPTQGPTPSATAVPP--QGSPATSQPPNQTQSTV--APAAHTHIQQAPTLHPPRLPSPHPplqPMTAP 824
Cdd:PRK14949  639 SSADrkPKTPPSRAPPASLSKPASSPDasQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDRP---PWEEA 715
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568931819  825 PSQSSAQPHPQPSLHSQGPPGPHSLQTGPLlqHPGPPQPFGLPSQPSQGQGPlGPSPAAAHPHSTIQLPASQSAL 899
Cdd:PRK14949  716 PEVASANDGPNNAAEGNLSESVEDASNSEL--QAVEQQATHQPQVQAEAQSP-ASTTALTQTSSEVQDTELNLVL 787
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
761-898 2.16e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 2.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  761 LPTQGPTPSATAVPPQGSPAtSQPPNQTQSTVAPAAhthiqqaptlhpprlpsphpplqpMTAPPSQSSAQPHPQPSLHS 840
Cdd:PRK07764  385 LGVAGGAGAPAAAAPSAAAA-APAAAPAPAAAAPAA------------------------AAAPAPAAAPQPAPAPAPAP 439
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568931819  841 QGPPGPHSLQTGPLLQHP----GPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQSA 898
Cdd:PRK07764  440 APPSPAGNAPAGGAPSPPpaaaPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPA 501
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
736-825 2.19e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 42.49  E-value: 2.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  736 AQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSAtavPPQGSPATsqPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPH 815
Cdd:PRK14950  366 PQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPK---EPVRETAT--PPPVPPRPVAPPVPHTPESAPKLTRAAIPVDE 440
                          90
                  ....*....|
gi 568931819  816 PPLQPMTAPP 825
Cdd:PRK14950  441 KPKYTPPAPP 450
PHA03269 PHA03269
envelope glycoprotein C; Provisional
745-860 2.68e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 42.41  E-value: 2.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  745 SGAASAPSTAPpgTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTlhpprlPSPHPPLQPMTAP 824
Cdd:PHA03269   17 LIIANLNTNIP--IPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPT------PAASEKFDPAPAP 88
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 568931819  825 PSQSSAQPHPQ--PSLHSQGPPGPHSLQTGPLLQHPGP 860
Cdd:PHA03269   89 HQAASRAPDPAvaPQLAAAPKPDAAEAFTSAAQAHEAP 126
PRK08581 PRK08581
amidase domain-containing protein;
593-855 3.33e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 42.08  E-value: 3.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  593 TSPDGRASPINEDIRSSGRNSPSAASTSsNDSKAETVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITS---KKT 669
Cdd:PRK08581   21 TSPTAYADDPQKDSTAKTTSHDSKKSND-DETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDfiyKNL 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  670 KTQEISRPNSPSEGEGESSDSRSVNDEGSSDpKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPAL----QAPS 745
Cdd:PRK08581  100 PQTNINQLLTKNKYDDNYSLTTLIQNLFNLN-SDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKadnqKAPS 178
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  746 GAASAPST-------APPGTPQLPTQGPTPSATAVPPQGS--------------------PATSQPPNQTQSTVAPAAHT 798
Cdd:PRK08581  179 SNNTKPSTsnkqpnsPKPTQPNQSNSQPASDDTANQKSSSkdnqsmsdsaldsildqyseDAKKTQKDYASQSKKDKTET 258
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 568931819  799 HIQQAPTLHPPRLPSPhpplqpmTAPPSQSSAQPHPQPSLHSQgppgpHSLQTGPLL 855
Cdd:PRK08581  259 SNTKNPQLPTQDELKH-------KSKPAQSFENDVNQSNTRST-----SLFETGPSL 303
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
737-825 4.58e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 4.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  737 QPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPqGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHP 816
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPP-AAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115

                  ....*....
gi 568931819  817 PLQPMTAPP 825
Cdd:PRK12270  116 EVTPLRGAA 124
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
594-837 4.83e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.48  E-value: 4.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   594 SPDGRASPINEDIRSSGRNSPSAASTSSNDSKaeTVKKSAKKVKEEAASPLKSTKRQREKVASDTEDTDRITSKKTKTQE 673
Cdd:pfam17823   22 PADPRHFVLNKMWNGAGKQNASGDAVPRADNK--SSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSE 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   674 isrpNSPSEGEGESSDSRSVNDEGSSDPKDIDQDNRSTSPSIPSpqdnESDSDSSAQQqmlqAQPPALQAPSGAASAPST 753
Cdd:pfam17823  100 ----PATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPS----EAFSAPRAAA----CRANASAAPRAAIAAASA 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   754 APPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPH 833
Cdd:pfam17823  168 PHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVGT 247

                   ....
gi 568931819   834 PQPS 837
Cdd:pfam17823  248 VTPA 251
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
760-1031 4.94e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 41.56  E-value: 4.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   760 QLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPA-----AHTHIQQAPTLhpprlpsphpplQPM-----TAPPSQSS 829
Cdd:pfam09770  105 QQPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVrtgyeKYKEPEPIPDL------------QVDaslwgVAPKKAAA 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   830 AQPHPQPSLHSQGPPGPH----SLQ----------TGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPAS 895
Cdd:pfam09770  173 PAPAPQPAAQPASLPAPSrkmmSLEeveaamraqaKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQ 252
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   896 QsalqpqqppreqplppaplamphikpppttpipqlPAPQAHKHPPHLsgpspfslnanlppppalkplsslsthhppsa 975
Cdd:pfam09770  253 P-----------------------------------QQHPGQGHPVTI-------------------------------- 265
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   976 hpppLQLMPQSQPLPSSPAQPPGLTQSQSLPPPAASHPTTGLH----QVPSQSPFPQHPF 1031
Cdd:pfam09770  266 ----LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILQnpnrLSAARVGYPQNPQ 321
PHA03269 PHA03269
envelope glycoprotein C; Provisional
768-888 5.13e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.25  E-value: 5.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  768 PSATAVPPQGSPATSQPPNQtqstvAPAAHTHIQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPH 847
Cdd:PHA03269   23 NTNIPIPELHTSAATQKPDP-----APAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPD 97
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 568931819  848 SLQTGPLLQHPGPPQPFGLPSQPSQGQGPL---------GPSPAAAHPHS 888
Cdd:PHA03269   98 PAVAPQLAAAPKPDAAEAFTSAAQAHEAPAdagtsaaskKPDPAAHTQHS 147
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
650-782 6.24e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 40.65  E-value: 6.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   650 QREKVASDTEDTDRITSKKTKTQ-EISRPNSPSEGEGESSDSRSVNDEGSSDPKDIDQDN---------RSTSPSIPSPQ 719
Cdd:TIGR00601    9 QQQKFKIDMEPDETVKELKEKIEaEQGKDAYPVAQQKLIYSGKILSDDKTVKEYKIKEKDfvvvmvskpKTGTGKVAPPA 88
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568931819   720 DNESDSDSSAqqqmlqAQPPALQAPSGAASAPSTAPPGTP---QLPTQGPTPSATAVPPQGSPATS 782
Cdd:TIGR00601   89 ATPTSAPTPT------PSPPASPASGMSAAPASAVEEKSPseeSATATAPESPSTSVPSSGSDAAS 148
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
727-884 7.25e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.99  E-value: 7.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  727 SSAQQQMLQAQPPALQAPSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHThiqqaptl 806
Cdd:PRK07003  413 KAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDA-------- 484
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568931819  807 hpprlpsphppLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAA 884
Cdd:PRK07003  485 -----------PPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAGGAAA 551
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
716-896 7.33e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 41.15  E-value: 7.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   716 PSPQDNESDSDSSAQQQML-QAQPPALQAPSGAAS-------APSTAPPGTPQLPTQGPTPSATAVPPQGSPA------- 780
Cdd:pfam09606  229 MNPQQMGGAPNQVAMQQQQpQQQGQQSQLGMGINQmqqmpqgVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGdqnnyqq 308
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819   781 -TSQPPNQTQSTVAPAAHTH-----IQQAPTLHPPRLPSPHPPLQPMTAPPSQSSAQPHPQPSLHSQGPPGPHSL--QTG 852
Cdd:pfam09606  309 qQTRQQQQQQGGNHPAAHQQqmnqsVGQGGQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQvrQVT 388
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....
gi 568931819   853 PLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHSTIQLPASQ 896
Cdd:pfam09606  389 PNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQ 432
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
724-829 7.64e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.91  E-value: 7.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  724 DSDSSAQQQMLQAQPPALQAPSgAASAPSTAPPGTPQLPTQGPTPSatavPPQGSPATSQPPNQTQSTVAPAAHTHIQQA 803
Cdd:PRK14971  366 GDDASGGRGPKQHIKPVFTQPA-AAPQPSAAAAASPSPSQSSAAAQ----PSAPQSATQPAGTPPTVSVDPPAAVPVNPP 440
                          90       100
                  ....*....|....*....|....*.
gi 568931819  804 PTLHPPRLPSPHPPLQPMtaPPSQSS 829
Cdd:PRK14971  441 STAPQAVRPAQFKEEKKI--PVSKVS 464
PRK10856 PRK10856
cytoskeleton protein RodZ;
727-830 7.74e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 40.39  E-value: 7.74e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  727 SSAQ--QQMLQAQPPALQAPSGAASAPSTAPPGTPQlPTQGPTPSATAVP-PQGSPATSQPPNQTQSTVAPAAHThiQQA 803
Cdd:PRK10856  150 SSAElsQNSGQSVPLDTSTTTDPATTPAPAAPVDTT-PTNSQTPAVATAPaPAVDPQQNAVVAPSQANVDTAATP--APA 226
                          90       100
                  ....*....|....*....|....*..
gi 568931819  804 PTLHPPRLPSPHPPLQPMTAPPSQSSA 830
Cdd:PRK10856  227 APATPDGAAPLPTDQAGVSTPAADPNA 253
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
744-889 8.18e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 8.18e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  744 PSGAASAPSTAPPGTPQLPTQGPTPSATAVPPQGSPATSQPPNQTQSTVAPAAHTHIQQAPTLHPPRLPSphpplqpmta 823
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQ---------- 430
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568931819  824 PPSQSSAQPHPQPSLHSQGPPGPHSLQTGPLLQHPGPPQPFGLPSQPSQGQGPLGPSPAAAHPHST 889
Cdd:PRK07994  431 RAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVAT 496
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
536-708 8.72e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 40.80  E-value: 8.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  536 FKKYGELPPIEKPVDPPPFMFKPVKEEDDglSGKHSMRTRRSRGSMSTLRSGRKKQPTSPDGRASPINEDIRSSGRNSPS 615
Cdd:PTZ00108 1223 SDQEDDEEQKTKPKKSSVKRLKSKKNNSS--KSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGS 1300
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568931819  616 AASTSSNDSKAETVKKSAKKVKEeaasPLKSTKRQREKVASDTEDTDRITSKKTKTQEISRPNSPSEGEgESSDSRSVND 695
Cdd:PTZ00108 1301 KPSSPTKKKVKKRLEGSLAALKK----KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSS-EDDDDSEVDD 1375
                         170
                  ....*....|...
gi 568931819  696 EGSSDPKDIDQDN 708
Cdd:PTZ00108 1376 SEDEDDEDDEDDD 1388
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH