NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|119608631|gb|EAW88225|]
View 

small nuclear RNA activating complex, polypeptide 4, 190kDa, isoform CRA_a [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PHA03247 super family cl33720
large tegument protein UL36; Provisional
817-1215 1.91e-14

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.21  E-value: 1.91e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  817 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 896
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  897 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 970
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  971 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1050
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1051 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAGLLATLLPPLTEtRAAQGPR 1126
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTESFALPPDQPERP-PQPQAPP 2915
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1127 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                  ....*....
gi 119608631 1207 PAFGGVIPA 1215
Cdd:PHA03247 2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.00e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.00e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 119608631    401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.30e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


:

Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.93  E-value: 2.30e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 119608631   297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
PLN03091 super family cl33633
hypothetical protein; Provisional
399-502 2.69e-08

hypothetical protein; Provisional


The actual alignment was detected with superfamily member PLN03091:

Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.06  E-value: 2.69e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100
                  ....*....|....*....|....*....
gi 119608631  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.44e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


:

Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.44e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 119608631    346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT super family cl21498
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
262-305 1.36e-04

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


The actual alignment was detected with superfamily member pfam13921:

Pssm-ID: 473887 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.36e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 119608631   262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
817-1215 1.91e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.21  E-value: 1.91e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  817 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 896
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  897 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 970
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  971 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1050
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1051 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAGLLATLLPPLTEtRAAQGPR 1126
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTESFALPPDQPERP-PQPQAPP 2915
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1127 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                  ....*....
gi 119608631 1207 PAFGGVIPA 1215
Cdd:PHA03247 2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.00e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.00e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 119608631    401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
404-447 1.15e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 66.44  E-value: 1.15e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 119608631  404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167     2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
401-447 2.37e-12

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 62.52  E-value: 2.37e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 119608631   401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.30e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.93  E-value: 2.30e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 119608631   297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
PLN03091 PLN03091
hypothetical protein; Provisional
399-502 2.69e-08

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.06  E-value: 2.69e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100
                  ....*....|....*....|....*....
gi 119608631  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
294-342 2.73e-08

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 51.46  E-value: 2.73e-08
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 119608631    294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.44e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.44e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 119608631    346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
290-339 1.64e-07

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 49.23  E-value: 1.64e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 119608631  290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
256-357 2.99e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.79  E-value: 2.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147    29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
                          90       100
                  ....*....|....*....|....*...
gi 119608631  330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147   106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
836-1230 4.44e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.20  E-value: 4.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   836 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 908
Cdd:pfam17823   11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   909 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 982
Cdd:pfam17823   90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   983 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1062
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1063 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1142
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1143 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSShadPPEAEPPWSGRLPAfggVIPATEPRGTP 1222
Cdd:pfam17823  322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSM---IPEVEATSPTTQPS---PLLPTQGAAGP 395

                   ....*...
gi 119608631  1223 GSPSGTQE 1230
Cdd:pfam17823  396 GILLAPEQ 403
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
334-453 5.18e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.02  E-value: 5.18e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147     6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 119608631  406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147    77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
349-412 3.31e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 45.76  E-value: 3.31e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119608631   349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
470-496 4.31e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 44.87  E-value: 4.31e-06
                          10        20
                  ....*....|....*....|....*..
gi 119608631  470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167    16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
470-496 5.70e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 44.52  E-value: 5.70e-06
                            10        20
                    ....*....|....*....|....*..
gi 119608631    470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:smart00717   18 KYGKNNWEKIAKELPGRTAEQCRERWR 44
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
399-495 4.68e-05

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 47.86  E-value: 4.68e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                          90
                  ....*....|....*..
gi 119608631  479 IASELPHRSGSQCLSKW 495
Cdd:COG5147    97 IADYKDRRTAQQCVERY 113
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
262-305 1.36e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.36e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 119608631   262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
193-359 1.91e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618  220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618  298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
                          170
                   ....*....|....*
gi 119608631   349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618  370 iscqQHTLTQHIHTL 384
 
Name Accession Description Interval E-value
PHA03247 PHA03247
large tegument protein UL36; Provisional
817-1215 1.91e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.21  E-value: 1.91e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  817 PRLPQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPKPKTVSEL 896
Cdd:PHA03247 2616 PLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  897 LQ------EKRLQEARAREATRGpvvLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAakpgTSGSwq 970
Cdd:PHA03247 2696 TSladpppPPPTPEPAPHALVSA---TPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT----TAGP-- 2766
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  971 EAGTSAKDKRLSTMQALPLAPVFSEAEgTAPAASQAPALGPGQISVSCPESGLGQSQAPAAsrkqGLPEAPPFLPAAPSP 1050
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSE-SRESLPSPWDPADPPAAVLAPAAALPPAASPAG----PLPPPTSAQPTAPPP 2841
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1051 TPLPVQPlSLTHIGGphVATSVPlpVTWVLTAQGLLPVPV----PAVVSLPRPAGTPGPAGLLATLLPPLTEtRAAQGPR 1126
Cdd:PHA03247 2842 PPGPPPP-SLPLGGS--VAPGGD--VRRRPPSRSPAAKPAaparPPVRRLARPAVSRSTESFALPPDQPERP-PQPQAPP 2915
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1127 APALSSSWQPPANMNREPEPSCRTDTPAPPThALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PHA03247 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT-TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTP 2994

                  ....*....
gi 119608631 1207 PAFGGVIPA 1215
Cdd:PHA03247 2995 PLTGHSLSR 3003
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
401-448 4.00e-14

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 67.63  E-value: 4.00e-14
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 119608631    401 KGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLH 448
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLK 48
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
404-447 1.15e-13

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 66.44  E-value: 1.15e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 119608631  404 WAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:cd00167     2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
PHA03247 PHA03247
large tegument protein UL36; Provisional
812-1232 3.54e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.97  E-value: 3.54e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  812 RKALPPRLPQAGARD--PPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRVERTLPQASLLASTGPRPK 889
Cdd:PHA03247 2572 RPAPRPSEPAVTSRArrPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPER 2651
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  890 PKTVSELLQEKRLQEAR-------AREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAA 961
Cdd:PHA03247 2652 PRDDPAPGRVSRPRRARrlgraaqASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPpGPAAARQA 2731
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  962 KPGTSGSWQEAGTSAKdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLgqSQAPAASRKQGLPEAP 1041
Cdd:PHA03247 2732 SPALPAAPAPPAVPAG----------PATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL--TRPAVASLSESRESLP 2799
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1042 pfLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSlprPAGTPGPAGLLATLLPPLTETRA 1121
Cdd:PHA03247 2800 --SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP---LGGSVAPGGDVRRRPPSRSPAAK 2874
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1122 AQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAdgsvafvPGEAQVAREIPEPRTSSHADPPEAEPP 1201
Cdd:PHA03247 2875 PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ-------PQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
                         410       420       430
                  ....*....|....*....|....*....|.
gi 119608631 1202 WSGRLPAFGGVIPAtePRGTPGSPSGTQEPR 1232
Cdd:PHA03247 2948 DPAGAGEPSGAVPQ--PWLGALVPGRVAVPR 2976
PHA03247 PHA03247
large tegument protein UL36; Provisional
812-1307 8.42e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.82  E-value: 8.42e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  812 RKALPPRLPQA--GARDPPvHLLQASSSAQSTPGHLFPNVPAQEASKSASH-------KGSRRLASSRV---ERTLPQAS 879
Cdd:PHA03247 2481 RRPAEARFPFAagAAPDPG-GGGPPDPDAPPAPSRLAPAILPDEPVGEPVHprmltwiRGLEELASDDAgdpPPPLPPAA 2559
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  880 LLASTG-----PRPKPKTVSELLQEKRLQEARAREATRG--PVVLPSQLLVSSSVILQPPLPHTPHgRPAPGPTVLNVPL 952
Cdd:PHA03247 2560 PPAAPDrsvppPRPAPRPSEPAVTSRARRPDAPPQSARPraPVDDRGDPRGPAPPSPLPPDTHAPD-PPPPSPSPAANEP 2638
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  953 SGPGaPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASqaPALGPGQISVSCPESGLGQSQAP-AA 1031
Cdd:PHA03247 2639 DPHP-PPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAAR--PTVGSLTSLADPPPPPPTPEPAPhAL 2715
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1032 SRKQGLPEAP-------PFLPAAPSPTPLPVQPLSlthIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVvSLPRPAGTP- 1103
Cdd:PHA03247 2716 VSATPLPPGPaaarqasPALPAAPAPPAVPAGPAT---PGGPARPARPPTTAGPPAPAPPAAPAAGPPR-RLTRPAVASl 2791
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1104 GPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANmnrEPEPSCRTDTPAPPTHALSQSPAEADGSVAfvPGeAQVARE 1183
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP---LPPPTSAQPTAPPPPPGPPPPSLPLGGSVA--PG-GDVRRR 2865
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1184 IP------------EPRTSSHADPPEAEPPWSGRLPAFGgviPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGA 1251
Cdd:PHA03247 2866 PPsrspaakpaapaRPPVRRLARPAVSRSTESFALPPDQ---PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 119608631 1252 LDLEKPPLPQPGPEKGALDlgllsqegeaatqQWLGGQRGVRVPLLGSRLPYQPPA 1307
Cdd:PHA03247 2943 LAPTTDPAGAGEPSGAVPQ-------------PWLGALVPGRVAVPRFRVPQPAPS 2985
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
401-447 2.37e-12

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 62.52  E-value: 2.37e-12
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 119608631   401 KGYWAPEEDAKLLQAVAKYGEqDWFKIREEVPGRSDAQCRDRYLRRL 447
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPGRTDNQCKNRWQNYL 46
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
404-457 3.28e-11

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 60.02  E-value: 3.28e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 119608631   404 WAPEEDAKLLQAVAKYGeQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWN 457
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWS 53
PHA03378 PHA03378
EBNA-3B; Provisional
901-1267 5.12e-10

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 64.32  E-value: 5.12e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  901 RLQEARAREATRGPVVL----PSQLLVSSSVILQPPLPHTPHgRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSA 976
Cdd:PHA03378  437 RTEQPRATPHSQAPTVVlhrpPTQPLEGPTGPLSVQAPLEPW-QPLPHPQVTPVILHQPPAQGVQAHGSMLDLLEKDDED 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  977 KDKRLSTMQALPLAP----------VFSE---AEGTAPAASQA------PALGPGQISV-------------SCPESGLG 1024
Cdd:PHA03378  516 MEQRVMATLLPPSPPqpragrrapcVYTEdldIESDEPASTEPvhdqllPAPGLGPLQIqpltspttsqlasSAPSYAQT 595
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1025 QSQAPAASRKQGLPEAPPFLPA--APSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGT 1102
Cdd:PHA03378  596 PWPVPHPSQTPEPPTTQSHIPEtsAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQ 675
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1103 PGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSV---AFVPGEAQ 1179
Cdd:PHA03378  676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRArppAAAPGRAR 755
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1180 VAREIPEPRTSSHADPPEAEPpwsgRLPAFGGVIPATEPRgtpGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPL 1259
Cdd:PHA03378  756 PPAAAPGRARPPAAAPGAPTP----QPPPQAPPAPQQRPR---GAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQL 828

                  ....*...
gi 119608631 1260 PQPGPEKG 1267
Cdd:PHA03378  829 LTGGVKRG 836
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
297-357 2.30e-08

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 51.93  E-value: 2.30e-08
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 119608631   297 WSREEEERLQAIAAAHGhLEWQKIAEELGtSRSAFQCLQKFQQHNKA-LKRKEWTEEEDRML 357
Cdd:pfam13921    1 WTEEEDEKLLKLVEKYG-NDWKQIAKELG-RRTPKQCFDRWRRKLNPkISRGPWSKEEDQRL 60
PLN03091 PLN03091
hypothetical protein; Provisional
399-502 2.69e-08

hypothetical protein; Provisional


Pssm-ID: 215570 [Multi-domain]  Cd Length: 459  Bit Score: 58.06  E-value: 2.69e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  399 LKKGYWAPEEDAKLLQAVAKYGEQDWfkirEEVPGRSDAQ-----CRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGv 473
Cdd:PLN03091   12 LRKGLWSPEEDEKLLRHITKYGHGCW----SSVPKQAGLQrcgksCRLRWINYLRPDLKRGTFSQQEENLIIELHAVLG- 86
                          90       100
                  ....*....|....*....|....*....
gi 119608631  474 GHWAKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03091   87 NRWSQIAAQLPGRTDNEIKNLWNSCLKKK 115
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
294-342 2.73e-08

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 51.46  E-value: 2.73e-08
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*....
gi 119608631    294 KQEWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQHNK 342
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNNWEKIAKELPG-RTAEQCRERWRNLLK 48
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
814-1200 5.24e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.69  E-value: 5.24e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  814 ALPPRLPQAGARDPPVhlLQASSSAQSTPghlfPNVPAQEASKSASHKGSRRLASSrvertlPQASLLASTGPRPKPKTV 893
Cdd:PRK07764  380 RLERRLGVAGGAGAPA--AAAPSAAAAAP----AAAPAPAAAAPAAAAAPAPAAAP------QPAPAPAPAPAPPSPAGN 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  894 SELLQEKRLQEARAREATRGPVVLPSQllvssSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPgtSGSWQEAG 973
Cdd:PRK07764  448 APAGGAPSPPPAAAPSAQPAPAPAAAP-----EPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATL--RERWPEIL 520
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  974 TSAKDKRLSTMQAL---------------------PLAPVFSEAE-----------------------GTAPAASQAPAL 1009
Cdd:PRK07764  521 AAVPKRSRKTWAILlpeatvlgvrgdtlvlgfstgGLARRFASPGnaevlvtalaeelggdwqveavvGPAPGAAGGEGP 600
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1010 GPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPV- 1088
Cdd:PRK07764  601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAa 680
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1089 PVPAVVSLPRPAGTPGPAGLLATLLPPlteTRAAQGPRAPAlsSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEAD 1168
Cdd:PRK07764  681 PPPAPAPAAPAAPAGAAPAQPAPAPAA---TPPAGQADDPA--AQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP 755
                         410       420       430
                  ....*....|....*....|....*....|..
gi 119608631 1169 GSVAFVPGEAQVAREIPEPRTSSHADPPEAEP 1200
Cdd:PRK07764  756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAE 787
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
346-397 1.44e-07

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 49.14  E-value: 1.44e-07
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|..
gi 119608631    346 RKEWTEEEDRMLTQLVQEMRVGShipYRRIVYYMEGRDSMQLIYRWTKSLDP 397
Cdd:smart00717    1 KGEWTEEEDELLIELVKKYGKNN---WEKIAKELPGRTAEQCRERWRNLLKP 49
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
290-339 1.64e-07

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 49.23  E-value: 1.64e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 119608631  290 PSINKQEWSREEEERLQAIaAAHGHLEWQKIAEELGtsRSAFQCLQKFQQ 339
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHL-AKLLPTQWRTIAPIVG--RTAQQCLERYNK 47
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
820-1227 2.99e-07

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 55.56  E-value: 2.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  820 PQAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLASSRvERTLPQASLLASTGPRPKPKTVSELLQE 899
Cdd:PHA03307   24 PPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPG-PGTEAPANESRSTPTWSLSTLAPASPAR 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  900 KRLQEARAREATRGPVVLPSQLLVSSSVilQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDK 979
Cdd:PHA03307  103 EGSPTPPGPSSPDPPPPTPPPASPPPSP--APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  980 RLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQA---------PAASRKQGLPEAPPFLPAAPSP 1050
Cdd:PHA03307  181 ETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAAddagasssdSSSSESSGCGWGPENECPLPRP 260
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1051 TPLPVQPLSLTHIgGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP-RAPA 1129
Cdd:PHA03307  261 APITLPTRIWEAS-GWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSEsSRGA 339
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1130 LSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLPAF 1209
Cdd:PHA03307  340 AVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDA 419
                         410
                  ....*....|....*...
gi 119608631 1210 GGVIPATEPRGTPGSPSG 1227
Cdd:PHA03307  420 GAASGAFYARYPLLTPSG 437
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
256-357 2.99e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.79  E-value: 2.99e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  256 NRLDShDWEKISNINFE------GSRSAEEIRKFWQNSEHPSINKQEWSREEEERLQAIAAAHGHLeWQKIAEELGtSRS 329
Cdd:COG5147    29 EDLKA-LVKKLGPNNWSkvasllISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGTQ-WSTIADYKD-RRT 105
                          90       100
                  ....*....|....*....|....*...
gi 119608631  330 AFQCLQKFQQHNKALKRKEWTEEEDRML 357
Cdd:COG5147   106 AQQCVERYVNTLEDLSSTHDSKLQRRNE 133
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
296-340 3.14e-07

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 47.96  E-value: 3.14e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 119608631  296 EWSREEEERLQAIAAAHGHLEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:cd00167     1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPG-RTPKQCRERWRNL 44
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
836-1230 4.44e-07

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 54.20  E-value: 4.44e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   836 SSAQSTPGHLFPNVPAQ-EASKSASHKGSRRLASSRVER----TLPQASLLASTGPrPKPKTVSELLQEKRLQEAR--AR 908
Cdd:pfam17823   11 FSLPLSESHAAPADPRHfVLNKMWNGAGKQNASGDAVPRadnkSSEQ*NFCAATAA-PAPVTLTKGTSAAHLNSTEvtAE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   909 EATRG-----PVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLS-GPGAPAAAKPGTSGSwqeAGTSAKDKRLS 982
Cdd:pfam17823   90 HTPHGtdlsePATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSeAFSAPRAAACRANAS---AAPRAAIAAAS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   983 TMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1062
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1063 IGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETraaQGPRAPAlsSSWQPPANMNR 1142
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQA---QGPIIQV--STDQPVHNTAG 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1143 EPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSShadPPEAEPPWSGRLPAfggVIPATEPRGTP 1222
Cdd:pfam17823  322 EPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSM---IPEVEATSPTTQPS---PLLPTQGAAGP 395

                   ....*...
gi 119608631  1223 GSPSGTQE 1230
Cdd:pfam17823  396 GILLAPEQ 403
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
930-1264 5.02e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.61  E-value: 5.02e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  930 QPPLPHTPHGRPAPGPtvlnvplSGPGAPAAAKPGTSGSWQEAGT-SAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPA 1008
Cdd:PRK07764  397 AAPSAAAAAPAAAPAP-------AAAAPAAAAAPAPAAAPQPAPApAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA 469
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1009 LGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAA---------------------------PSPTPLPVQP--LS 1059
Cdd:PRK07764  470 PAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDaatlrerwpeilaavpkrsrktwaillPEATVLGVRGdtLV 549
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1060 LTH--------IGGPHVATSVplpVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALS 1131
Cdd:PRK07764  550 LGFstgglarrFASPGNAEVL---VT-ALAEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPA 625
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1132 SSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEA-----DGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRL 1206
Cdd:PRK07764  626 APAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDasdggDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 119608631 1207 PAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGP 1264
Cdd:PRK07764  706 AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
876-1248 5.05e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.77  E-value: 5.05e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   876 PQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLpsqllVSSSVILQP---PLPHTP-HGRPAPGPTVLNVP 951
Cdd:pfam03154  189 PGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTL-----IQQTPTLHPqrlPSPHPPlQPMTQPPPPSQVSP 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   952 LSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTmQALPLAPVFSEAEGTAPAASQAPalgpgqisvscpesglGQSQApaa 1031
Cdd:pfam03154  264 QPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPP-QPFPLTPQSSQSQVPPGPSPAAP----------------GQSQQ--- 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1032 srkqgLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQG-LLPVPVPAVVSLPRPAGTPGPAGLLA 1110
Cdd:pfam03154  324 -----RIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNLPPPPALKP 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1111 TLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPA-----PPTHALSQSPAEAD-GSVAFVPGEAQVAREI 1184
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPpaashPPTSGLHQVPSQSPfPQHPFVPGGPPPITPP 478
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1185 PEPRTSSHADPPEAEPPWSGRlPAFGGVIPATEPRGTPG------SPSGTQEPRGPlgleKLPLRQPGPE 1248
Cdd:pfam03154  479 SGPPTSTSSAMPGIQPPSSAS-VSSSGPVPAAVSCPLPPvqikeeALDEAEEPESP----PPPPRSPSPE 543
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
334-453 5.18e-07

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 54.02  E-value: 5.18e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  334 LQKFQQHNKALKRKE--WTEEEDRMLTQLVQEM------RVGSHIPYRrivyyMEGRDSMqliyRWTKSLDPGLKKGYWA 405
Cdd:COG5147     6 NKELQIKLMQTKRKGgsWKRTEDEDLKALVKKLgpnnwsKVASLLISS-----TGKQSSN----RWNNHLNPQLKKKNWS 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 119608631  406 PEEDAKLLQAVAKYGEQdWFKIREEVPGRSDAQCRDRYLRRLHFSLKK 453
Cdd:COG5147    77 EEEDEQLIDLDKELGTQ-WSTIADYKDRRTAQQCVERYVNTLEDLSST 123
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
878-1234 1.52e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 53.25  E-value: 1.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  878 ASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVIlqPPLPHTPHGRPAPGPTVLNVPLSGPGA 957
Cdd:PHA03307   57 AGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPT--PPGPSSPDPPPPTPPPASPPPSPAPDL 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  958 PAAAKPGTSGSwqeagtsakdKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESG---LGQSQAPAASRK 1034
Cdd:PHA03307  135 SEMLRPVGSPG----------PPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPpaePPPSTPPAAASP 204
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1035 QGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTwVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLP 1114
Cdd:PHA03307  205 RPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC-GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG 283
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1115 PLTETRAAQGPRAPALSSSwqppanmnrepepSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHAD 1194
Cdd:PHA03307  284 PASSSSSPRERSPSPSPSS-------------PGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRS 350
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|
gi 119608631 1195 PPEAEPPwsgrlPAFGGVIPATEPRGTPGSPSGTQEPRGP 1234
Cdd:PHA03307  351 PSPSRPP-----PPADPSSPRKRPRPSRAPSSPAASAGRP 385
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
349-412 3.31e-06

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 45.76  E-value: 3.31e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119608631   349 WTEEEDRMLTQLVQEMrvgsHIPYRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAKL 412
Cdd:pfam13921    1 WTEEEDEKLLKLVEKY----GNDWKQIAKELGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
960-1170 4.24e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 51.42  E-value: 4.24e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  960 AAKPGTSGSWQEAGTSAKDKRLSTMQAL----PLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQ 1035
Cdd:PRK12323  362 AFRPGQSGGGAGPATAAAAPVAQPAPAAaapaAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASA 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1036 GLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGP----AGLLAT 1111
Cdd:PRK12323  442 RGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPaqpdAAPAGW 521
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 119608631 1112 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGS 1170
Cdd:PRK12323  522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
470-496 4.31e-06

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 44.87  E-value: 4.31e-06
                          10        20
                  ....*....|....*....|....*..
gi 119608631  470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:cd00167    16 KYGKNNWEKIAKELPGRTPKQCRERWR 42
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1000-1270 4.96e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1000 APAASQAPAlGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVAtsvPLPVTWV 1079
Cdd:PRK07003  361 AVTGGGAPG-GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAA---PAPPATA 436
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1080 LTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSswqppanmnREPEPSCrtdtpAPPTHA 1159
Cdd:PRK07003  437 DRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA---------FEPAPRA-----AAPSAA 502
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1160 LSQSPAEADGSVAFVPGEAQVAREIPEPRTSShADPPEAEPPWSGrlpafGGVIPATEPRGTPGSPSGTQEPRGPLGLEK 1239
Cdd:PRK07003  503 TPAAVPDARAPAAASREDAPAAAAPPAPEARP-PTPAAAAPAARA-----GGAAAALDVLRNAGMRVSSDRGARAAAAAK 576
                         250       260       270
                  ....*....|....*....|....*....|.
gi 119608631 1240 LPLRQPGPEKGALDLEKPPLPQPGPEKGALD 1270
Cdd:PRK07003  577 PAAAPAAAPKPAAPRVAVQVPTPRARAATGD 607
SANT smart00717
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;
470-496 5.70e-06

SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 44.52  E-value: 5.70e-06
                            10        20
                    ....*....|....*....|....*..
gi 119608631    470 KYGVGHWAKIASELPHRSGSQCLSKWK 496
Cdd:smart00717   18 KYGKNNWEKIAKELPGRTAEQCRERWR 44
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
987-1215 8.81e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 8.81e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  987 LPLAPVFSEAEGTAPAASQAPALGPG----QISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTH 1062
Cdd:PRK12323  361 LAFRPGQSGGGAGPATAAAAPVAQPApaaaAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQAS 440
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1063 IGGPhVATSVPLPVtwvltaqgllPVPVPAVVSLPRPAGTPGPAglLATLLPPLTETRAAQGPRAPALSSSWQPPANMNR 1142
Cdd:PRK12323  441 ARGP-GGAPAPAPA----------PAAAPAAAARPAAAGPRPVA--AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 119608631 1143 EPEPSCRTDTPAPPTHALSQSPAEADGSVAF-VPGEAQVAREIPEPRTSSHADPPEAEPPWS--GRLPAFGGVIPA 1215
Cdd:PRK12323  508 SPAPAQPDAAPAGWVAESIPDPATADPDDAFeTLAPAPAAAPAPRAAAATEPVVAPRPPRASasGLPDMFDGDWPA 583
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
931-1138 9.92e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 50.26  E-value: 9.92e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  931 PPLPHTPHGRPAPG---PTVLNVPLSGPGAPAAAKPGTSGSWQEAGtSAKDKRLSTMQALPLApvfSEAEGTAPAASQAP 1007
Cdd:PRK12323  375 ATAAAAPVAQPAPAaaaPAAAAPAPAAPPAAPAAAPAAAAAARAVA-AAPARRSPAPEALAAA---RQASARGPGGAPAP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1008 ALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAqglLP 1087
Cdd:PRK12323  451 APAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAES---IP 527
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 119608631 1088 VPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAqgPRAPALSSSWQPPA 1138
Cdd:PRK12323  528 DPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA--PRPPRASASGLPDM 576
PHA03247 PHA03247
large tegument protein UL36; Provisional
930-1272 1.69e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.94  E-value: 1.69e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  930 QPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGT---SAKDKRLSTMQALPLAPVFSEaegtaPAASQA 1006
Cdd:PHA03247 2414 QPDPPGPPDVRFVGSEEIEELPFVSPGGDVLAGLAADGDPFFARTilgAPFSLSLLLGELFPGAPVYRR-----PAEARF 2488
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1007 P-ALGPGqisvscPESGLGQSQAPAASRKQGLPeAPPFLPAAPSPTPLPVQPLSLTH---------IGGPhvatSVPLPv 1076
Cdd:PHA03247 2489 PfAAGAA------PDPGGGGPPDPDAPPAPSRL-APAILPDEPVGEPVHPRMLTWIRgleelasddAGDP----PPPLP- 2556
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1077 twvltaqgllPVPVPAVV--SLPRPAGTPGPAGLLAtllpplteTRAAQGPRAPALSSSWQPPANmNREPEPSCRTDTPA 1154
Cdd:PHA03247 2557 ----------PAAPPAAPdrSVPPPRPAPRPSEPAV--------TSRARRPDAPPQSARPRAPVD-DRGDPRGPAPPSPL 2617
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1155 PPTHALSQSPAEADGSVAFVPGEAQVAREIPEPRTSSHADPPEAEPPWSGRLP--AFGGVIPATEPR------------- 1219
Cdd:PHA03247 2618 PPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLgrAAQASSPPQRPRrraarptvgslts 2697
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 119608631 1220 -GTPGSPSGTQEPRGPLGLEKLPLrQPGPEKGALDLEKPPL---PQPGPEKGALDLG 1272
Cdd:PHA03247 2698 lADPPPPPPTPEPAPHALVSATPL-PPGPAAARQASPALPAapaPPAVPAGPATPGG 2753
Myb_DNA-binding pfam00249
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
294-340 2.30e-05

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 459731 [Multi-domain]  Cd Length: 46  Bit Score: 42.88  E-value: 2.30e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 119608631   294 KQEWSREEEERLQAIAAAHGHlEWQKIAEELGTsRSAFQCLQKFQQH 340
Cdd:pfam00249    1 RGPWTPEEDELLLEAVEKLGN-RWKKIAKLLPG-RTDNQCKNRWQNY 45
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
903-1202 3.29e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.69  E-value: 3.29e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  903 QEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEaGTSAKDKRLS 982
Cdd:PRK07003  372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGD-DAADGDAPVP 450
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  983 TMQALPLAPVFSEAEGTA-PAASQAPALGPGqisvscpesglgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1061
Cdd:PRK07003  451 AKANARASADSRCDERDAqPPADSGSASAPA-------------SDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAAS 517
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1062 HIGGPHVAtSVPLPvtwvltaqgLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMN 1141
Cdd:PRK07003  518 REDAPAAA-APPAP---------EARPPTPAAAAPAARAGGAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKP 587
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 119608631 1142 REPEPSCRTDTPAPPTHALSQSPAEAdgsvafvpgeaqvareipePRTSSHADPPEAEPPW 1202
Cdd:PRK07003  588 AAPRVAVQVPTPRARAATGDAPPNGA-------------------ARAEQAAESRGAPPPW 629
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1064-1302 3.90e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.33  E-value: 3.90e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1064 GGPHVATSVPLPVTWVLtaqgllPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPANMNRE 1143
Cdd:PRK12323  370 GGAGPATAAAAPVAQPA------PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1144 PEPSCRTDTPAPPTHALSQSPAEADgsvafvpgeaqvareiPEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPG 1223
Cdd:PRK12323  444 PGGAPAPAPAPAAAPAAAARPAAAG----------------PRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFA 507
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1224 SPSGTQEPRGPLGLEKLPLRQPG---PEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSR 1300
Cdd:PRK12323  508 SPAPAQPDAAPAGWVAESIPDPAtadPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAAR 587

                  ..
gi 119608631 1301 LP 1302
Cdd:PRK12323  588 LP 589
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
399-495 4.68e-05

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 47.86  E-value: 4.68e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  399 LKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVPGRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGVgHWAK 478
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLLISSTGKQSSNRWNNHLNPQLKKKNWSEEEDEQLIDLDKELGT-QWST 96
                          90
                  ....*....|....*..
gi 119608631  479 IASELPHRSGSQCLSKW 495
Cdd:COG5147    97 IADYKDRRTAQQCVERY 113
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
931-1216 4.74e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.99  E-value: 4.74e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   931 PPLPHTPHGRPAP---GPTVLNVPLSGP---GAPAAAKPGT-SGSWQEAGTSAKDKRLSTmqalplaPVFSEAEGTAPAA 1003
Cdd:pfam05109  449 PSSTHVPTNLTAPastGPTVSTADVTSPtpaGTTSGASPVTpSPSPRDNGTESKAPDMTS-------PTSAVTTPTPNAT 521
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1004 SQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQP-LSLThigGPHVATSVPLPVTWVLTA 1082
Cdd:pfam05109  522 SPTPAVTTPTPNATSPTLG---KTSPTSAVTTPTPNATSPTPAVTTPTPNATIPtLGKT---SPTSAVTTPTPNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1083 QGLLP------------VPVPAVVSLPRPAGTPGPAGLLATLLppltETRAAQGPRAPALSSSWQPPANMNR-------- 1142
Cdd:pfam05109  596 GETSPqanttnhtlggtSSTPVVTSPPKNATSAVTTGQHNITS----SSTSSMSLRPSSISETLSPSTSDNStshmpllt 671
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  1143 EPEPS-----------------CRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPePRTSSHADPPEAEPPWSGR 1205
Cdd:pfam05109  672 SAHPTggenitqvtpaststhhVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTP-PKNATSPQAPSGQKTAVPT 750
                          330
                   ....*....|.
gi 119608631  1206 LPAFGGVIPAT 1216
Cdd:pfam05109  751 VTSTGGKANST 761
SANT_CDC5_II cd11659
SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, ...
397-443 5.69e-05

SANT/myb-like DNA-binding domain of Cell Division Cycle 5-Like Protein repeat II; In humans, cell division cycle 5-like protein (CDC5) functions in pre-mRNA splicing in cell cycle control. The DNA-binding, myb-like domain of CDC5 is a member of the SANT/myb group. SANT is named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. The SANT domain resembles the 3 alpha-helix bundle of DNA-binding Myb domains and is found in a diverse set of proteins.


Pssm-ID: 212557 [Multi-domain]  Cd Length: 53  Bit Score: 41.91  E-value: 5.69e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 119608631  397 PGLKKGYWAPEEDAKLLQAVAKYGEQdWFKIREEVpGRSDAQCRDRY 443
Cdd:cd11659     1 PSIKKTEWTREEDEKLLHLAKLLPTQ-WRTIAPIV-GRTAQQCLERY 45
PHA03247 PHA03247
large tegument protein UL36; Provisional
559-1059 6.24e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.01  E-value: 6.24e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  559 LLSPQYMVPDMDLWVPARQSTSQPWRGGAGAWLGGPAAslsPPKGSSASQGGSKEASTTAAAPgeetsPVQVPARAHGPV 638
Cdd:PHA03247 2554 PLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDA---PPQSARPRAPVDDRGDPRGPAP-----PSPLPPDTHAPD 2625
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  639 PRSAQASHSADTRPAGAEKQALEGGRRLLTVPVETVLRVLRANTAARSCTQKEQLRQPPLPTSSPGVSSGDSVARSHVQw 718
Cdd:PHA03247 2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP- 2704
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  719 lrHRATQSGQRRWRHALHRRLLNRRLLLAVTPWVGDVVVPCTQASqrPAVVQTQADGLREQLQQARLASTPvftlftqlf 798
Cdd:PHA03247 2705 --PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG--PATPGGPARPARPPTTAGPPAPAP--------- 2771
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  799 hidtagclevvrerKALPPRLPQAGARDPPVhllqaSSSAQSTPGHLFPNVPAqEASKSASHKGSRRLASSRVERTLPQA 878
Cdd:PHA03247 2772 --------------PAAPAAGPPRRLTRPAV-----ASLSESRESLPSPWDPA-DPPAAVLAPAAALPPAASPAGPLPPP 2831
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  879 SLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSvilQPPLPHTPhgRPAPGPTVLNVPLSGPGAP 958
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPA---RPPVRRLA--RPAVSRSTESFALPPDQPE 2906
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  959 AAAKPgtsgswqEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQ--SQAPAASRKQG 1036
Cdd:PHA03247 2907 RPPQP-------QAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAlvPGRVAVPRFRV 2979
                         490       500
                  ....*....|....*....|...
gi 119608631 1037 LPEAPPFLPAAPSPTPLPVQPLS 1059
Cdd:PHA03247 2980 PQPAPSREAPASSTPPLTGHSLS 3002
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
938-1156 7.36e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 7.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  938 HGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRlstmqalplAPVFSEAEGTAPAASQAPALGPGQISVS 1017
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA---------AAAPAEASAAPAPGVAAPEHHPKHVAVP 660
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1018 CPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLP 1097
Cdd:PRK07764  661 DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVP 740
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 119608631 1098 RPaGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSWQPPAnmnrEPEPSCRTDTPAPP 1156
Cdd:PRK07764  741 LP-PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS----EEEEMAEDDAPSMD 794
PHA03378 PHA03378
EBNA-3B; Provisional
874-1209 1.19e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 1.19e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  874 TLPQASLLASTGP----RPKPKTVSELLQEKRLQEARAREAT---RGPVVL---PSQLLVSSSVILQPPLPHTPHGRPAP 943
Cdd:PHA03378  578 TSPTTSQLASSAPsyaqTPWPVPHPSQTPEPPTTQSHIPETSaprQWPMPLrpiPMRPLRMQPITFNVLVFPTPHQPPQV 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  944 GPTVLNV----PLSGPGAPAAAKPGTSGSWQEAGTsakdkrlsTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCP 1019
Cdd:PHA03378  658 EITPYKPtwtqIGHIPYQPSPTGANTMLPIQWAPG--------TMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAA 729
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1020 ESGLGQSQAPAASRKQGlPEAPPFLPAAPSPTPLPVQPLSlthiGGPHVATSVPLPVTWVLTAQ----GLLPVPVPAV-- 1093
Cdd:PHA03378  730 APGRARPPAAAPGRARP-PAAAPGRARPPAAAPGRARPPA----AAPGAPTPQPPPQAPPAPQQrprgAPTPQPPPQAgp 804
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1094 ----VSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPALSSSwQPPANMNREPEPSCRTDT-PAPPTHALSQSPAEAD 1168
Cdd:PHA03378  805 tsmqLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALER-QAAAGPTPSPGSGTSDKIvQAPVFYPPVLQPIQVM 883
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 119608631 1169 GSVAFV---------------PGEAQVA-----REIPEPRTSSHADPPEAEPPWSGRLPAF 1209
Cdd:PHA03378  884 RQLGSVraaaastvtqapteyTGERRGVgpmhpTDIPPSKRAKTDAYVESQPPHGGQSHSF 944
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
404-443 1.36e-04

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 41.01  E-value: 1.36e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 119608631  404 WAPEEDAKLLQAVAKYGEQDWFKIREE---VPGRSDAQCRDRY 443
Cdd:cd11660     3 WTDEEDEALVEGVEKYGVGNWAKILKDyffVNNRTSVDLKDKW 45
Myb_DNA-bind_6 pfam13921
Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, ...
262-305 1.36e-04

Myb-like DNA-binding domain; This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.


Pssm-ID: 372817 [Multi-domain]  Cd Length: 60  Bit Score: 41.14  E-value: 1.36e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 119608631   262 DWEKISNInfEGSRSAEEIRKFWQNSEHPSINKQEWSREEEERL 305
Cdd:pfam13921   19 DWKQIAKE--LGRRTPKQCFDRWRRKLNPKISRGPWSKEEDQRL 60
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
292-411 3.11e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 3.11e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  292 INKQEWSREEEERLQAIAAAHGHLEWQKIAEELgTSRSAFQC-LQKFQQHNKALKRKEWTEEEDRMLTQLVQEMrvGSHI 370
Cdd:COG5147    18 RKGGSWKRTEDEDLKALVKKLGPNNWSKVASLL-ISSTGKQSsNRWNNHLNPQLKKKNWSEEEDEQLIDLDKEL--GTQW 94
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 119608631  371 pyRRIVYYMEGRDSMQLIYRWTKSLDPGLKKGYWAPEEDAK 411
Cdd:COG5147    95 --STIADYKDRRTAQQCVERYVNTLEDLSSTHDSKLQRRNE 133
REB1 COG5147
Myb superfamily proteins, including transcription factors and mRNA splicing factors ...
233-375 3.50e-04

Myb superfamily proteins, including transcription factors and mRNA splicing factors [Transcription / RNA processing and modification / Cell division and chromosome partitioning];


Pssm-ID: 227476 [Multi-domain]  Cd Length: 512  Bit Score: 45.16  E-value: 3.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  233 KQGREAEKEiQDINQLPE-----EALLGNRLDSHDWEKISNINFE----GSRSAEEIRKFWQNSEHPSINKQEWSREEEE 303
Cdd:COG5147   222 KKGETLALE-QEINEYKEkkglsRKQFCERIWSTDRDEDKFWPNIykklPYRDKKSIYKHLRRKYNIFEQRGKWTKEEEQ 300
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 119608631  304 RLQAIAAAHGHLeWQKIAEELGTSRSafQCLQKFQQHNK---ALKRKEWTEEEDRMLTQLVQEMRVGSHiPYRRI 375
Cdd:COG5147   301 ELAKLVVEHGGS-WTEIGKLLGRMPN--DCRDRWRDYVKcgdTLKRNRWSIEEEELLDKVVNEMRLEAQ-QSSRI 371
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
792-1319 5.91e-04

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 44.48  E-value: 5.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  792 TLFTQLFHIDTAGCLEVVRERKALPPRLP----QAGARDPPVHLLQASSSAQSTPGHLFPNVPAQEASKSASHKGSRRLA 867
Cdd:COG3321   839 QLWVAGVPVDWSALYPGRGRRRVPLPTYPfqreDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAA 918
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  868 SSRVERTLPQASLLASTGPRPKPKTVSELLQEKRLQEARAREATRGPVVLPSQLLVSSSVILQPPLPHTPHGRPAPGPTV 947
Cdd:COG3321   919 LALAAAALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAA 998
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  948 LNVPLSGPGAPAAAKPGTSGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQ 1027
Cdd:COG3321   999 AAALALLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELA 1078
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1028 APAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAG 1107
Cdd:COG3321  1079 LAAAALALAAALAAAALALALAALAAALLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAA 1158
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1108 LLATLLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVAREIPEP 1187
Cdd:COG3321  1159 LAAALAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLAL 1238
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1188 RTSSHADPPEAE-PPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEK 1266
Cdd:COG3321  1239 AAAAAAVAALAAaAAALLAALAALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAA 1318
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|...
gi 119608631 1267 GALDLGLLSQEGEAATQQWLGGQRGVRVPLLGSRLPYQPPALCSLRALSGLLL 1319
Cdd:COG3321  1319 AALAAALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAAAAGAAA 1371
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
966-1235 7.10e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.39  E-value: 7.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  966 SGSWQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGlgqSQAPAASRKQGLPEAPPflP 1045
Cdd:PHA03307   37 SGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLS---TLAPASPAREGSPTPPG--P 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1046 AAPSPTPLPVQPLSLTHIGGPHVATSVPLPVTWVLTAQGllpVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGP 1125
Cdd:PHA03307  112 SSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAA---SPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSS 188
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1126 RAPALSSSWQPPANMNREPEP------SCRTDTPAPPTHALSQSPAEADGSVAFVPGEAQVARE----IPEPRTSSHADP 1195
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRsspisaSASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEnecpLPRPAPITLPTR 268
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 119608631 1196 PEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGPL 1235
Cdd:PHA03307  269 IWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPA 308
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1006-1307 1.06e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1006 APALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPVQplslthigGPHVATSVPLPVTWVLTAQGL 1085
Cdd:PRK07764  385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAA--APAPAAAPQPAP--------APAPAPAPPSPAGNAPAGGAP 454
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1086 LPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRAPAlSSSWQPPANMNREPEPS-----------CRTDTPA 1154
Cdd:PRK07764  455 SPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPA-APAAPAGADDAATLRERwpeilaavpkrSRKTWAI 533
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1155 PPTHAlsqSPAEADGSV---AFV----------PGEAQVAREIPEPRT-------------SSHADPPEAEPPWSGRLPA 1208
Cdd:PRK07764  534 LLPEA---TVLGVRGDTlvlGFStgglarrfasPGNAEVLVTALAEELggdwqveavvgpaPGAAGGEGPPAPASSGPPE 610
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1209 FGGVIPATEPRGTPGSPSGTQEPRGPLGLEKLPLRQPGPEKGALDLEKPPLPQPGPEKGALDLGLLSQEGEAATQQWLG- 1287
Cdd:PRK07764  611 EAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAp 690
                         330       340
                  ....*....|....*....|.
gi 119608631 1288 -GQRGVRVPLLGSRLPYQPPA 1307
Cdd:PRK07764  691 aAPAGAAPAQPAPAPAATPPA 711
PLN03212 PLN03212
Transcription repressor MYB5; Provisional
398-502 1.11e-03

Transcription repressor MYB5; Provisional


Pssm-ID: 178751 [Multi-domain]  Cd Length: 249  Bit Score: 42.37  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  398 GLKKGYWAPEEDAKLLQAVAKYGEQDWFKIREEVP-GRSDAQCRDRYLRRLHFSLKKGRWNLKEEEQLIELIEKYGvGHW 476
Cdd:PLN03212   22 GMKRGPWTVEEDEILVSFIKKEGEGRWRSLPKRAGlLRCGKSCRLRWMNYLRPSVKRGGITSDEEDLILRLHRLLG-NRW 100
                          90       100
                  ....*....|....*....|....*.
gi 119608631  477 AKIASELPHRSGSQCLSKWKIMMGKK 502
Cdd:PLN03212  101 SLIAGRIPGRTDNEIKNYWNTHLRKK 126
PHA03247 PHA03247
large tegument protein UL36; Provisional
942-1186 1.25e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  942 APGPTVLNVPLSGpGAPAAAKPGTSGSWQ-EAGTSAKDKRlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPE 1020
Cdd:PHA03247  254 APAPPPVVGEGAD-RAPETARGATGPPPPpEAAAPNGAAA-------PPDGVWGAALAGAPLALPAPPDPPPPAPAGDAE 325
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1021 SGLGQSQAPAASRKQGLPEA--PPFLPAAPSPTPLPvqPLSLTHI-GGPHVATSVPLPVTWVLTA--------------- 1082
Cdd:PHA03247  326 EEDDEDGAMEVVSPLPRPRQhyPLGFPKRRRPTWTP--PSSLEDLsAGRHHPKRASLPTRKRRSArhaatpfargpggdd 403
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1083 QGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTEtraAQGPRAPALSSSWQPPANMNREPEPSCRTDT---------- 1152
Cdd:PHA03247  404 QTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAE---PGSDDGPAPPPERQPPAPATEPAPDDPDDATrkaldalrer 480
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 119608631 1153 --PAPPTHALSQ----SPAEADGSVAFVPGEAQVAREIPE 1186
Cdd:PHA03247  481 rpPEPPGADLAEllgrHPDTAGTVVRLAAREAAIAREVAE 520
PHA03379 PHA03379
EBNA-3A; Provisional
676-1234 1.25e-03

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 43.51  E-value: 1.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  676 RVLRANTAARSCTQKEQLRQPPLPTSSpgvssgdsVARSHVQWLRHRATQSGQRRWRHALHRRLLNRRLLLAVTPWVGDV 755
Cdd:PHA03379  388 RLLLMRAGKLTERAREALEKASEPTYG--------TPRPPVEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQ 459
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  756 --VVPCTQASQRPAVVQTQADGLREQ--LQQARLASTPVFTLFTQLFHIDTAGCLEVvrERKALPPRLPQAGARDP-PVH 830
Cdd:PHA03379  460 hsMAPCPVAQLPPGPLQDLEPGDQLPgvVQDGRPACAPVPAPAGPIVRPWEASLSQV--PGVAFAPVMPQPMPVEPvPVP 537
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  831 LLQASSSAQSTPGHLFPNVPAQEAsksashkGSRRLAssrvERTLPqasllASTGPRPkPKTVSELLQEKRLQEARA-RE 909
Cdd:PHA03379  538 TVALERPVCPAPPLIAMQGPGETS-------GIVRVR----ERWRP-----APWTPNP-PRSPSQMSVRDRLARLRAeAQ 600
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  910 ATRGPV-VLPSQL-LVSSSVILQPPLPHTPHGRPAPGPTVLNVPLSGPGAPAAAKPGTSGSWQEAGTsakdkrlstmQAL 987
Cdd:PHA03379  601 PYQASVeVQPPQLtQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYFDLPLQQPIS----------QGA 670
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  988 PLAPVFSEAEGTAPAASQAPALgpgqisvscPESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLTHIGGPH 1067
Cdd:PHA03379  671 PLAPLRASMGPVPPVPATQPQY---------FDIPLTEPINQGASAAHFLPQQPMEGPLVPERWMFQGATLSQSVRPGVA 741
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1068 VATSVPLPVTWVLTAQGllpvpvPAVVSLPRPAgTPGP-AGLLATLLPPLTETRAAQGPRapALSSSWQPPANMNREPEP 1146
Cdd:PHA03379  742 QSQYFDLPLTQPINHGA------PAAHFLHQPP-MEGPwVPEQWMFQGAPPSQGTDVVQH--QLDALGYVLHVLNHPGVP 812
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1147 ScrtdTPAPPTHALSQS----PAEADGSvafvpGEAQVAREIPEP-RTSSHADPPEAEPPWSGRLPafgGVIPATEPRGT 1221
Cdd:PHA03379  813 V----SPAVNQYHVSQAafglPIDEDES-----GEGSDTSEPCEAlDLSIHGRPCPQAPEWPVQGE---GGQDATEVLDL 880
                         570
                  ....*....|...
gi 119608631 1222 pgSPSGTQEPRGP 1234
Cdd:PHA03379  881 --SIHGRPRPRTP 891
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
193-359 1.91e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 43.03  E-value: 1.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   193 RKSVVSDRLQRL---LQPKLLKLEYLHQKQSKVSSELERQALEKQGREAEKEIQdiNQLPEEALLGNRLD-SHDWEKISN 268
Cdd:TIGR00618  220 RKQVLEKELKHLreaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELR--AQEAVLEETQERINrARKAAPLAA 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631   269 InfegSRSAEEIRKFWQNSeHPSINKQEWSReEEERLQAIAAAHGHLEWQKIAEELGTSRSafQCLQKFQQHNKALKRKE 348
Cdd:TIGR00618  298 H----IKAVTQIEQQAQRI-HTELQSKMRSR-AKLLMKRAAHVKQQSSIEEQRRLLQTLHS--QEIHIRDAHEVATSIRE 369
                          170
                   ....*....|....*
gi 119608631   349 ----WTEEEDRMLTQ 359
Cdd:TIGR00618  370 iscqQHTLTQHIHTL 384
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
954-1234 3.17e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 3.17e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  954 GPGAPAAAKPGtsgswqeagtsakdkrlstmqALPlapvfseaegtAPAASQAPALGPGQISVSCPESGlgqsQAPAASR 1033
Cdd:PRK07003  368 PGGGVPARVAG---------------------AVP-----------APGARAAAAVGASAVPAVTAVTG----AAGAALA 411
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1034 KQGLPEAPPFLPAAPSPTPLPVQplslthiggphVATSVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTP--GPAGLLAT 1111
Cdd:PRK07003  412 PKAAAAAAATRAEAPPAAPAPPA-----------TADRGDDAADGDAPVPAKANARASADSRCDERDAQPpaDSGSASAP 480
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1112 LLPPLTETRAAQGPRAPALSSSWQPPANMNREPEPSCRTDTPAPPTHA--LSQSPAEADGSVAFVPGEAQVAREI----- 1184
Cdd:PRK07003  481 ASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPapEARPPTPAAAAPAARAGGAAAALDVlrnag 560
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 119608631 1185 -----PEPRTSSHADPPEAEPPWSGRLPAFGGVIPATEPRGTPGSPSGTQEPRGP 1234
Cdd:PRK07003  561 mrvssDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAAR 615
SANT_TRF cd11660
Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human ...
470-499 4.38e-03

Telomere repeat binding factor-like DNA-binding domains of the SANT/myb-like family; Human telomere repeat binding factors, TRF1 and TRF2, function as part of the 6 component shelterin complex. TRF2 binds DNA and recruits RAP1 (via binding to the RAP1 protein c-terminal (RCT)) and TIN2 in the protection of telomeres from DNA repair machinery. Metazoan shelterin consists of 3 DNA binding proteins (TRF2, TRF1, and POT1) and 3 recruited proteins that bind to one or more of these DNA-binding proteins (RAP1, TIN2, TPP1). Schizosaccharomyces pombe TAZ1 is an orthlog and binds RAP1. Human TRF1 and TRF2 bind double-stranded DNA. hTRF2 consists of a basic N-terminus, a TRF homology domain, the RAP1 binding motif (RBM), the TIN2 binding motif (TBM) and a myb-like DNA binding domain, SANT, named after 'SWI3, ADA2, N-CoR and TFIIIB', several factors that share this domain. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. The single myb-like domain of TRF-type proteins is similar to the tandem myb_like domains found in yeast RAP1.


Pssm-ID: 212558 [Multi-domain]  Cd Length: 50  Bit Score: 36.78  E-value: 4.38e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 119608631  470 KYGVGHWAKIASELP---HRSGSQCLSKWKIMM 499
Cdd:cd11660    17 KYGVGNWAKILKDYFfvnNRTSVDLKDKWRNLK 49
PRK10263 PRK10263
DNA translocase FtsK; Provisional
905-1232 7.26e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.22  E-value: 7.26e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  905 ARAREATRGPVVLPSQLLVSSSVILQP-------PLPHTPHGRPAPGPTvlnvplSGPGAPAAAKPGTSGS--WQEagts 975
Cdd:PRK10263  330 TQSWAAPVEPVTQTPPVASVDVPPAQPtvawqpvPGPQTGEPVIAPAPE------GYPQQSQYAQPAVQYNepLQQ---- 399
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  976 akdkrlstmqalPLAPVFSEAEGTAPAASQAPALGPGQISVSCPESGLGQSQAPAASRKQGLPEAPPflPAAPSPTPLPV 1055
Cdd:PRK10263  400 ------------PVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS--TFAPQSTYQTE 465
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1056 QPlslthiggphvatsVPLPVTWVLTAQGLLPVPVPAVVSLPRPAGTPGPAGLLATLLPPLTETRAAQGPRapaLSSSWQ 1135
Cdd:PRK10263  466 QT--------------YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQ---LAAWYQ 528
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631 1136 PPANMNREPEPSCRTdtpAPPTHALSQSPAEAdgsvafVPGEAQVAREIPEPRTSSHADPPEAEPPWSgrlPAFGGVipa 1215
Cdd:PRK10263  529 PIPEPVKEPEPIKSS---LKAPSVAAVPPVEA------AAAVSPLASGVKKATLATGAAATVAAPVFS---LANSGG--- 593
                         330
                  ....*....|....*..
gi 119608631 1216 tePRGTPGSPSGTQEPR 1232
Cdd:PRK10263  594 --PRPQVKEGIGPQLPR 608
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
939-1061 9.20e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 9.20e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119608631  939 GRPAPGPTVLNVPLSGPGAPAAAKPGTSGSwQEAGTSAKDKRLSTMQALPLAPVFSEAEGTAPAASQAPALGPGQISVSC 1018
Cdd:PRK14951  369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAA-AAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVAL 447
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 119608631 1019 PESGLGQSQAPAASRKQGLPEAPPFLPAAPSPTPLPVQPLSLT 1061
Cdd:PRK14951  448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH