Conserved domains on  [gi|688577521|ref|XP_009304365|]

eukaryotic translation initiation factor 4 gamma 3 isoform X2 [Danio rerio]

Protein Classification

eukaryotic translation initiation factor 4 gamma (domain architecture ID 10501430)

Eukaryotic translation initiation factor 4 gamma (eIF4G) plays a key functional role in the initiation of cap-dependent translation by acting as an adapter that nucleates the assembly of the eIF4F complex.

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
877-1105 1.13e-62

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 212.61  E-value: 1.13e-62
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   877 FRKVRSILNKLTPQMFHQLMKQVTDLTINTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLATLKvptadkpntTVNFR 956
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLRN---------PTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   957 KLLLNRCQKEFERDkvddvvlerkqkeidsatsptekerlqEELEEAKDKARRRSTGNIKFIGELFKLKMLTEPIMHDCV 1036
Cdd:pfam02854   72 IHLLNRLQEEFEKR---------------------------FELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 688577521  1037 VKLLKNH-------DDESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLHN 1105
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
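
Each hit carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) and a query interval. The following is a minimal sketch, assuming the plain-text layout shown in this listing, of reading those values and computing how many query residues an interval spans. The names `parse_summary` and `interval_span` are illustrative and are not part of any NCBI tool.

```python
import re

# Assumed layout of the summary line in this text dump; the optional
# bracketed flag covers lines such as "Pssm-ID: 223021 [Multi-domain] ...".
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def interval_span(interval):
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return end - start + 1


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 397130  Cd Length: 203  Bit Score: 212.61  E-value: 1.13e-62"
    )
    print(hit, interval_span("877-1105"))
```

A query span longer than the Cd Length, as for the MIF4G hit, is consistent with the lowercase query insertions visible in the alignment rows.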
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1548-1683 1.40e-47

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 166.69  E-value: 1.40e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521 1548 LSPEELFKQLEQLLLEDMSSDEqIFDWIEANLDESQMSSSPFLRALMTAICKAAVKDESTsCRVDTAIIQKRLPILHKYF 1627
Cdd:cd11559     1 LPLLRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSL-PEKEKALLEKYAPLLQKYL 78
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 688577521 1628 DSDTERQLQALYALQSLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWETSKDPT 1683
Cdd:cd11559    79 DDDEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1349-1460 7.07e-34

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 126.62  E-value: 7.07e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  1349 IERKSKAIIDEFLHINDYKEAVQCVLEIEQPSMLCVFVRMGLESTLERSQKAREHMGLLYYQLIQKGILPHSQLYKGFSE 1428
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|..
gi 688577521  1429 MLEQADDMAIDIPFIWLYLAELLSPLLKEGGI 1460
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGL 112
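
The alignment blocks pair a query row (`gi ...`) with a model row (`Cdd:...`), each giving a start coordinate, a sequence string, and an end coordinate. The sketch below splits such a row and counts identical aligned columns. It assumes, from the pattern visible in these rows, that lowercase query residues and `-` characters mark unaligned or gapped columns; the function names are hypothetical.

```python
def parse_row(line):
    """Split one alignment row into (label, start, sequence, end)."""
    parts = line.split()
    if len(parts) < 4:
        raise ValueError(f"not an alignment row: {line!r}")
    label = " ".join(parts[:-3])
    return label, int(parts[-3]), parts[-2], int(parts[-1])


def residue_count(sequence):
    """Count residues in a row, ignoring gap characters."""
    return sum(1 for char in sequence if char.isalpha())


def identity(query_seq, model_seq):
    """Return (identical, aligned) column counts for two equal-length rows.

    Only columns where both rows show an uppercase residue are treated as
    aligned; this is an assumption about the display convention.
    """
    if len(query_seq) != len(model_seq):
        raise ValueError("rows must have the same number of columns")
    aligned = 0
    identical = 0
    for q, m in zip(query_seq, model_seq):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Toy rows for illustration only, not taken from the hits above.
    label, start, seq, end = parse_row("gi 1  10 ACDEfgHIK 18")
    print(label, start, end, residue_count(seq))
    print(identity(seq, "ACDQ--HLK"))
```

A quick consistency check on any row is that `end - start + 1` equals `residue_count(sequence)`.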
PHA03247 super family cl33720
large tegument protein UL36; Provisional
4-426 2.01e-13

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 76.13  E-value: 2.01e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    4 PPKVVPKPAAVAVSGHVTGP-APPTQLRAaltsvSLPPGAQNAPPSAVPPTQIPRAALSLDErmfPAhsgvtavysvsrh 82
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPdAPPQSARP-----RAPVDDRGDPRGPAPPSPLPPDTHAPDP---PP------------- 2628
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   83 PGPPFPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAGPSyRILKPWETGGAP--PYNPAQNAGSAPLVYSPQTQPMNVQ 160
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRAR-RLGRAAQASSPPqrPRRRAARPTVGSLTSLADPPPPPPT 2707
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  161 PQTRPFVTGPR-PTHHQFIHRSQMQPARPTLPTNNPS----IRPGSQTPTATVYPPNQPIMMTMTPMPFATqthqyyiPQ 235
Cdd:PHA03247 2708 PEPAPHALVSAtPLPPGPAAARQASPALPAAPAPPAVpagpATPGGPARPARPPTTAGPPAPAPPAAPAAG-------PP 2780
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  236 YRHSAPYVGPPQQYAVQPPGSGTfyPGPSPAEYPTPYAAGPPYYTGQTVYPPSPPIIVPAPMpppptkrekKPSSQIRIR 315
Cdd:PHA03247 2781 RRLTRPAVASLSESRESLPSPWD--PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP---------PPPGPPPPS 2849
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  316 DPNQGGkdITEEIMFGSRNPT-PPAGHPASTLTPPAGR-PSSTPTPPSGRLSSTPTPPQRPSNCQTPEQTAYVNQNQRlS 393
Cdd:PHA03247 2850 LPLGGS--VAPGGDVRRRPPSrSPAAKPAAPARPPVRRlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP-P 2926
                         410       420       430
                  ....*....|....*....|....*....|...
gi 688577521  394 ESPAPMDGKPSlddRPKMESGPIKSISPGPRPS 426
Cdd:PHA03247 2927 PQPQPPPPPPP---RPQPPLAPTTDPAGAGEPS 2956
 
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
878-1102 1.12e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214713  Cd Length: 200  Bit Score: 181.02  E-value: 1.12e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    878 RKVRSILNKLTPQMFHQLMKQVTDLTINTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLAtLKVPtadkpnttvNFRK 957
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    958 LLLNRCQKEFERDkvddvvlerkqkeidsatsptekerlqeeLEEAKDKARRRSTGNIKFIGELFKLKMLTEPIMHDCVV 1037
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 688577521   1038 KLLKNH-------DDESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1102
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1349-1461 2.52e-33

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TiBS), "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 125.05  E-value: 2.52e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   1349 IERKSKAIIDEFLHINDYKEAVQCVLEIEQPSMLCVFVRMGLESTLERSQKAREHMGLLYYQLIQKGILPHSQLYKGFSE 1428
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 688577521   1429 MLEQADDMAIDIPFIWLYLAELLSPLLKEGGIN 1461
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1634-1710 3.25e-25

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 100.29  E-value: 3.25e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 688577521  1634 QLQALYALQSLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWETSKDPTEQlGKGVALKSVNAFFTWLREAEEESE 1710
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1621-1705 1.20e-24

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 99.29  E-value: 1.20e-24
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   1621 PILHKYFDSDTERQLQALYALQSLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWETSKDPTEqlGKGVALKSVNAFFT 1700
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 688577521   1701 WLREA 1705
Cdd:smart00515   79 WLQEA 83
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
11-426 1.94e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 69.41  E-value: 1.94e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    11 PAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPraalsldermfPAHSGVTAVYSVSRHPGPPFPGH 90
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQP-----------PNQTQSTAAPHTLIQQTPTLHPQ 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    91 DLSKTHPNLAGTPPghatSPALSQVSVPAGPSyrilkPWETGGAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGP 170
Cdd:pfam03154  241 RLPSPHPPLQPMTQ----PPPPSQVSPQPLPQ-----PSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   171 RPTHHQFIHRSQmqpaRPTLPTNNPSIRPGsQTPTATVYPPnQPIMMTMTPMPFATQTHQYYIPQ-YRHSAPYVGP-PQQ 248
Cdd:pfam03154  312 GPSPAAPGQSQQ----RIHTPPSQSQLQSQ-QPPREQPLPP-APLSMPHIKPPPTTPIPQLPNPQsHKHPPHLSGPsPFQ 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   249 YAVQPPGSGTFYPGPSPAEYPTPYAAGPPYY---TGQTVYPPSPPIIVPAPMPPPPTKREKKPSSQIRIRDPNQGGKDIT 325
Cdd:pfam03154  386 MNSNLPPPPALKPLSSLSTHHPPSAHPPPLQlmpQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQH 465
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   326 EEIMFGSRNPTPPAGHPAStlTPPAGrPSSTPtPPSGRLSSTPTPPQRPSNCQTPEQtayvnqnqrLSESPapmdgkpsL 405
Cdd:pfam03154  466 PFVPGGPPPITPPSGPPTS--TSSAM-PGIQP-PSSASVSSSGPVPAAVSCPLPPVQ---------IKEEA--------L 524
                          410       420
                   ....*....|....*....|.
gi 688577521   406 DDRPKMESGPIKSISPGPRPS 426
Cdd:pfam03154  525 DEAEEPESPPPPPRSPSPEPT 545
KLF1_N cd21581
N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as ...
139-277 7.58e-03

N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as Krueppel-like factor 1 or Erythroid Kruppel-like Factor/EKLF) was the first Kruppel-like factor discovered. It was found to be vitally important for embryonic erythropoiesis in promoting the switch from fetal hemoglobin (Hemoglobin F) to adult hemoglobin (Hemoglobin A) gene expression by binding to highly conserved CACCC domains. EKLF ablation in mouse embryos produces a lethal anemic phenotype, causing death by embryonic day 14, and natural mutations lead to beta+ thalassemia in humans. However, expression of embryonic hemoglobin and fetal hemoglobin genes is normal in EKLF-deficient mice, suggesting other factors may be involved. KLF1 functions as a transcriptional activator. It belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF1, which is related to the N-terminal domains of KLF2 and KLF4.


Pssm-ID: 409227 [Multi-domain]  Cd Length: 278  Bit Score: 40.41  E-value: 7.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  139 PAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGP----------RPTHHQFIHRSQMQPAR-PTL-PTNNPSIRPGSQTPTA 206
Cdd:cd21581    93 EEQPGAYYEPPKKDQPGTEGLQVGGPGLMAELlspeestgwaPPEPHHGYPDAFVGPALfPAPaNVDQFGFPQGGSVDRR 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  207 TV------------YPPNQPIMMTMTPMPF----ATQT------HQYYIPQYRHSApyvGPPQQYAvQPPGSGTFYPGPS 264
Cdd:cd21581   173 GNlsksgswdfgsyYPQQHPSVVAFPDSRFgplsGPQAltpdpqHYGYFQLFRHNA---ALFPDYA-HSPGPGHLPLGQQ 248
                         170
                  ....*....|....*
gi 688577521  265 P--AEYPTPYAAGPP 277
Cdd:cd21581   249 PllPDPPLPPGGAEG 263
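
The KLF1_N description above quotes a degenerate consensus motif, NNRCRCCYY, written with IUPAC nucleotide codes (N, R, Y). As a rough illustration of how such a motif can be read, the sketch below expands the codes used in that description into a regular expression and scans one strand of a DNA string. The helper names and the example sequence are assumptions made for illustration, and only the forward strand is searched.

```python
import re

# Subset of IUPAC nucleotide codes, limited to those used in the description.
IUPAC = {
    "A": "A",
    "C": "C",
    "G": "G",
    "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}


def motif_to_regex(motif):
    """Translate an IUPAC motif into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in motif.upper()))


def find_sites(motif, sequence):
    """Return 0-based start positions of all (possibly overlapping) matches."""
    pattern = motif_to_regex(motif)
    sequence = sequence.upper()
    width = len(motif)
    return [
        start
        for start in range(len(sequence) - width + 1)
        if pattern.match(sequence, start)
    ]


if __name__ == "__main__":
    # Made-up sequence containing a CACCC core; not taken from any genome.
    print(motif_to_regex("NNRCRCCYY").pattern)
    print(find_sites("NNRCRCCYY", "TTAAGCACCCTTT"))
```

Scanning the reverse complement as well would be needed for a strand-independent search; that step is left out here to keep the sketch short.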
 
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1551-1677 5.68e-20

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 87.53  E-value: 5.68e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521 1551 EELFKQLEQLLLEDMSSDEQIFDWIEANLDESQMSSSPFLRALMTAIC---KAAVKDESTSCRVDTAIIQKRLPILHKYF 1627
Cdd:cd11473     4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVnavESADSISLTQKEQLVLVLKKYGPVLRELL 83
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 688577521 1628 DSDTERQLQALYALQ--SLIVALDQPPNLLRMFFDCLYDEDVISEDAFYQWE 1677
Cdd:cd11473    84 KLIKKDQLYLLLKIEklCLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1591-1710 2.34e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 75.37  E-value: 2.34e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521 1591 RALMTAICKAAVKDESTS---CRVDTAIIQKRL-PILHKYFDSDTErQLQALYALQSLIVALDQPPNLLRMFFDCLYDED 1666
Cdd:cd11558    47 RAVVKALLELILEVSSTStaeLLEALKKLLSKWgPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 688577521 1667 VISEDAFYQWETSKDPTEQLGKGVALKSVNAFFTWLREAEEESE 1710
Cdd:cd11558   126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-426 2.01e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 76.13  E-value: 2.01e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    4 PPKVVPKPAAVAVSGHVTGP-APPTQLRAaltsvSLPPGAQNAPPSAVPPTQIPRAALSLDErmfPAhsgvtavysvsrh 82
Cdd:PHA03247 2570 PPRPAPRPSEPAVTSRARRPdAPPQSARP-----RAPVDDRGDPRGPAPPSPLPPDTHAPDP---PP------------- 2628
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   83 PGPPFPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAGPSyRILKPWETGGAP--PYNPAQNAGSAPLVYSPQTQPMNVQ 160
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRAR-RLGRAAQASSPPqrPRRRAARPTVGSLTSLADPPPPPPT 2707
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  161 PQTRPFVTGPR-PTHHQFIHRSQMQPARPTLPTNNPS----IRPGSQTPTATVYPPNQPIMMTMTPMPFATqthqyyiPQ 235
Cdd:PHA03247 2708 PEPAPHALVSAtPLPPGPAAARQASPALPAAPAPPAVpagpATPGGPARPARPPTTAGPPAPAPPAAPAAG-------PP 2780
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  236 YRHSAPYVGPPQQYAVQPPGSGTfyPGPSPAEYPTPYAAGPPYYTGQTVYPPSPPIIVPAPMpppptkrekKPSSQIRIR 315
Cdd:PHA03247 2781 RRLTRPAVASLSESRESLPSPWD--PADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP---------PPPGPPPPS 2849
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  316 DPNQGGkdITEEIMFGSRNPT-PPAGHPASTLTPPAGR-PSSTPTPPSGRLSSTPTPPQRPSNCQTPEQTAYVNQNQRlS 393
Cdd:PHA03247 2850 LPLGGS--VAPGGDVRRRPPSrSPAAKPAAPARPPVRRlARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPP-P 2926
                         410       420       430
                  ....*....|....*....|....*....|...
gi 688577521  394 ESPAPMDGKPSlddRPKMESGPIKSISPGPRPS 426
Cdd:PHA03247 2927 PQPQPPPPPPP---RPQPPLAPTTDPAGAGEPS 2956
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
11-426 1.94e-11

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 69.41  E-value: 1.94e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    11 PAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPraalsldermfPAHSGVTAVYSVSRHPGPPFPGH 90
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQP-----------PNQTQSTAAPHTLIQQTPTLHPQ 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    91 DLSKTHPNLAGTPPghatSPALSQVSVPAGPSyrilkPWETGGAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGP 170
Cdd:pfam03154  241 RLPSPHPPLQPMTQ----PPPPSQVSPQPLPQ-----PSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   171 RPTHHQFIHRSQmqpaRPTLPTNNPSIRPGsQTPTATVYPPnQPIMMTMTPMPFATQTHQYYIPQ-YRHSAPYVGP-PQQ 248
Cdd:pfam03154  312 GPSPAAPGQSQQ----RIHTPPSQSQLQSQ-QPPREQPLPP-APLSMPHIKPPPTTPIPQLPNPQsHKHPPHLSGPsPFQ 385
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   249 YAVQPPGSGTFYPGPSPAEYPTPYAAGPPYY---TGQTVYPPSPPIIVPAPMPPPPTKREKKPSSQIRIRDPNQGGKDIT 325
Cdd:pfam03154  386 MNSNLPPPPALKPLSSLSTHHPPSAHPPPLQlmpQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQH 465
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   326 EEIMFGSRNPTPPAGHPAStlTPPAGrPSSTPtPPSGRLSSTPTPPQRPSNCQTPEQtayvnqnqrLSESPapmdgkpsL 405
Cdd:pfam03154  466 PFVPGGPPPITPPSGPPTS--TSSAM-PGIQP-PSSASVSSSGPVPAAVSCPLPPVQ---------IKEEA--------L 524
                          410       420
                   ....*....|....*....|.
gi 688577521   406 DDRPKMESGPIKSISPGPRPS 426
Cdd:pfam03154  525 DEAEEPESPPPPPRSPSPEPT 545
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1569-1710 7.12e-11

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 62.25  E-value: 7.12e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521 1569 EQIFDWIEANLDESQMSSspflralMTAICKAAVKDESTSCRV---------DTAI---IQKRLPILHKYFDSDtERQLQ 1636
Cdd:cd11561     9 DELGEFLKKNKDESGLSE-------LKEILKEAERLDVVKDKAvlvlaevlfDENIvkeIKKRKALLLKLVTDE-KAQKA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521 1637 ALYALQSLIV-----ALDQPPNLLRmffdCLYDEDVISEDAFYQWETsKDPTEQLGKGVA---LKSVNAFFTWLREAEEE 1708
Cdd:cd11561    81 LLGGIERFCGkhspeLLKKVPLILK----ALYDNDILEEEVILKWYE-KVSKKYVSKEKSkkvRKAAEPFVEWLEEAEEE 155

                  ..
gi 688577521 1709 SE 1710
Cdd:cd11561   156 EE 157
PHA03247 PHA03247
large tegument protein UL36; Provisional
58-593 2.31e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 66.12  E-value: 2.31e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   58 AALSLDERMFPAHSGVTAVYSVSRHPGPPFPGHDLSKTHPNLA------GTP-PGHATSPALSQVSVPAGPSYRILKPWE 130
Cdd:PHA03247 2445 AGLAADGDPFFARTILGAPFSLSLLLGELFPGAPVYRRPAEARfpfaagAAPdPGGGGPPDPDAPPAPSRLAPAILPDEP 2524
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  131 TGGAPPYN-----------PAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGPRPTHHQFIHRSqmqpARPTLPtnnpsirP 199
Cdd:PHA03247 2525 VGEPVHPRmltwirgleelASDDAGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRA----RRPDAP-------P 2593
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  200 GSQTPTATVYPPNQPimmtmtpmpfatqthqyyipqYRHSAPYVGPPQQYAVQPPGSGtfyPGPSPAEYPTPYAAGPPyy 279
Cdd:PHA03247 2594 QSARPRAPVDDRGDP---------------------RGPAPPSPLPPDTHAPDPPPPS---PSPAANEPDPHPPPTVP-- 2647
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  280 tgqtvyPPSPPIIVPAPMPPPPTKREKKPSSQIRIRDPNQGgkditeeimfgsrnPTPPAGHPA-STLTPPAGRPSSTPT 358
Cdd:PHA03247 2648 ------PPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQR--------------PRRRAARPTvGSLTSLADPPPPPPT 2707
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  359 P---PSGRLSSTPTPPQRPSNCQTPEQTAYVNQNQRLSESPApMDGKPSLDDRPKMESGPIKSISPGPR---PSESCLEK 432
Cdd:PHA03247 2708 PepaPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPA-TPGGPARPARPPTTAGPPAPAPPAAPaagPPRRLTRP 2786
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  433 REISSLPLLVSSSPEVDVSSHPTSgcIKPTAAGEPEFISPSATKAQTYQVISGEESVPEASPRLSASLSLRVVNGVNEPQ 512
Cdd:PHA03247 2787 AVASLSESRESLPSPWDPADPPAA--VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR 2864
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  513 TPSSYEEPEVQEAlkmSSSCEIQGTSFMEESGQEVPVALEELQAEHLPSLAAHVPliPGVQASSITSSTTSVLAPPPGLA 592
Cdd:PHA03247 2865 RPPSRSPAAKPAA---PARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPP--PQPQPQPPPPPQPQPPPPPPPRP 2939

                  .
gi 688577521  593 P 593
Cdd:PHA03247 2940 Q 2940
PHA03247 PHA03247
large tegument protein UL36; Provisional
2-360 6.12e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 6.12e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    2 SLPPKVVPKPAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAALSLDERMFPAHSGVTAVYSVSR 81
Cdd:PHA03247 2718 ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   82 HPGPPFPGHDLSKTHPNLAGTPPGHATSPAL-----SQVSVPAGPSYRILKPWETGG----------------------A 134
Cdd:PHA03247 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLppptsAQPTAPPPPPGPPPPSLPLGGsvapggdvrrrppsrspaakpaA 2877
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  135 PPYNPAQNAGSAPLVYSPQTQPMNVQPQTRPfvTGPRPTHHQFIHRSQMQPARPTLPTNNPSIRPGSQTPTATVYPPNQP 214
Cdd:PHA03247 2878 PARPPVRRLARPAVSRSTESFALPPDQPERP--PQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEP 2955
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  215 IMMTMTPMPFATQTHQYYIPQYRHSAPyvGPPQQYAVQPPGSGTFYPGPSPAEYPTPYA-----AGPPYYTGQTVYPPSp 289
Cdd:PHA03247 2956 SGAVPQPWLGALVPGRVAVPRFRVPQP--APSREAPASSTPPLTGHSLSRVSSWASSLAlheetDPPPVSLKQTLWPPD- 3032
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 688577521  290 piivpapmpppptkrEKKPSSQIRIRDPNQGGKDiTEEIMFGSRNPTPPAGHPASTLTPPAGR---PSSTPTPP 360
Cdd:PHA03247 3033 ---------------DTEDSDADSLFDSDSERSD-LEALDPLPPEPHDPFAHEPDPATPEAGAresPSSQFGPP 3090
PHA03379 PHA03379
EBNA-3A; Provisional
39-423 8.65e-09

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 60.46  E-value: 8.65e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   39 PPGAQNAPPSavPPTQIPRAALSLDERMFPAHSGVTAVYSVSRHPGPPFPGHDLSKTHPN-LAGTPPG------------ 105
Cdd:PHA03379  408 ASEPTYGTPR--PPVEKPRPEVPQSLETATSHGSAQVPEPPPVHDLEPGPLHDQHSMAPCpVAQLPPGplqdlepgdqlp 485
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  106 ---HATSPALSQVSVPAGPsyrILKPWETggappyNPAQNAGSAPLVYSPqtQPMNVQPQTRPFVTGPRPTHHQFIHRSQ 182
Cdd:PHA03379  486 gvvQDGRPACAPVPAPAGP---IVRPWEA------SLSQVPGVAFAPVMP--QPMPVEPVPVPTVALERPVCPAPPLIAM 554
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  183 MQPARPT-LPTNNPSIRPGSQTPTatvyPPNQPIMMTMTPMPFATQTHQYyipQYRHSApyvgppqqyAVQPPgSGTFYP 261
Cdd:PHA03379  555 QGPGETSgIVRVRERWRPAPWTPN----PPRSPSQMSVRDRLARLRAEAQ---PYQASV---------EVQPP-QLTQVS 617
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  262 GPSPAEYPTPyaagppyyTGQTVYPPSPPIIVPAPMPPPPTKREKKPSSQIRIRDPNQGGKDITeeIMFGSRNPTPPAGH 341
Cdd:PHA03379  618 PQQPMEYPLE--------PEQQMFPGSPFSQVADVMRAGGVPAMQPQYFDLPLQQPISQGAPLA--PLRASMGPVPPVPA 687
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  342 PAST-LTPPAGRPSSTPTPPSGRLSSTP-TPPQRPSncQTPEQTAYVNQNQRLSESPAPMDGKPSldDRPKMESGPIKSI 419
Cdd:PHA03379  688 TQPQyFDIPLTEPINQGASAAHFLPQQPmEGPLVPE--RWMFQGATLSQSVRPGVAQSQYFDLPL--TQPINHGAPAAHF 763

                  ....
gi 688577521  420 SPGP 423
Cdd:PHA03379  764 LHQP 767
PHA03378 PHA03378
EBNA-3B; Provisional
80-426 2.77e-08

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 58.93  E-value: 2.77e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   80 SRHPGPPFPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAgPSYrILKPWetggaPPYNPAQNAGSaplvysPQTQ---P 156
Cdd:PHA03378  550 SDEPASTEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSA-PSY-AQTPW-----PVPHPSQTPEP------PTTQshiP 616
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  157 MNVQPQTRPFVTGPRPthhqfIHRSQMQPArptlpTNNPSIRPGSQTPTATVYPPNQPIMMTMTPMPFATQTHQYYIPQY 236
Cdd:PHA03378  617 ETSAPRQWPMPLRPIP-----MRPLRMQPI-----TFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLP 686
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  237 RHSAPYvgppqqyAVQPPGSGtfyPGPSPAEYPTPYAAGPPYYTGQTVYPPSPPIIVPAPMPPPPTKREKKPSSQIRIRD 316
Cdd:PHA03378  687 IQWAPG-------TMQPPPRA---PTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARP 756
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  317 PNQG-GKDITEEIMFGSRNPTPPAGHPASTLTPPAGRPSSTPTPPSGRLSSTPTPPqrpsncQTPEQTAYVNQNQRLSES 395
Cdd:PHA03378  757 PAAApGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPR------AAPGQQGPTKQILRQLLT 830
                         330       340       350
                  ....*....|....*....|....*....|.
gi 688577521  396 PAPMDGKPSLddrpKMESGPIKSISPGPRPS 426
Cdd:PHA03378  831 GGVKRGRPSL----KKPAALERQAAAGPTPS 857
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
91-426 5.23e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.24  E-value: 5.23e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    91 DLSKTHPNLAGTPPG-HATSPALSQVSVPagpsyrilkPWETGGAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTrpfvtg 169
Cdd:pfam03154  159 DSSAQQQILQTQPPVlQAQSGAASPPSPP---------PPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQS------ 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   170 PRPTHHQFIHRSQMQPARptLPTNNPSIRPGSQTPTATVYPPN---QPIMMT-MTPMPFATQTHQYYIPQYRHSAPYVGP 245
Cdd:pfam03154  224 TAAPHTLIQQTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQplpQPSLHGqMPPMPHSLQTGPSHMQHPVPPQPFPLT 301
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   246 PQQYAVQ-PPGSGTFYPGPSPAEYPTPyaagPPYYTGQTVYPPSPPIIVPAPMPPPPTKR------EKKPSSQIRIRDPN 318
Cdd:pfam03154  302 PQSSQSQvPPGPSPAAPGQSQQRIHTP----PSQSQLQSQQPPREQPLPPAPLSMPHIKPppttpiPQLPNPQSHKHPPH 377
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   319 QGGKditEEIMFGSRNPTPPAGHPASTLtPPAGRPSSTPTPPSGRLSSTPTPPqrpsncqTPEQTAYVNQNQRLSESPAP 398
Cdd:pfam03154  378 LSGP---SPFQMNSNLPPPPALKPLSSL-STHHPPSAHPPPLQLMPQSQQLPP-------PPAQPPVLTQSQSLPPPAAS 446
                          330       340
                   ....*....|....*....|....*...
gi 688577521   399 MDGKPSLDDRPKMESGPIKSISPGPRPS 426
Cdd:pfam03154  447 HPPTSGLHQVPSQSPFPQHPFVPGGPPP 474
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
12-434 1.36e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.45  E-value: 1.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   12 AAVAVSGHVTGPAPPTQLRAAltsvslPPGAQNAPPSAVPPTQIPRAAlsldermfPAHSGVTAVYSVSRHPGPPfpghd 91
Cdd:PRK07764  385 LGVAGGAGAPAAAAPSAAAAA------PAAAPAPAAAAPAAAAAPAPA--------AAPQPAPAPAPAPAPPSPA----- 445
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   92 lsktHPNLAGTPPGHATSPALSQVSVPAGpsyrilkpwetgGAPPYNPAQNAGSAPLVYSPQTQPmnvqPQTRPFVTGPR 171
Cdd:PRK07764  446 ----GNAPAGGAPSPPPAAAPSAQPAPAP------------AAAPEPTAAPAPAPPAAPAPAAAP----AAPAAPAAPAG 505
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  172 PTHHQFIhRSQMQPARPTLPTNNPSIRPGSqTPTATVYPPNQPimmTMTpMPFATQTHQYYIPQYRHSAPYV-------- 243
Cdd:PRK07764  506 ADDAATL-RERWPEILAAVPKRSRKTWAIL-LPEATVLGVRGD---TLV-LGFSTGGLARRFASPGNAEVLVtalaeelg 579
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  244 --------------GPPQQYAVQPPGSGTFYPGPSPAEYPTPYAAGPPYYTGQTVYPPSPPIIVPAPMPPPPTKREKKPS 309
Cdd:PRK07764  580 gdwqveavvgpapgAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  310 SQIRIRDPNQGGKDITEEIMFGSRNPTPPAGHPASTLTPPAGRPSSTPTPPSGRLSSTPTPPQRPSNCQTPEQTAYVNQn 389
Cdd:PRK07764  660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDP- 738
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*
gi 688577521  390 QRLSESPAPMDGKPSLDDRPKMESGPIKSISPGPRPSESCLEKRE 434
Cdd:PRK07764  739 VPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
11-371 6.67e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 51.07  E-value: 6.67e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    11 PAAVAVSGHVTGPAP--PTQLRAALTSVSlpPGAQNAPPSAVPPTQIPR--AALSLDERMFPAHSGVTAVYSVSRHPGPP 86
Cdd:pfam05109  449 PSSTHVPTNLTAPAStgPTVSTADVTSPT--PAGTTSGASPVTPSPSPRdnGTESKAPDMTSPTSAVTTPTPNATSPTPA 526
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    87 FPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAgPSYRILKPWET----GGAPPYN----PAQNAgSAPLV--YSPQTQP 156
Cdd:pfam05109  527 VTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT-PAVTTPTPNATiptlGKTSPTSavttPTPNA-TSPTVgeTSPQANT 604
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   157 MNVQ---PQTRPFVTGPRPTHHQFIHRSQMQPARPTlpTNNPSIRPGSQTPTATVYPPNQ-----PIMMTMTP------- 221
Cdd:pfam05109  605 TNHTlggTSSTPVVTSPPKNATSAVTTGQHNITSSS--TSSMSLRPSSISETLSPSTSDNstshmPLLTSAHPtggenit 682
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   222 --MPFATQTHQYYI----PQYRHSAPYVGPPQQYAVQPPGSGTFYPGPSPAEYPTPYAAgppyyTGQTVYPPSPPIIVPA 295
Cdd:pfam05109  683 qvTPASTSTHHVSTsspaPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQAP-----SGQKTAVPTVTSTGGK 757
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   296 PMPPPPTKREKkpssqirirdpNQGGKDITEEIM-FGSRNPTPPAGHPASTLTPPAG----RPSSTPTPP--SGRLSSTP 368
Cdd:pfam05109  758 ANSTTGGKHTT-----------GHGARTSTEPTTdYGGDSTTPRTRYNATTYLPPSTssklRPRWTFTSPpvTTAQATVP 826

                   ...
gi 688577521   369 TPP 371
Cdd:pfam05109  827 VPP 829
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1549-1708 1.19e-05

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 47.98  E-value: 1.19e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521 1549 SPEELFKQLEQLLLEDMSSDEqifdwIEANLDEsQMSSSPFL---------RALMTAICKAAVKDESTscrvDTAI--IQ 1617
Cdd:cd11560    37 IKKELQQELKEMIAEEEPVKE-----IIAAVKE-QMKKSSLPehevvgllwTALMDAVEWSKKEDQIA----EQALrhLK 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521 1618 KRLPILhKYFDSDTERQLQALYALQslIVALDQPpNLLRMFFDC---LYDEDVISEDAFYQW--ETSKDPteqlGKGVAL 1692
Cdd:cd11560   107 KYAPLL-AAFCTTARAELALLNKIQ--EYCYENM-KFMKVFQKIvklLYKADVLSEDAILKWykKGHSPK----GKQVFL 178
                         170
                  ....*....|....*.
gi 688577521 1693 KSVNAFFTWLREAEEE 1708
Cdd:cd11560   179 KQMEPFVEWLQEAEEE 194
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
7-196 4.01e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 48.49  E-value: 4.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521     7 VVPKPAAVAVSGHVTGPAPPTQ------------LRAALTSVSLPPGAQNAPPSAVPPTQIPrAALSLDERMFPAHSGVT 74
Cdd:pfam09770  165 VAPKKAAAPAPAPQPAAQPASLpapsrkmmsleeVEAAMRAQAKKPAQQPAPAPAQPPAAPP-AQQAQQQQQFPPQIQQQ 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    75 AVYSVSRHPGPPFPGHDLSKT---HPNLAGTPPGHATSPALSQVSVPAGPSyrilkpwetggaPPYNPAQ-----NAGSA 146
Cdd:pfam09770  244 QQPQQQPQQPQQHPGQGHPVTilqRPQSPQPDPAQPSIQPQAQQFHQQPPP------------VPVQPTQilqnpNRLSA 311
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|
gi 688577521   147 PLVYSPQTQPMNVQPQtrpfvtgprPTHHQfiHRSQMQPARPTLPTNNPS 196
Cdd:pfam09770  312 ARVGYPQNPQPGVQPA---------PAHQA--HRQQGSFGRQAPIITHPQ 350
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
4-222 4.23e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.33  E-value: 4.23e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    4 PPKVVPKPAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAAlsldERMFPAHSGVTAVYSVSRHP 83
Cdd:PRK12323  380 APVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAA----ARQASARGPGGAPAPAPAPA 455
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   84 GPPFPGhdlskTHPNLAGTPPGHATSPALSQVSVPAG---PSYRILKPWET--GGAPPYNPAQN-AGSAPLVYSPQTQPM 157
Cdd:PRK12323  456 AAPAAA-----ARPAAAGPRPVAAAAAAAPARAAPAAapaPADDDPPPWEElpPEFASPAPAQPdAAPAGWVAESIPDPA 530
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 688577521  158 NVQPqtrpfvTGPRPTHHQFIHRSQMQPARPTLPTNNPSIRPG-SQTPTATVYPPNQPIMMTMTPM 222
Cdd:PRK12323  531 TADP------DDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRaSASGLPDMFDGDWPALAARLPV 590
PRK10263 PRK10263
DNA translocase FtsK; Provisional
63-288 1.01e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.39  E-value: 1.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   63 DERMFPAHSGVTA-----VYSVSRHPGPPFPGHDlskthPNLAG---TPPGHATSPALSQVSVPAGPSYRILKpwetggA 134
Cdd:PRK10263  275 DEEITYTARGVAAdpddvLFSGNRATQPEYDEYD-----PLLNGapiTEPVAVAAAATTATQSWAAPVEPVTQ------T 343
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  135 PPYNPAQNAGSAPLVySPQTQPmnvQPQTRPFVTGPRPTHHQfihrSQMQPARPTLPTNNPSIRP-GSQTPTATVYPPNQ 213
Cdd:PRK10263  344 PPVASVDVPPAQPTV-AWQPVP---GPQTGEPVIAPAPEGYP----QQSQYAQPAVQYNEPLQQPvQPQQPYYAPAAEQP 415
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 688577521  214 PIMMTMTPMPFATQTHQYYIPQYRHSA---PYVGPPQQYAVQPPGSGTFYpgpspAEYPTPYAAGPPYYTGQTVYPPS 288
Cdd:PRK10263  416 AQQPYYAPAPEQPAQQPYYAPAPEQPVagnAWQAEEQQSTFAPQSTYQTE-----QTYQQPAAQEPLYQQPQPVEQQP 488
dnaA PRK14086
chromosomal replication initiator protein DnaA;
110-364 1.13e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 47.13  E-value: 1.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  110 PALSQ-VSVPAGPSYRILKPWETGGAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGPRPTHHQFihrSQMQPARP 188
Cdd:PRK14086   68 PIISEtLSRELGRPIRIAITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQ---DQLPTARP 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  189 TLPTNNPSIRPGSQTPTATVYPPNQPIMMTMTPMPFATQTHQYyipqyrhsapyvgPPQQYAVQPPGSGTfypgpspAEY 268
Cdd:PRK14086  145 AYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASYA-------------PEQERDREPYDAGR-------PEY 204
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  269 PTPYaaGPPYYTGQTVYPPSPPIIVPApmpppptkrEKKPSSQIRIRDpNQGGKDITEEIMFGSRNPTPpaghpastlTP 348
Cdd:PRK14086  205 DQRR--RDYDHPRPDWDRPRRDRTDRP---------EPPPGAGHVHRG-GPGPPERDDAPVVPIRPSAP---------GP 263
                         250
                  ....*....|....*...
gi 688577521  349 PAGRPSSTPTP--PSGRL 364
Cdd:PRK14086  264 LAAQPAPAPGPgePTARL 281
PHA03378 PHA03378
EBNA-3B; Provisional
4-254 2.51e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.83  E-value: 2.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    4 PPKVVPKPAAV--AVSGHV------TGPAPPTQLRAALTSVSLPPGAQN-APPSAVPPT--QIPRAALSldeRMFPAHSG 72
Cdd:PHA03378  654 PPQVEITPYKPtwTQIGHIpyqpspTGANTMLPIQWAPGTMQPPPRAPTpMRPPAAPPGraQRPAAATG---RARPPAAA 730
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   73 VTAVYSVSRHPGPPFPGHDLSKTHPNLAGTP----PGHATSPALSQVSVPAGPSYRILKPwetGGAPPYNPAQNAGSAPL 148
Cdd:PHA03378  731 PGRARPPAAAPGRARPPAAAPGRARPPAAAPgrarPPAAAPGAPTPQPPPQAPPAPQQRP---RGAPTPQPPPQAGPTSM 807
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  149 VYSPQTQPMNVQPQT---RPFVTGP----RPTHHQfihRSQMQPARPTLPTNNPSIRPGSQTPTATV-YPP-NQPIMMTM 219
Cdd:PHA03378  808 QLMPRAAPGQQGPTKqilRQLLTGGvkrgRPSLKK---PAALERQAAAGPTPSPGSGTSDKIVQAPVfYPPvLQPIQVMR 884
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 688577521  220 ---TPMPFATQTHQYYIPQYRHSAPYVGPPQQYAVQPP 254
Cdd:PHA03378  885 qlgSVRAAAASTVTQAPTEYTGERRGVGPMHPTDIPPS 922
dnaA PRK14086
chromosomal replication initiator protein DnaA;
185-372 4.75e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.82  E-value: 4.75e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  185 PARPTLPTNNPSIRPGSQTPTATVYPpnQPIMMTMTPMPFATQTHQYYIPQyrhsAPYVGPPQQYAVQPPGSGTFYPGPS 264
Cdd:PRK14086   81 PIRIAITVDPSAGEPAPPPPHARRTS--EPELPRPGRRPYEGYGGPRADDR----PPGLPRQDQLPTARPAYPAYQQRPE 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  265 PAEYPTPYAAGPPYYtgQTVYPPSPPIIVPAPMPPPPTKREKKPSSQIR------IRDPNQGGKDITEEIMFGSRNPTPP 338
Cdd:PRK14086  155 PGAWPRAADDYGWQQ--QRLGFPPRAPYASPASYAPEQERDREPYDAGRpeydqrRRDYDHPRPDWDRPRRDRTDRPEPP 232
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 688577521  339 --AGHPASTLTPPAGRPS--------STPTPPSGRLSSTPTPPQ 372
Cdd:PRK14086  233 pgAGHVHRGGPGPPERDDapvvpirpSAPGPLAAQPAPAPGPGE 276
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
133-271 5.00e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 45.00  E-value: 5.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   133 GAPPYNPAQNAGSAPLVYSPQTQPMNVQPQTRP---FVTGPRPTHHQFIHRSQMQPAR--------PTLPTNNPSIRPGS 201
Cdd:pfam09606  281 GQPMGPPGQQPGAMPNVMSIGDQNNYQQQQTRQqqqQQGGNHPAAHQQQMNQSVGQGGqvvalgglNHLETWNPGNFGGL 360
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 688577521   202 QTPTATvypPNQPIMMTM-TPMPFAT----QTHQYYIPQYRHSAPYVGPPQQyavQPPGSGTFYPGPSPAEYPTP 271
Cdd:pfam09606  361 GANPMQ---RGQPGMMSSpSPVPGQQvrqvTPNQFMRQSPQPSVPSPQGPGS---QPPQSHPGGMIPSPALIPSP 429
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
3-274 1.31e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.60  E-value: 1.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521     3 LPPKVVPKPAAVAVSGhvTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPraalslderMFPAHSGVTAVYSVSRH 82
Cdd:pfam03154  293 VPPQPFPLTPQSSQSQ--VPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP---------LPPAPLSMPHIKPPPTT 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    83 PGPPFPGHDLSKTHPNLAGTPPGHATSpalsqvSVPAGPSYRILKPWETGGAPPYNPaqnagsAPLVYSPQTQPMNVQPQ 162
Cdd:pfam03154  362 PIPQLPNPQSHKHPPHLSGPSPFQMNS------NLPPPPALKPLSSLSTHHPPSAHP------PPLQLMPQSQQLPPPPA 429
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   163 TRPFVT-----GPRPTHH---QFIHRSQMQPARPT---LPTNNPSIRPGSQTPTAT------VYPPNQPIMMTMTPMPFA 225
Cdd:pfam03154  430 QPPVLTqsqslPPPAASHpptSGLHQVPSQSPFPQhpfVPGGPPPITPPSGPPTSTssampgIQPPSSASVSSSGPVPAA 509
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 688577521   226 TQTHQYYIpQYRHSAPYVGPPQQYAVQPPGSgtfyPGPSPAEYPTPYAA 274
Cdd:pfam03154  510 VSCPLPPV-QIKEEALDEAEEPESPPPPPRS----PSPEPTVVNTPSHA 553
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
4-121 1.83e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.16  E-value: 1.83e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    4 PPKVVPKPAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAALSldermfpahsgVTAVYSVSRHP 83
Cdd:PRK14951  386 AAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA-----------AVALAPAPPAQ 454
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 688577521   84 GPPFPGHDLSKTHPNLAGTPPGHATSPALSQVSVPAGP 121
Cdd:PRK14951  455 AAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
4-147 1.94e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.91  E-value: 1.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521    4 PPKVVPKPAAVAvsghvtGPAPPTQLRAALTSVSLPPGAQNAPPSAVP--------PTQIPRAALSLDERMFPAHSGVTA 75
Cdd:PRK07003  376 VAGAVPAPGARA------AAAVGASAVPAVTAVTGAAGAALAPKAAAAaaatraeaPPAAPAPPATADRGDDAADGDAPV 449
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 688577521   76 VYSVSRHPGPPFPGHDLS---KTHPNLAGTPPGHATSPALSQVSVPAGPSYRILKPWETGGAPPYNPAQNAGSAP 147
Cdd:PRK07003  450 PAKANARASADSRCDERDaqpPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAA 524
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
263-426 3.63e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.37  E-value: 3.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  263 PSPAEYPTPYAAGPPYYTGQTVYPPSPPIIVPApmppppTKREKKPSSQIRirDPNQGGKDITEEIMFGSRNPTPPAGHP 342
Cdd:PTZ00449  497 APIEEEDSDKHDEPPEGPEASGLPPKAPGDKEG------EEGEHEDSKESD--EPKEGGKPGETKEGEVGKKPGPAKEHK 568
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  343 ASTLTPPAGRPSSTPTPPSGRLSSTPTPPQRPSNCQTPeqtayvnqnqrlSESPAPMdgKPSLDDRPKMESGPIKSISPG 422
Cdd:PTZ00449  569 PSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRP------------TRPKSPK--LPELLDIPKSPKRPESPKSPK 634

                  ....
gi 688577521  423 PRPS 426
Cdd:PTZ00449  635 RPPP 638
TYA pfam01021
Ty transposon capsid protein; Ty are yeast transposons. A 5.7kb transcript codes for p3 a ...
158-287 5.14e-03

Ty transposon capsid protein; Ty are yeast transposons. A 5.7 kb transcript codes for p3, a fusion protein of TYA and TYB. The TYA protein is analogous to the gag protein of retroviruses. TYA is cleaved to form a 46 kDa protein, which can form mature virion-like particles. This entry corresponds to the capsid protein from Ty1 and Ty2 transposons.


Pssm-ID: 425992  Cd Length: 384  Bit Score: 41.10  E-value: 5.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   158 NVQPQTRPfVTGPRPTHHqfiHRSQMQPARPTLPTN--------------NPSIRPGSQTPTATVYPPNQpimmtMTPMP 223
Cdd:pfam01021   35 NSQQTTTP-GSSAVPENH---HHASPQPASVPPPQNgpysqqcmmtpnqaNPSGWPFYGHPSMMPYTPYQ-----MSPMY 105
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 688577521   224 FATQTHqYYIPQYrhsAPYVGPPqqYAVQPPGSGTFYPGPSPAEYPTPyaagppyYTGQTVYPP 287
Cdd:pfam01021  106 FPPGPQ-SQFPQY---PSSVGTP--LSTPSPESGNTFTDSSSAKSDMT-------STNKYVRPP 156
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
11-371 5.91e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 5.91e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   11 PAAVAVSGHVTGPAPPTQLRAALTSVSLPPGAQNAPPSAVPPTQIPRAALSLDERMFPAHSGVTAVYSVSRHPGPPFPGH 90
Cdd:PHA03307   31 AADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPG 110
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521   91 DLSKTHPNLAGTPPGHATSPAlSQVSVPAGPSYRILKPWETGGAPPYNPAQNAGSAPlvySPQTQPMNVQPQTRPFVTGP 170
Cdd:PHA03307  111 PSSPDPPPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDA---ASSRQAALPLSSPEETARAP 186
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  171 RPTHHQFIHRSQMQPARPTLPTNNPSIRPGSQTPTAT------VYPPNQPIMMTMTPMPFATQTHQYYIPQYRHSaPYVG 244
Cdd:PHA03307  187 SSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPApgrsaaDDAGASSSDSSSSESSGCGWGPENECPLPRPA-PITL 265
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  245 PPQQYAVQPPGSGTFYPGPSPAEYPTPYAAGPPYytgqtvyPPSPPIIVPAPMPPPPTKREKKPSSQIrirDPNQGGKDI 324
Cdd:PHA03307  266 PTRIWEASGWNGPSSRPGPASSSSSPRERSPSPS-------PSSPGSGPAPSSPRASSSSSSSRESSS---SSTSSSSES 335
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|...
gi 688577521  325 TEEIMFG-----SRNPTPPAGHPASTLTPPAGRPSSTPTPPSGRLSS-TPTPP 371
Cdd:PHA03307  336 SRGAAVSpgpspSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAgRPTRR 388
KLF1_N cd21581
N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as ...
139-277 7.58e-03

N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as Krueppel-like factor 1 or Erythroid Kruppel-like Factor/EKLF) was the first Kruppel-like factor discovered. It was found to be vitally important for embryonic erythropoiesis in promoting the switch from fetal hemoglobin (Hemoglobin F) to adult hemoglobin (Hemoglobin A) gene expression by binding to highly conserved CACCC domains. EKLF ablation in mouse embryos produces a lethal anemic phenotype, causing death by embryonic day 14, and natural mutations lead to beta+ thalassemia in humans. However, expression of embryonic hemoglobin and fetal hemoglobin genes is normal in EKLF-deficient mice, suggesting other factors may be involved. KLF1 functions as a transcriptional activator. It belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF1, which is related to the N-terminal domains of KLF2 and KLF4.


Pssm-ID: 409227 [Multi-domain]  Cd Length: 278  Bit Score: 40.41  E-value: 7.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  139 PAQNAGSAPLVYSPQTQPMNVQPQTRPFVTGP----------RPTHHQFIHRSQMQPAR-PTL-PTNNPSIRPGSQTPTA 206
Cdd:cd21581    93 EEQPGAYYEPPKKDQPGTEGLQVGGPGLMAELlspeestgwaPPEPHHGYPDAFVGPALfPAPaNVDQFGFPQGGSVDRR 172
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 688577521  207 TV------------YPPNQPIMMTMTPMPF----ATQT------HQYYIPQYRHSApyvGPPQQYAvQPPGSGTFYPGPS 264
Cdd:cd21581   173 GNlsksgswdfgsyYPQQHPSVVAFPDSRFgplsGPQAltpdpqHYGYFQLFRHNA---ALFPDYA-HSPGPGHLPLGQQ 248
                         170
                  ....*....|....*
gi 688577521  265 P--AEYPTPYAAGPP 277
Cdd:cd21581   249 PllPDPPLPPGGAEG 263
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
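
Each hit in the listing above carries a summary line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", and the preset options give an E-value threshold of 0.01. The following is a minimal Python sketch, not an NCBI tool, of how such summary lines could be parsed from a saved text copy of this page and checked against that threshold. The names `HIT_LINE`, `parse_hit_line`, and `passes_threshold` are hypothetical, and treating the threshold as inclusive (`<=`) is an assumption.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing, e.g.
# "Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 48.49  E-value: 4.01e-05"
# The "[Multi-domain]" tag is optional; some hits omit it.
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return a dict of the summary fields, or None if the line is not a summary line."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("kind") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed inclusive comparison against the listed E-value threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    sample = "Pssm-ID: 409227 [Multi-domain]  Cd Length: 278  Bit Score: 40.41  E-value: 7.58e-03"
    hit = parse_hit_line(sample)
    print(hit, passes_threshold(hit))
```

Every hit shown on this page reports an E-value below 0.01, which is consistent with the listed threshold having been applied before display.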

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.