Conserved domains on  [gi|1207195306|ref|XP_021330220|]
eukaryotic translation initiation factor 4 gamma 3 isoform X2 [Danio rerio]

Protein Classification

eukaryotic translation initiation factor 4 gamma 3 (domain architecture ID 10501431)

Eukaryotic translation initiation factor 4 gamma 3 (EIF4G3) is a component of the eIF4F protein complex, which is involved in recognition of the mRNA cap, ATP-dependent unwinding of 5'-terminal secondary structure, and recruitment of mRNA to the ribosome.

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
886-1114 3.44e-60

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 205.29  E-value: 3.44e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  886 FRKVRSILNKLTPQMFSQLMKQVTDLTIDTEERLKGVIDLVFEKAINEPSFSVAYGNMCSCLATLkvpmtdkpnSTVNFR 965
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  966 KLLLNRCQKEFEKDKMdddafekKHREleaatassererlqeeleeaKDKARRRSIGNIKFIGELFKLRMLTEAIMHDCV 1045
Cdd:pfam02854   72 IHLLNRLQEEFEKRFE-------LEEN--------------------EQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1207195306 1046 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLHN 1114
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1552-1687 2.45e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 168.62  E-value: 2.45e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1552 LSPEELNKQLEKLLLEDMVGDEqIFDWVEANLDESEMSSAPFVRALMTAVCKAAVkTEGSSCKVDLSIIQTRLPVLHKYL 1631
Cdd:cd11559      1 LPLLRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAI-EEKSLPEKEKALLEKYAPLLQKYL 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1207195306 1632 NSDTERQLQALYALQALIVKLDQPANLLRMFFDCLYDEDVISEDAFYKWEVSKDPA 1687
Cdd:cd11559     79 DDDEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1354-1465 1.63e-33

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 125.47  E-value: 1.63e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1354 ERRSKSIIDEFLHINDYKEALQCVEELEQSAMLYVFVRVGVESTLERSQITRDHMGQLLFQLLQAGVLLKLQFFKGFSET 1433
Cdd:pfam02847    2 KRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWRV 81
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1207195306 1434 LELADDMAIDIPHIWLYLAELVTPVLREGGIS 1465
Cdd:pfam02847   82 LEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PHA03247 super family cl33720
large tegument protein UL36; Provisional
59-615 3.07e-15

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 82.29  E-value: 3.07e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   59 PPPLDERIFSTQPVSAVYSVQ---RPPGPPFTAHEINKGHPNLAATP--PG--HASSPGLSQVSVSTVSTahlygHPKGW 131
Cdd:PHA03247  2552 PPPLPPAAPPAAPDRSVPPPRpapRPSEPAVTSRARRPDAPPQSARPraPVddRGDPRGPAPPSPLPPDT-----HAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  132 EPGGGSPYTTgQNAGTTPLVYSPPTQPMNAQPQSRpfAPGPRPTHHQG-GFRSIQFFQRTQMQTARPTIPSNTPPIRPTS 210
Cdd:PHA03247  2627 PPPSPSPAAN-EPDPHPPPTVPPPERPRDDPAPGR--VSRPRRARRLGrAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  211 QTPTAAvysPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQyvgPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYY 290
Cdd:PHA03247  2704 PPPTPE---PAPHALVSATPLPPGPAAARQASPALPAAPAPP---AVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  291 PGQPVYTPSP--PIIVPTPQQPPPAKREKKTIRIRDPNQGgkdvtdEILSGVGLSRNPTPPVGRPSSTPTPPQQLNSQVA 368
Cdd:PHA03247  2778 GPPRRLTRPAvaSLSESRESLPSPWDPADPPAAVLAPAAA------LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  369 DHGHIMYNVDSS---PHLPAPFNLKADDKPKLEF----SLQRTASPGLRQPDTPLERRDPSSPVQ---TPSSPPHKPELP 438
Cdd:PHA03247  2852 LGGSVAPGGDVRrrpPSRSPAAKPAAPARPPVRRlarpAVSRSTESFALPPDQPERPPQPQAPPPpqpQPQPPPPPQPQP 2931
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  439 PSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSSSPPPQSLSGSLTQHekAVNGLTDVdA 518
Cdd:PHA03247  2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH--SLSRVSSW-A 3008
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  519 APLSEELETQPREASpLLPTSSVP-QSEPRPVTPVLEEESDPINMDS--PLPPVEDD--AGCPDNVSPSLSTSTTAAIST 593
Cdd:PHA03247  3009 SSLALHEETDPPPVS-LKQTLWPPdDTEDSDADSLFDSDSERSDLEAldPLPPEPHDpfAHEPDPATPEAGARESPSSQF 3087
                          570       580
                   ....*....|....*....|..
gi 1207195306  594 TPPapppglshPSQVSAALDRR 615
Cdd:PHA03247  3088 GPP--------PLSANAALSRR 3101
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1666-1712 1.46e-06

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 50.67  E-value: 1.46e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1207195306 1666 LYDEDVISEDAFYKWevSKDPAEQQGKGVALKSVTAFFTWLREAEEE 1712
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PTZ00108 super family cl36510
DNA topoisomerase 2-like protein; Provisional
1077-1354 4.44e-03

DNA topoisomerase 2-like protein; Provisional


The actual alignment was detected with superfamily member PTZ00108:

Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.96  E-value: 4.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1077 KPRMDQYFNQMEKIVKERKTSSRIrfmlqDVIDLrlhnWVsRRADQGPKTIEQIHKDAKLEEQEEQRKVHQQLLSKDNKR 1156
Cdd:PTZ00108  1101 KEKVEKLNAELEKKEKELEKLKNT-----TPKDM----WL-EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLR 1170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1157 RPVVQREETWSTVPMTKNSRTIDPAKIPKFSKSAIDEKIQLGPRAQVNWMKGSSGGAGAKASESDASRPSaslNRYSPLQ 1236
Cdd:PTZ00108  1171 KPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSV---KRLKSKK 1247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1237 PSALQTSSLPSTSPDFDSRRVLGSRG--SSGRERNDKPLSAGPARTGPISLSSSNKETPEELVQ---EVSRRDSNASDTP 1311
Cdd:PTZ00108  1248 NNSSKSSEDNDEFSSDDLSKEGKPKNapKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKkrlEGSLAALKKKKKS 1327
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1207195306 1312 KLLVSTADKS--RLENSQPRESAVKLEALSGPSPDKPALSEEEME 1354
Cdd:PTZ00108  1328 EKKTARKKKSktRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSE 1372
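Each hit in the list above follows the same three-line pattern: a "Name Accession" line, a description line, and an "Interval E-value" line. The following Python sketch shows one possible way to collect those summaries from the plain-text listing. The `DomainHit` type and the `parse_domain_hits` helper are hypothetical names introduced for illustration and are not part of any NCBI tool. The sketch assumes the header line sits exactly two lines above the interval line, as it does in this listing.

```python
import re
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record for one row of the domain hit list."""
    name: str
    accession: str
    start: int
    end: int
    evalue: float


# Matches an "Interval E-value" line such as "886-1114 3.44e-60".
INTERVAL_LINE = re.compile(
    r"^(\d+)-(\d+)\s+(\d+(?:\.\d+)?(?:e[+-]?\d+)?)$", re.IGNORECASE
)


def parse_domain_hits(lines: List[str]) -> List[DomainHit]:
    """Pair every interval line with the header line two lines above it.

    Assumption: the layout is header, description, interval, as in this listing.
    """
    hits = []
    for i, line in enumerate(lines):
        match = INTERVAL_LINE.match(line.strip())
        if match is None or i < 2:
            continue
        header = lines[i - 2].split()
        if len(header) < 2:
            continue
        hits.append(
            DomainHit(
                # The accession is the last token; the rest is the name,
                # which may include the words "super family".
                name=" ".join(header[:-1]),
                accession=header[-1],
                start=int(match.group(1)),
                end=int(match.group(2)),
                evalue=float(match.group(3)),
            )
        )
    return hits


# Two records copied from the listing above.
sample = [
    "MIF4G pfam02854",
    "MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...",
    "886-1114 3.44e-60",
    "",
    "PHA03247 super family cl33720",
    "large tegument protein UL36; Provisional",
    "59-615 3.07e-15",
]

# Print hits from most to least significant, with the covered length in residues.
for hit in sorted(parse_domain_hits(sample), key=lambda h: h.evalue):
    print(hit.accession, hit.start, hit.end, hit.end - hit.start + 1, hit.evalue)
```

Sorting by E-value puts the most significant hit first. The covered length is computed as `end - start + 1` because the intervals are inclusive.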
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
886-1114 3.44e-60

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 205.29  E-value: 3.44e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  886 FRKVRSILNKLTPQMFSQLMKQVTDLTIDTEERLKGVIDLVFEKAINEPSFSVAYGNMCSCLATLkvpmtdkpnSTVNFR 965
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  966 KLLLNRCQKEFEKDKMdddafekKHREleaatassererlqeeleeaKDKARRRSIGNIKFIGELFKLRMLTEAIMHDCV 1045
Cdd:pfam02854   72 IHLLNRLQEEFEKRFE-------LEEN--------------------EQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1207195306 1046 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLHN 1114
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
887-1111 1.93e-50

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues" (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 177.55  E-value: 1.93e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   887 RKVRSILNKLTPQMFSQLMKQVTDLTIDTEERLKGVIDLVFEKAINEPSFSVAYGNMCSCLAtLKVPmtdkpnstvNFRK 966
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   967 LLLNRCQKEFEKDKMDDDAfekkhreleaatassererlqeeleeakdKARRRSIGNIKFIGELFKLRMLTEAIMHDCVV 1046
Cdd:smart00543   72 LLLERLQEEFEKGLESEEE-----------------------------SDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1207195306  1047 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1111
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1552-1687 2.45e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 168.62  E-value: 2.45e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1552 LSPEELNKQLEKLLLEDMVGDEqIFDWVEANLDESEMSSAPFVRALMTAVCKAAVkTEGSSCKVDLSIIQTRLPVLHKYL 1631
Cdd:cd11559      1 LPLLRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAI-EEKSLPEKEKALLEKYAPLLQKYL 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1207195306 1632 NSDTERQLQALYALQALIVKLDQPANLLRMFFDCLYDEDVISEDAFYKWEVSKDPA 1687
Cdd:cd11559     79 DDDEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1354-1465 1.63e-33

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 125.47  E-value: 1.63e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1354 ERRSKSIIDEFLHINDYKEALQCVEELEQSAMLYVFVRVGVESTLERSQITRDHMGQLLFQLLQAGVLLKLQFFKGFSET 1433
Cdd:pfam02847    2 KRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWRV 81
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1207195306 1434 LELADDMAIDIPHIWLYLAELVTPVLREGGIS 1465
Cdd:pfam02847   82 LEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1354-1465 1.05e-31

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TIBS) "Novel eIF4G domain homologues" (in press)


Pssm-ID: 214714  Cd Length: 113  Bit Score: 120.43  E-value: 1.05e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  1354 ERRSKSIIDEFLHINDYKEALQCVEELEQSAMLYVFVRVGVESTLERSQITRDHMGQLLFQLLQAGVLLKLQFFKGFSET 1433
Cdd:smart00544    2 KKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWRL 81
                            90       100       110
                    ....*....|....*....|....*....|..
gi 1207195306  1434 LELADDMAIDIPHIWLYLAELVTPVLREGGIS 1465
Cdd:smart00544   82 LEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1625-1709 1.34e-26

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 104.68  E-value: 1.34e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  1625 PVLHKYLNSDTERQLQALYALQALIVKLDQPANLLRMFFDCLYDEDVISEDAFYKWEVSKDPAEqqGKGVALKSVTAFFT 1704
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1207195306  1705 WLREA 1709
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1638-1714 1.02e-25

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 101.84  E-value: 1.02e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1207195306 1638 QLQALYALQALIVKLDQPANLLRMFFDCLYDEDVISEDAFYKWEvSKDPAEQQGKGVALKSVTAFFTWLREAEEESE 1714
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWW-EDVSSAEKGMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
59-615 3.07e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 82.29  E-value: 3.07e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   59 PPPLDERIFSTQPVSAVYSVQ---RPPGPPFTAHEINKGHPNLAATP--PG--HASSPGLSQVSVSTVSTahlygHPKGW 131
Cdd:PHA03247  2552 PPPLPPAAPPAAPDRSVPPPRpapRPSEPAVTSRARRPDAPPQSARPraPVddRGDPRGPAPPSPLPPDT-----HAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  132 EPGGGSPYTTgQNAGTTPLVYSPPTQPMNAQPQSRpfAPGPRPTHHQG-GFRSIQFFQRTQMQTARPTIPSNTPPIRPTS 210
Cdd:PHA03247  2627 PPPSPSPAAN-EPDPHPPPTVPPPERPRDDPAPGR--VSRPRRARRLGrAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  211 QTPTAAvysPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQyvgPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYY 290
Cdd:PHA03247  2704 PPPTPE---PAPHALVSATPLPPGPAAARQASPALPAAPAPP---AVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  291 PGQPVYTPSP--PIIVPTPQQPPPAKREKKTIRIRDPNQGgkdvtdEILSGVGLSRNPTPPVGRPSSTPTPPQQLNSQVA 368
Cdd:PHA03247  2778 GPPRRLTRPAvaSLSESRESLPSPWDPADPPAAVLAPAAA------LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  369 DHGHIMYNVDSS---PHLPAPFNLKADDKPKLEF----SLQRTASPGLRQPDTPLERRDPSSPVQ---TPSSPPHKPELP 438
Cdd:PHA03247  2852 LGGSVAPGGDVRrrpPSRSPAAKPAAPARPPVRRlarpAVSRSTESFALPPDQPERPPQPQAPPPpqpQPQPPPPPQPQP 2931
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  439 PSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSSSPPPQSLSGSLTQHekAVNGLTDVdA 518
Cdd:PHA03247  2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH--SLSRVSSW-A 3008
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  519 APLSEELETQPREASpLLPTSSVP-QSEPRPVTPVLEEESDPINMDS--PLPPVEDD--AGCPDNVSPSLSTSTTAAIST 593
Cdd:PHA03247  3009 SSLALHEETDPPPVS-LKQTLWPPdDTEDSDADSLFDSDSERSDLEAldPLPPEPHDpfAHEPDPATPEAGARESPSSQF 3087
                          570       580
                   ....*....|....*....|..
gi 1207195306  594 TPPapppglshPSQVSAALDRR 615
Cdd:PHA03247  3088 GPP--------PLSANAALSRR 3101
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
188-606 1.73e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 59.78  E-value: 1.73e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  188 QRTQMQTARPTIPSNTPPIRPTSQTPTAAvyspnqhimmtmAHMPFHSPQTAQYYIPqyrhsaPQYVGPPQQYPVQPTGP 267
Cdd:pfam03154  163 QQQILQTQPPVLQAQSGAASPPSPPPPGT------------TQAATAGPTPSAPSVP------PQGSPATSQPPNQTQST 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  268 STFYAAASPGEFPAPYAAGPPYYPGQPVYTPSPPIIVPtPQQPPPAKREKKTIRIRDPNQGGKDVTDEILSGVGLsrnPT 347
Cdd:pfam03154  225 AAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVS-PQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPF---PL 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  348 PPVGRPSSTPTPPQqlnSQVADHGHimynvdSSPHLPAPFNLKADDKPKLEFSLQRT--ASPGLRQPDT--------PLE 417
Cdd:pfam03154  301 TPQSSQSQVPPGPS---PAAPGQSQ------QRIHTPPSQSQLQSQQPPREQPLPPAplSMPHIKPPPTtpipqlpnPQS 371
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  418 RRDPS-----SPVQTPSSPPHKPELPPSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSS 492
Cdd:pfam03154  372 HKHPPhlsgpSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTS 451
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  493 SPPPQSLSGSLTQHEKAVNGLTDVDAAplseelETQPREASPLLPTSSVPQSEPRPVTPVLeeesdPINMDSPLPPVEDD 572
Cdd:pfam03154  452 GLHQVPSQSPFPQHPFVPGGPPPITPP------SGPPTSTSSAMPGIQPPSSASVSSSGPV-----PAAVSCPLPPVQIK 520
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1207195306  573 AGCPDNvspslststtaaiSTTPPAPPPGLSHPS 606
Cdd:pfam03154  521 EEALDE-------------AEEPESPPPPPRSPS 541
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1666-1712 1.46e-06

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 50.67  E-value: 1.46e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1207195306 1666 LYDEDVISEDAFYKWevSKDPAEQQGKGVALKSVTAFFTWLREAEEE 1712
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
176-316 5.51e-04

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Mutations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 42.47  E-value: 5.51e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   176 HHQGGFRSIQF--FQRTQMQTARPTIPSNTPPIRPTSQTPTaavysPNQHIMMTMAHMPFHSPQTAQYYIPqyrhsaPQY 253
Cdd:smart00818   36 HHQIIPVSQQHppTHTLQPHHHIPVLPAQQPVVPQQPLMPV-----PGQHSMTPTQHHQPNLPQPAQQPFQ------PQP 104
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1207195306   254 VGPPQ-QYPVQPTGPstfyaaASPGEFPAPYAAGPPYYPGQPVytpsPPIIVPTPQQPPPA----KRE 316
Cdd:smart00818  105 LQPPQpQQPMQPQPP------VHPIPPLPPQPPLPPMFPMQPL----PPLLPDLPLEAWPAtdktKRE 162
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1077-1354 4.44e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.96  E-value: 4.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1077 KPRMDQYFNQMEKIVKERKTSSRIrfmlqDVIDLrlhnWVsRRADQGPKTIEQIHKDAKLEEQEEQRKVHQQLLSKDNKR 1156
Cdd:PTZ00108  1101 KEKVEKLNAELEKKEKELEKLKNT-----TPKDM----WL-EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLR 1170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1157 RPVVQREETWSTVPMTKNSRTIDPAKIPKFSKSAIDEKIQLGPRAQVNWMKGSSGGAGAKASESDASRPSaslNRYSPLQ 1236
Cdd:PTZ00108  1171 KPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSV---KRLKSKK 1247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1237 PSALQTSSLPSTSPDFDSRRVLGSRG--SSGRERNDKPLSAGPARTGPISLSSSNKETPEELVQ---EVSRRDSNASDTP 1311
Cdd:PTZ00108  1248 NNSSKSSEDNDEFSSDDLSKEGKPKNapKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKkrlEGSLAALKKKKKS 1327
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1207195306 1312 KLLVSTADKS--RLENSQPRESAVKLEALSGPSPDKPALSEEEME 1354
Cdd:PTZ00108  1328 EKKTARKKKSktRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSE 1372
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
288-657 6.94e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.19  E-value: 6.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  288 PYYPGQPVYTPSP---PIIVPTPQQPPPAKREKKTIRIRDPNQGGKDVTDEILSGVGLSRnptppVGRPSSTPTPPqqln 364
Cdd:COG5665    177 IAVPSAPAAPPNAvdySVLVPIAAQDPAASVSTPQAFNASATSGRSQHIVQAAKRVGVEW-----WGDPSLLATPP---- 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  365 sqvadHGHIMYNVDSSPHLPAPFNLKADDKPKLEFSLQRTASPGLR-----QPDTPLE--RRDPSSPVQTPSSPPHKPEL 437
Cdd:COG5665    248 -----ATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTSNTPTSTakaqpQPPTKKQpaKEPPSDTASGNPSAPSVLIN 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  438 PPSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSSSPPPQSLSGSLTQHEKAVNGLTdvD 517
Cdd:COG5665    323 SDSPTSEDPATASVPTTEETTAFTTPSSVPSTPAEKDTPATDLATPVSPTPPETSVDKKVSPDSATSSTKSEKEGGT--A 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  518 AAPLSEelETQPREASPLLPTSSVPQSEPRPVTPVLEEESDPINMDSPLPPVEDDAGCPDNVSPSLsTSTTAAISTTPPA 597
Cdd:COG5665    401 SSPMPP--NIAIGAKDDVDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAGSDLEPENTT-LRDPAPNAIPPPE 477
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  598 PPPGLSHPSQVSAALDrrpSNGAEIKETGKENEALPDKRGEPFLQSRKSSNQATSSAPKT 657
Cdd:COG5665    478 DPSTIGRLSSGDKLAN---ETGPPVIRRDSTPSSTADQSIVGVLAFGLDQRTQAEISVEA 534
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
887-1111 1.93e-50

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214713  Cd Length: 200  Bit Score: 177.55  E-value: 1.93e-50
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   887 RKVRSILNKLTPQMFSQLMKQVTDLTIDTEERLKGVIDLVFEKAINEPSFSVAYGNMCSCLAtLKVPmtdkpnstvNFRK 966
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   967 LLLNRCQKEFEKDKMDDDAfekkhreleaatassererlqeeleeakdKARRRSIGNIKFIGELFKLRMLTEAIMHDCVV 1046
Cdd:smart00543   72 LLLERLQEEFEKGLESEEE-----------------------------SDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1207195306  1047 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1111
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1552-1687 2.45e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 168.62  E-value: 2.45e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1552 LSPEELNKQLEKLLLEDMVGDEqIFDWVEANLDESEMSSAPFVRALMTAVCKAAVkTEGSSCKVDLSIIQTRLPVLHKYL 1631
Cdd:cd11559      1 LPLLRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAI-EEKSLPEKEKALLEKYAPLLQKYL 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1207195306 1632 NSDTERQLQALYALQALIVKLDQPANLLRMFFDCLYDEDVISEDAFYKWEVSKDPA 1687
Cdd:cd11559     79 DDDEQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1354-1465 1.63e-33

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 125.47  E-value: 1.63e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1354 ERRSKSIIDEFLHINDYKEALQCVEELEQSAMLYVFVRVGVESTLERSQITRDHMGQLLFQLLQAGVLLKLQFFKGFSET 1433
Cdd:pfam02847    2 KRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWRV 81
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1207195306 1434 LELADDMAIDIPHIWLYLAELVTPVLREGGIS 1465
Cdd:pfam02847   82 LEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1354-1465 1.05e-31

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains. Ponting (TIBS) "Novel eIF4G domain homologues" (in press).


Pssm-ID: 214714  Cd Length: 113  Bit Score: 120.43  E-value: 1.05e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  1354 ERRSKSIIDEFLHINDYKEALQCVEELEQSAMLYVFVRVGVESTLERSQITRDHMGQLLFQLLQAGVLLKLQFFKGFSET 1433
Cdd:smart00544    2 KKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWRL 81
                            90       100       110
                    ....*....|....*....|....*....|..
gi 1207195306  1434 LELADDMAIDIPHIWLYLAELVTPVLREGGIS 1465
Cdd:smart00544   82 LEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1625-1709 1.34e-26

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 104.68  E-value: 1.34e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  1625 PVLHKYLNSDTERQLQALYALQALIVKLDQPANLLRMFFDCLYDEDVISEDAFYKWEVSKDPAEqqGKGVALKSVTAFFT 1704
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1207195306  1705 WLREA 1709
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1638-1714 1.02e-25

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 101.84  E-value: 1.02e-25
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1207195306 1638 QLQALYALQALIVKLDQPANLLRMFFDCLYDEDVISEDAFYKWEvSKDPAEQQGKGVALKSVTAFFTWLREAEEESE 1714
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWW-EDVSSAEKGMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1555-1681 2.51e-21

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 91.38  E-value: 2.51e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1555 EELNKQLEKLLLEDMVGDEQIFDWVEANLDESEMSSAPFVRALMTAVCKAAVKTEGSS---CKVDLSIIQTRLPVLHKYL 1631
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISltqKEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1207195306 1632 NSDTERQLQALYALQAL--IVKLDQPANLLRMFFDCLYDEDVISEDAFYKWE 1681
Cdd:cd11473     84 KLIKKDQLYLLLKIEKLclQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1595-1714 4.57e-16

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are similar in sequence and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 77.68  E-value: 4.57e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1595 RALMTAVCKAAVKTEGSSCKVDLSIIQTRL----PVLHKYLNSDTErQLQALYALQALIVKLDQPANLLRMFFDCLYDED 1670
Cdd:cd11558     47 RAVVKALLELILEVSSTSTAELLEALKKLLskwgPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1207195306 1671 VISEDAFYKWEVSKDPAEQQGKGVALKSVTAFFTWLREAEEESE 1714
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
PHA03247 PHA03247
large tegument protein UL36; Provisional
59-615 3.07e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 82.29  E-value: 3.07e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   59 PPPLDERIFSTQPVSAVYSVQ---RPPGPPFTAHEINKGHPNLAATP--PG--HASSPGLSQVSVSTVSTahlygHPKGW 131
Cdd:PHA03247  2552 PPPLPPAAPPAAPDRSVPPPRpapRPSEPAVTSRARRPDAPPQSARPraPVddRGDPRGPAPPSPLPPDT-----HAPDP 2626
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  132 EPGGGSPYTTgQNAGTTPLVYSPPTQPMNAQPQSRpfAPGPRPTHHQG-GFRSIQFFQRTQMQTARPTIPSNTPPIRPTS 210
Cdd:PHA03247  2627 PPPSPSPAAN-EPDPHPPPTVPPPERPRDDPAPGR--VSRPRRARRLGrAAQASSPPQRPRRRAARPTVGSLTSLADPPP 2703
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  211 QTPTAAvysPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQyvgPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYY 290
Cdd:PHA03247  2704 PPPTPE---PAPHALVSATPLPPGPAAARQASPALPAAPAPP---AVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  291 PGQPVYTPSP--PIIVPTPQQPPPAKREKKTIRIRDPNQGgkdvtdEILSGVGLSRNPTPPVGRPSSTPTPPQQLNSQVA 368
Cdd:PHA03247  2778 GPPRRLTRPAvaSLSESRESLPSPWDPADPPAAVLAPAAA------LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  369 DHGHIMYNVDSS---PHLPAPFNLKADDKPKLEF----SLQRTASPGLRQPDTPLERRDPSSPVQ---TPSSPPHKPELP 438
Cdd:PHA03247  2852 LGGSVAPGGDVRrrpPSRSPAAKPAAPARPPVRRlarpAVSRSTESFALPPDQPERPPQPQAPPPpqpQPQPPPPPQPQP 2931
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  439 PSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSSSPPPQSLSGSLTQHekAVNGLTDVdA 518
Cdd:PHA03247  2932 PPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH--SLSRVSSW-A 3008
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  519 APLSEELETQPREASpLLPTSSVP-QSEPRPVTPVLEEESDPINMDS--PLPPVEDD--AGCPDNVSPSLSTSTTAAIST 593
Cdd:PHA03247  3009 SSLALHEETDPPPVS-LKQTLWPPdDTEDSDADSLFDSDSERSDLEAldPLPPEPHDpfAHEPDPATPEAGARESPSSQF 3087
                          570       580
                   ....*....|....*....|..
gi 1207195306  594 TPPapppglshPSQVSAALDRR 615
Cdd:PHA03247  3088 GPP--------PLSANAALSRR 3101
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-667 1.39e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 79.98  E-value: 1.39e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   57 RVPPPLDERIFSTQPVSAVYSVQRPPGPP---FTAHE-------INKGHPNLAATP----PGHASSPGLSQVSVSTVSTA 122
Cdd:PHA03247  2393 RSPPCLVLVDISMAPLFVLWEQPDPPGPPdvrFVGSEeieelpfVSPGGDVLAGLAadgdPFFARTILGAPFSLSLLLGE 2472
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  123 HLYGHPKGWEPG--------GGSPyttGQNAGTTPLVYSPPTqPMNAQPQSRPFAPGPRPTHHQ--GGFRSIQFFQRTQM 192
Cdd:PHA03247  2473 LFPGAPVYRRPAearfpfaaGAAP---DPGGGGPPDPDAPPA-PSRLAPAILPDEPVGEPVHPRmlTWIRGLEELASDDA 2548
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  193 QTARPTIPSNTPPIRPTSQTPTA-AVYSPNQHIMMTMAHMPFHSPQTAQYYIP-----QYRHSAPQYVGPPQQYPVQPTG 266
Cdd:PHA03247  2549 GDPPPPLPPAAPPAAPDRSVPPPrPAPRPSEPAVTSRARRPDAPPQSARPRAPvddrgDPRGPAPPSPLPPDTHAPDPPP 2628
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  267 PSTFYAAASPGEFPApyAAGPPyyPGQPVYTPSPPIIVPtpqqPPPAKREKKTIRIRDPNQGGKD--VTDEILSGVGLSR 344
Cdd:PHA03247  2629 PSPSPAANEPDPHPP--PTVPP--PERPRDDPAPGRVSR----PRRARRLGRAAQASSPPQRPRRraARPTVGSLTSLAD 2700
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  345 NPTPPvgrPSSTPTPPQQLNSQVADHGHIMYNVDSSPHLPAPFNLKADDKPKLEFSLQRTASPGLrqPDTPLERRDPSSP 424
Cdd:PHA03247  2701 PPPPP---PTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPT--TAGPPAPAPPAAP 2775
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  425 VQTPssPPHKPelPPSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSSSPPPQSLSGSLT 504
Cdd:PHA03247  2776 AAGP--PRRLT--RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  505 QHEKAVNGltdvdaAPLSEELETQPREASPLLPTSSVPQSEPRPVTPvLEEESDPINMDSPLPPVEDDAGCPdnvsPSLS 584
Cdd:PHA03247  2852 LGGSVAPG------GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVS-RSTESFALPPDQPERPPQPQAPPP----PQPQ 2920
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  585 TSTTAAISTTPPAPPPGLShPSQVSAALDRRPSNGAEIKETGKENEALPDKRgEPFLQSRKSSNQATSSAPKTWKKPKED 664
Cdd:PHA03247  2921 PQPPPPPQPQPPPPPPPRP-QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGR-VAVPRFRVPQPAPSREAPASSTPPLTG 2998

                   ...
gi 1207195306  665 MPV 667
Cdd:PHA03247  2999 HSL 3001
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1573-1714 6.00e-11

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 62.63  E-value: 6.00e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1573 EQIFDWVEANLDESEMS-------SAPFVRALMTAVCKAAVKTEGSSCKVDLsiIQTRLPVLHKYLNSDtERQLQALYAL 1645
Cdd:cd11561      9 DELGEFLKKNKDESGLSelkeilkEAERLDVVKDKAVLVLAEVLFDENIVKE--IKKRKALLLKLVTDE-KAQKALLGGI 85
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1207195306 1646 QALIVK-----LDQPANLLRmffdCLYDEDVISEDAFYKW--EVSKDPAEQQGKGVALKSVTAFFTWLREAEEESE 1714
Cdd:cd11561     86 ERFCGKhspelLKKVPLILK----ALYDNDILEEEVILKWyeKVSKKYVSKEKSKKVRKAAEPFVEWLEEAEEEEE 157
PHA03247 PHA03247
large tegument protein UL36; Provisional
52-459 1.35e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.42  E-value: 1.35e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   52 PSRYPRV--PPPLDERIFSTQPVSAVYSVQRPPGPPFT----AHEINKGHPNLAATPPGHASSPGLSQVSVsTVSTAHLY 125
Cdd:PHA03247  2670 LGRAAQAssPPQRPRRRAARPTVGSLTSLADPPPPPPTpepaPHALVSATPLPPGPAAARQASPALPAAPA-PPAVPAGP 2748
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  126 GHPKGWEPGGGSPYTTGQNAGTTPLVYSPPTQPMNAQP------QSRPFAPGPR-PTHHQGGFRSIQFFQRTQMQ----- 193
Cdd:PHA03247  2749 ATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPavaslsESRESLPSPWdPADPPAAVLAPAAALPPAASpagpl 2828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  194 ----TARPTIPSNTPPIRPTSQTPTAAVySPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQYVGPPQQYPVQPTGPst 269
Cdd:PHA03247  2829 ppptSAQPTAPPPPPGPPPPSLPLGGSV-APGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP-- 2905
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  270 fyaAASPGEFPAPYAAGPPYYPGQPVYTPSPPiivPTPQQPPPAKREKKTIRIRDPNQGgkdVTDEILSGVGLSRNPTPP 349
Cdd:PHA03247  2906 ---ERPPQPQAPPPPQPQPQPPPPPQPQPPPP---PPPRPQPPLAPTTDPAGAGEPSGA---VPQPWLGALVPGRVAVPR 2976
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  350 VGRPSSTPTPPQQLNSQVADHGHIMYNVDSSPHLPApFNLKADDKPkleFSLQRTASPglrqPDTpLERRDPSSPVQTPS 429
Cdd:PHA03247  2977 FRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLA-LHEETDPPP---VSLKQTLWP----PDD-TEDSDADSLFDSDS 3047
                          410       420       430
                   ....*....|....*....|....*....|
gi 1207195306  430 SPPHKPELPPSDSETASSVATAPTPSIPAS 459
Cdd:PHA03247  3048 ERSDLEALDPLPPEPHDPFAHEPDPATPEA 3077
PHA03247 PHA03247
large tegument protein UL36; Provisional
32-322 5.55e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.49  E-value: 5.55e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   32 RTTLTTVPLQSVAQQVFLNFPSR--YPRVPPPlderifSTQPVSAVySVQRPPGPPFTAHEINKGHPNL-AATPPGHASS 108
Cdd:PHA03247  2713 HALVSATPLPPGPAAARQASPALpaAPAPPAV------PAGPATPG-GPARPARPPTTAGPPAPAPPAApAAGPPRRLTR 2785
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  109 PGLSQVSVSTVSTahlyghPKGWEPGGG----SPYTTGQNAGTTPLVYSPPtqPMNAQPQSRPFAPGPRPTHH--QGGFR 182
Cdd:PHA03247  2786 PAVASLSESRESL------PSPWDPADPpaavLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSLplGGSVA 2857
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  183 SIQFFQR---TQMQTARPTIPSNtPPIRPTSQTPTAAvySPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQYVGPPQQ 259
Cdd:PHA03247  2858 PGGDVRRrppSRSPAAKPAAPAR-PPVRRLARPAVSR--STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPP 2934
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  260 YPVQPTGP-----STFYAAASPGEFPAPY--AAGPPYYPGQPVYTPSPPIIVPTPQQPPPAKREKKTIRI 322
Cdd:PHA03247  2935 PPPRPQPPlapttDPAGAGEPSGAVPQPWlgALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
188-606 1.73e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 59.78  E-value: 1.73e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  188 QRTQMQTARPTIPSNTPPIRPTSQTPTAAvyspnqhimmtmAHMPFHSPQTAQYYIPqyrhsaPQYVGPPQQYPVQPTGP 267
Cdd:pfam03154  163 QQQILQTQPPVLQAQSGAASPPSPPPPGT------------TQAATAGPTPSAPSVP------PQGSPATSQPPNQTQST 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  268 STFYAAASPGEFPAPYAAGPPYYPGQPVYTPSPPIIVPtPQQPPPAKREKKTIRIRDPNQGGKDVTDEILSGVGLsrnPT 347
Cdd:pfam03154  225 AAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVS-PQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPF---PL 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  348 PPVGRPSSTPTPPQqlnSQVADHGHimynvdSSPHLPAPFNLKADDKPKLEFSLQRT--ASPGLRQPDT--------PLE 417
Cdd:pfam03154  301 TPQSSQSQVPPGPS---PAAPGQSQ------QRIHTPPSQSQLQSQQPPREQPLPPAplSMPHIKPPPTtpipqlpnPQS 371
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  418 RRDPS-----SPVQTPSSPPHKPELPPSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSS 492
Cdd:pfam03154  372 HKHPPhlsgpSPFQMNSNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTS 451
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  493 SPPPQSLSGSLTQHEKAVNGLTDVDAAplseelETQPREASPLLPTSSVPQSEPRPVTPVLeeesdPINMDSPLPPVEDD 572
Cdd:pfam03154  452 GLHQVPSQSPFPQHPFVPGGPPPITPP------SGPPTSTSSAMPGIQPPSSASVSSSGPV-----PAAVSCPLPPVQIK 520
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1207195306  573 AGCPDNvspslststtaaiSTTPPAPPPGLSHPS 606
Cdd:pfam03154  521 EEALDE-------------AEEPESPPPPPRSPS 541
dnaA PRK14086
chromosomal replication initiator protein DnaA;
154-313 9.72e-07

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 53.68  E-value: 9.72e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  154 PPTQPMNAQPQSRpfAPGPRP----THHQGGFRSIQFFQRTQMQTARPTIPSNTPPIRPTSQTPTAAVYSPNQHIMMTMA 229
Cdd:PRK14086    99 PPHARRTSEPELP--RPGRRPyegyGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPP 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  230 HMPFHSPqtAQYYIPQYRHSAPQYVGPPQQYPVQPTGPSTFYAAASPGE------FPAPYAA-------GPPYYPGQPVY 296
Cdd:PRK14086   177 RAPYASP--ASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRdrtdrpEPPPGAGhvhrggpGPPERDDAPVV 254
                          170
                   ....*....|....*..
gi 1207195306  297 TPSPPIIVPTPQQPPPA 313
Cdd:PRK14086   255 PIRPSAPGPLAAQPAPA 271
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
53-478 1.36e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.62  E-value: 1.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   53 SRYPRVPPPLDERIFSTQPVSAVYSVQRPPGppftaheINKGHPNLAATPPGhasSPGLSQVSVSTVSTAHLYGHPKGWE 132
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPPV-------LQAQSGAASPPSPP---PPGTTQAATAGPTPSAPSVPPQGSP 212
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  133 PGGGSPYTTGQNAGTTPLVYS-------------PPTQPMN--AQPQSRPFAPGPRPTHHQggfrsiqffQRTQMQTARP 197
Cdd:pfam03154  213 ATSQPPNQTQSTAAPHTLIQQtptlhpqrlpsphPPLQPMTqpPPPSQVSPQPLPQPSLHG---------QMPPMPHSLQ 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  198 TIPSNTPPIRPTSQTPTAAVYSPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQyvgPPQQYPVqPTGPSTFYAAASPG 277
Cdd:pfam03154  284 TGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQ---PPREQPL-PPAPLSMPHIKPPP 359
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  278 EFPAPYAAGPPYYPGQPVYTPSPPIIVPTPQQPPPAKREKKTIRIRDPNqggkdvtdeilsgvglSRNPTPPVGRPSSTP 357
Cdd:pfam03154  360 TTPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----------------SAHPPPLQLMPQSQQ 423
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  358 TPPQQLNSQVADHGHIMYNVDSSPHLPAPFNLKADDKPKLEFSLQRTASPGLRQPDTPLERRDPSSP-VQTPSSpphkpe 436
Cdd:pfam03154  424 LPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPgIQPPSS------ 497
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 1207195306  437 LPPSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKA 478
Cdd:pfam03154  498 ASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRS 539
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1666-1712 1.46e-06

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 50.67  E-value: 1.46e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1207195306 1666 LYDEDVISEDAFYKWevSKDPAEQQGKGVALKSVTAFFTWLREAEEE 1712
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
266-668 2.31e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.87  E-value: 2.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  266 GPSTFYAAASPGEFPAPyAAGPPYYPGQPvyTPSPPIIVPTPQQPPPAKREKKTIRIRDPNQGGKDVTDeilsgvglSRN 345
Cdd:PHA03307    44 VSDSAELAAVTVVAGAA-ACDRFEPPTGP--PPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPP--------GPS 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  346 PTPPVGRPSSTPTPPqqlnsqvADHGHIMYNVDSSPHLPAPFNLKADDKPKLEFSLQRTASPGLRQPDTPLERrdPSSPV 425
Cdd:PHA03307   113 SPDPPPPTPPPASPP-------PSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSS--PEETA 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  426 QTPSSPPhkPELPPSDSETASSvATAPTPSIPASTEESADAPSPL--------AEPSLTKAITPEPESSEPEKSSSPPPQ 497
Cdd:PHA03307   184 RAPSSPP--AEPPPSTPPAAAS-PRPPRRSSPISASASSPAPAPGrsaaddagASSSDSSSSESSGCGWGPENECPLPRP 260
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  498 SLSGSLTQHEKAVNG-LTDVDAAPLSEELETQPREASPLLPTSSVPQSEPRPVTPVLEEESDPINMDSPLPPVEDDAGCP 576
Cdd:PHA03307   261 APITLPTRIWEASGWnGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAA 340
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  577 DNVSPSLSTSTTAAISTTPPAPPPGLSHPSQVSAALDRRPSNG-AEIKETGKENEALPDKRGEPFLQ----SRKSSNQAT 651
Cdd:PHA03307   341 VSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGrPTRRRARAAVAGRARRRDATGRFpagrPRPSPLDAG 420
                          410
                   ....*....|....*..
gi 1207195306  652 SSAPKTWKKPKEDMPVG 668
Cdd:PHA03307   421 AASGAFYARYPLLTPSG 437
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
79-469 8.04e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.94  E-value: 8.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   79 QRPPGPP-FTAHEINKGHPNLAATPPGHASSPglsqVSVSTvSTAHLYGHPKGWEPGGGSPYTTGQNAGTTPLVYSPPTQ 157
Cdd:PHA03307    22 PRPPATPgDAADDLLSGSQGQLVSDSAELAAV----TVVAG-AAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLA 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  158 PMNAQPQSRPFAPGPRPThhqggfrsiqffqRTQMQTARPTIPSNTPPIRPTSQTPTAAVYSPNQHIMMTMAHMPFHSPQ 237
Cdd:PHA03307    97 PASPAREGSPTPPGPSSP-------------DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  238 TAQyyipqyrhsapqyVGPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYYPGQPVytPSPPIIVPTPQQPPPAKREK 317
Cdd:PHA03307   164 SDA-------------ASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPR--RSSPISASASSPAPAPGRSA 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  318 KTIRIRDPNQGGKDVTDEILSGvGLSRNPTPPVGrPSSTPTPPQQlnsqvadhgHIMYNVDSSPHLPAPfnlkaddkpkl 397
Cdd:PHA03307   229 ADDAGASSSDSSSSESSGCGWG-PENECPLPRPA-PITLPTRIWE---------ASGWNGPSSRPGPAS----------- 286
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1207195306  398 efslqrtASPGLRQPDTPLERRDPSSPVQTPSSPPHKPELPPSDSETASSVATAPTPSIPASTEESADAPSP 469
Cdd:PHA03307   287 -------SSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSP 351
PHA03378 PHA03378
EBNA-3B; Provisional
68-453 2.18e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 49.68  E-value: 2.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   68 STQPVSAVYSVQRPPGPPFTAHEINKGHPNLAATPPGHASSPGLsqvsvstvsTAHLYGHPKGWEPGGGSPYTTGQNAGT 147
Cdd:PHA03378   555 STEPVHDQLLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWP---------VPHPSQTPEPPTTQSHIPETSAPRQWP 625
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  148 TPLvYSPPTQPMNAQPQsrPFAPGPRPTHHQGGFRSIQFFQRTQMQTarPTIPSNTPPIRPTSQTPTAAvySPnqhimmT 227
Cdd:PHA03378   626 MPL-RPIPMRPLRMQPI--TFNVLVFPTPHQPPQVEITPYKPTWTQI--GHIPYQPSPTGANTMLPIQW--AP------G 692
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  228 MAHMPFHSPQtaqyyipqyRHSAPQYVGPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYYPGQPVYTPSPPIIVPTP 307
Cdd:PHA03378   693 TMQPPPRAPT---------PMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGR 763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  308 QQPPPAKREKKTirIRDPNQGGKDVTDEILSGvglsrnPTP---PVGRPSSTPTPPQQLNSQVADHGHIMYNVDSSPHLP 384
Cdd:PHA03378   764 ARPPAAAPGAPT--PQPPPQAPPAPQQRPRGA------PTPqppPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR 835
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1207195306  385 APFNLKADDKPKLEFSLQRTASPGLRQPDTPLERRDPSSPVQTPSSPPHKpeLPPSDSETASSVATAPT 453
Cdd:PHA03378   836 GRPSLKKPAALERQAAAGPTPSPGSGTSDKIVQAPVFYPPVLQPIQVMRQ--LGSVRAAAASTVTQAPT 902
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
25-264 2.42e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 49.26  E-value: 2.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   25 QLSASQLRTTLTtVPLQSVAQQVFLNFPSRYPRVPPPLDERIFSTQPVSAVYSVQRPPGPpftaheINKGHPNLA----- 99
Cdd:pfam09770   94 AIEEEQVRFNRQ-QPAARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTGYEKYKEPEP------IPDLQVDASlwgva 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  100 ---ATPPGHASSPGLSQVSVSTVS---------TAHLYGHPKGWEPGGGSPYTTGQNAGTTPLVYSPPTQPMNAQPQSRP 167
Cdd:pfam09770  167 pkkAAAPAPAPQPAAQPASLPAPSrkmmsleevEAAMRAQAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQP 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  168 FAPGPRPTHHQGGFRSIQFFQRTQMQTARPTIPSNTPPIRPTSQTPTAAVYSPNQHI------MMTMAHMPFHSPQTAQY 241
Cdd:pfam09770  247 QQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQILqnpnrlSAARVGYPQNPQPGVQP 326
                          250       260
                   ....*....|....*....|...
gi 1207195306  242 YIPQYRHSAPQYVGPPQQYPVQP 264
Cdd:pfam09770  327 APAHQAHRQQGSFGRQAPIITHP 349
PRK10263 PRK10263
DNA translocase FtsK; Provisional
149-311 3.27e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 48.93  E-value: 3.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  149 PLVYSPPTQPMNA-QPQSRPFAPGPRPTHHQGGFRSiqffqrtQMQTARPTIPSNTPPIRPTSQTPTAAVYSPNQHIMMT 227
Cdd:PRK10263   347 ASVDVPPAQPTVAwQPVPGPQTGEPVIAPAPEGYPQ-------QSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQP 419
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  228 MAHMPFHSPQTAQYYIPQYRHSAP--QYVGPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYYPGQPVYTPSPPIIVP 305
Cdd:PRK10263   420 YYAPAPEQPAQQPYYAPAPEQPVAgnAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEET 499

                   ....*.
gi 1207195306  306 TPQQPP 311
Cdd:PRK10263   500 KPARPP 505
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
265-480 3.61e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 48.72  E-value: 3.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  265 TGPSTfyAAASPGEFPAPYAAGPPYYPGQPVYTPSPPIIVPTPQQPPPAKREKKTIRIRDPNQGGKDVTDEILSGVGLSR 344
Cdd:PRK12323   372 AGPAT--AAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPA 449
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  345 NPTPPVGRPSSTPTPPQQLNSQVADHGHIMYNVDSSPHLPAPfnlKADDKPKLEFSLQRTASPGLRQPDTPLERRDPSSP 424
Cdd:PRK12323   450 PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAP---ADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESI 526
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1207195306  425 VQTPSSPPHKPELPPSDSETASSVATAPTPSIPAsteeSADAPSPLAEPSLTKAIT 480
Cdd:PRK12323   527 PDPATADPDDAFETLAPAPAAAPAPRAAAATEPV----VAPRPPRASASGLPDMFD 578
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
291-552 5.97e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.61  E-value: 5.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  291 PGQPVYT--PSPPIIVPTPQQPPPAKREKktirirDPNQGgKDVTDEILSGVGLSRNPTPPVGrPSSTPTPPQQLNSQVA 368
Cdd:PLN03209   330 PKESDAAdgPKPVPTKPVTPEAPSPPIEE------EPPQP-KAVVPRPLSPYTAYEDLKPPTS-PIPTPPSSSPASSKSV 401
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  369 DHGHIMYNVDSSPHLPAPFNLKADDKPKLEFSLQRTASPGLRQPDtplerrdpsspVQTPSSPPHKPELPPSDSETASSV 448
Cdd:PLN03209   402 DAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYED-----------LKPPTSPSPTAPTGVSPSVSSTSS 470
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  449 ATA-PTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSSSPPPQSLSGSLTQHEKAVNGLTDVDAAPLSEELET 527
Cdd:PLN03209   471 VPAvPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
                          250       260
                   ....*....|....*....|....*.
gi 1207195306  528 QPREASPLLPTSSV-PQSEPRPVTPV 552
Cdd:PLN03209   551 KPRPLSPYTMYEDLkPPTSPTPSPVL 576
PRK10263 PRK10263
DNA translocase FtsK; Provisional
101-610 1.42e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.00  E-value: 1.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  101 TPPGHASSPGLSQVSVStvstahlyghpkgWEPGGGsPYTTGQNAGTTPLVYSPptQPMNAQPQSRPFAPGPRPTHHQGG 180
Cdd:PRK10263   343 TPPVASVDVPPAQPTVA-------------WQPVPG-PQTGEPVIAPAPEGYPQ--QSQYAQPAVQYNEPLQQPVQPQQP 406
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  181 FRSIQFFQRTQMQ--TARPTIPSNTPPIRPTSQTPTAAVYSPNQHIMMTMAHMPFHSPQTA--QYYIPQYRHSAPQYVGP 256
Cdd:PRK10263   407 YYAPAAEQPAQQPyyAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTyqQPAAQEPLYQQPQPVEQ 486
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  257 P---QQYPV----QPTGPSTFY--------------AAASPGEFPAPYAAGPPYYPGQPVYTP-SPPIIVPTPQQPPPAk 314
Cdd:PRK10263   487 QpvvEPEPVveetKPARPPLYYfeeveekrarereqLAAWYQPIPEPVKEPEPIKSSLKAPSVaAVPPVEAAAAVSPLA- 565
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  315 rekktirirdpnQGGKDVTdeilSGVGLSRNPTPPVGRPSSTPTPPQQLNSQVAdhghimynvdssPHLPAPFNLKADDK 394
Cdd:PRK10263   566 ------------SGVKKAT----LATGAAATVAAPVFSLANSGGPRPQVKEGIG------------PQLPRPKRIRVPTR 617
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  395 PKL-----EFSLQRTASPGLRQPDtpleRRDPSSPVQTPSSPP---HKPELPPSDSETASS-VATAPTPSIPASTEESAD 465
Cdd:PRK10263   618 RELasygiKLPSQRAAEEKAREAQ----RNQYDSGDQYNDDEIdamQQDELARQFAQTQQQrYGEQYQHDVPVNAEDADA 693
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  466 ApsplAEPSLTKAITPEPESSEPEKSSSPPPQSLsgsltqhekavngLTDVDAAPLSEELETQPREasPLLPTSSVPQSE 545
Cdd:PRK10263   694 A----AEAELARQFAQTQQQRYSGEQPAGANPFS-------------LDDFEFSPMKALLDDGPHE--PLFTPIVEPVQQ 754
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1207195306  546 PRPVTPVLEEESDPinmDSPLPPVEDDAGCPDNVSPSlstsTTAAISTTPPAPPPGLSHPSQVSA 610
Cdd:PRK10263   755 PQQPVAPQQQYQQP---QQPVAPQPQYQQPQQPVAPQ----PQYQQPQQPVAPQPQYQQPQQPVA 812
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
210-474 1.63e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.45  E-value: 1.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  210 SQTPTAAVYSPnqhimmTMAHMPFHSPQTAQYyIPQYRH-----SAPQYVGPPQQYP--VQPTGPSTFYAAASPGEFPAP 282
Cdd:pfam05109  422 SKAPESTTTSP------TLNTTGFAAPNTTTG-LPSSTHvptnlTAPASTGPTVSTAdvTSPTPAGTTSGASPVTPSPSP 494
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  283 YAAGPP------YYPGQPVYTPSPPIIVPTPQQPPPakrekkTIRIRDPNQGGKDVTDEILSGVGLSRNPTPPVGRPS-- 354
Cdd:pfam05109  495 RDNGTEskapdmTSPTSAVTTPTPNATSPTPAVTTP------TPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTpn 568
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  355 ---------------STPTPPQ------QLNSQVADHGHIMYNVDSSPHLPAPFN--LKADDKPKLEFSLQRTASPGLRq 411
Cdd:pfam05109  569 atiptlgktsptsavTTPTPNAtsptvgETSPQANTTNHTLGGTSSTPVVTSPPKnaTSAVTTGQHNITSSSTSSMSLR- 647
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1207195306  412 PDTPLERRDPSSPVQTPSSPPHKPELPPSDSETASSVATAPTPSIPASTEESADAPSPLAEPS 474
Cdd:pfam05109  648 PSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQAS 710
PRK10263 PRK10263
DNA translocase FtsK; Provisional
211-320 3.91e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 3.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  211 QTPTAAVYSPNQhimmtmahMPFHSPQTAQYYIPQYRHSApQYVGPPQQY--PVQPTGPSTFYAAASPGEFPAPYAAGP- 287
Cdd:PRK10263   738 DGPHEPLFTPIV--------EPVQQPQQPVAPQQQYQQPQ-QPVAPQPQYqqPQQPVAPQPQYQQPQQPVAPQPQYQQPq 808
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1207195306  288 -PYYPgQPVYTPSPPIIVPTPQ----QPPPAKREKKTI 320
Cdd:PRK10263   809 qPVAP-QPQYQQPQQPVAPQPQyqqpQQPVAPQPQDTL 845
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
133-331 4.82e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.98  E-value: 4.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  133 PGGGSPYTTGQNAGTTPLVYSPPTQPMNAQPQSRPFAPGPRPTHHQGGFRSIQFFQRTQMQTARPTIPSNTPPIRPTSQT 212
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  213 PTAAVYSPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQYVGPPQQYPVQPTGPSTFYAAAS---------PGEFPAPY 283
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSpaaddpvplPPEPDDPP 749
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1207195306  284 AAGPPYYPGQPVYTPSPPiiVPTPQQPPPAKREKKTIRIRDPNQGGKD 331
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPA--AAPAAAPPPSPPSEEEEMAEDDAPSMDD 795
PRK10905 PRK10905
cell division protein DamX; Validated
404-478 4.99e-04

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 44.16  E-value: 4.99e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  404 TASPGLRQPDTPLERR----DPSSPVQTPSSPPHKPELPPSDSETASSVATAPTP---SIPASTEESADAPSPLAEPSLT 476
Cdd:PRK10905   144 KTQTAERPATTRPARKqaviEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPaatSTPAPKETATTAPVQTASPAQT 223

                   ..
gi 1207195306  477 KA 478
Cdd:PRK10905   224 TA 225
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
176-316 5.51e-04

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Mutations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 42.47  E-value: 5.51e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306   176 HHQGGFRSIQF--FQRTQMQTARPTIPSNTPPIRPTSQTPTaavysPNQHIMMTMAHMPFHSPQTAQYYIPqyrhsaPQY 253
Cdd:smart00818   36 HHQIIPVSQQHppTHTLQPHHHIPVLPAQQPVVPQQPLMPV-----PGQHSMTPTQHHQPNLPQPAQQPFQ------PQP 104
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1207195306   254 VGPPQ-QYPVQPTGPstfyaaASPGEFPAPYAAGPPYYPGQPVytpsPPIIVPTPQQPPPA----KRE 316
Cdd:smart00818  105 LQPPQpQQPMQPQPP------VHPIPPLPPQPPLPPMFPMQPL----PPLLPDLPLEAWPAtdktKRE 162
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
412-577 1.25e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.71  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  412 PDTPLERRDPSSPVQTPSSPPHKPELPPSDSETASSVATAPTPSIPA----------------STEESADAPSPLAEPSL 475
Cdd:PRK12323   383 AQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPApealaaarqasargpgGAPAPAPAPAAAPAAAA 462
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  476 TKAITPEPESSEPEKSSSPPPQSLSGSLTQHEK-----------AVNGLTDVDAAPLSEELETQPREASPLLPTSSVPQS 544
Cdd:PRK12323   463 RPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpppweelppefASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLA 542
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1207195306  545 EPRPVTPVLEEESDPINMDSPLPPVEDDAGCPD 577
Cdd:PRK12323   543 PAPAAAPAPRAAAATEPVVAPRPPRASASGLPD 575
PHA02682 PHA02682
ORF080 virion core protein; Provisional
295-479 2.09e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 42.16  E-value: 2.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  295 VYTPSPPIIVPTPQQPPPAKREKKTIRIRDPNQGGKDVTDEILSGVGLSRNPTppvGRPSSTPTPPQQLNSQVADhghim 374
Cdd:PHA02682    25 LFTKCPQATIPAPAAPCPPDADVDPLDKYSVKEAGRYYQSRLKANSACMQRPS---GQSPLAPSPACAAPAPACP----- 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  375 ynvDSSPHLPAPFNLKADDKPKLEFSLQRTASPGLRQPDTPleRRDPSSPVQTPSSPPHKPELPPSDSETASSV---ATA 451
Cdd:PHA02682    97 ---ACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPA--RPAPACPPSTRQCPPAPPLPTPKPAPAAKPIflhNQL 171
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1207195306  452 PTPSIPAS---TEESADAPSPLAEPSLTKAI 479
Cdd:PHA02682   172 PPPDYPAAscpTIETAPAASPVLEPRIPDKI 202
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
250-473 2.33e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 2.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  250 APQYVGPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYYPGQPVyTPSPPIIVPTPQQPPPAkrekktirirdPNQGG 329
Cdd:PRK07764   589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAA-APAEASAAPAPGVAAPE-----------HHPKH 656
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  330 KDVTDEILSGVGLSRNPTPPVGrpsSTPTPPQQLNSQVADHGhimynvdSSPHLPAPfnlKADDKPKLEFSLQRTASPgl 409
Cdd:PRK07764   657 VAVPDASDGGDGWPAKAGGAAP---AAPPPAPAPAAPAAPAG-------AAPAQPAP---APAATPPAGQADDPAAQP-- 721
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1207195306  410 RQPDTPLERRDPSSPVQTPssPPHKPELPPSDSETASSvATAPTPSIPASTEESADAPSPLAEP 473
Cdd:PRK07764   722 PQAAQGASAPSPAADDPVP--LPPEPDDPPDPAGAPAQ-PPPPPAPAPAAAPAAAPPPSPPSEE 782
PRK10263 PRK10263
DNA translocase FtsK; Provisional
197-300 2.90e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.76  E-value: 2.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  197 PTIPSNTPPIRPTSQTPTAAVYSPNQHIMMTMAHMPFHSPQTAQYYIPQYRHSAPQYVGPPQ---QYPVQPTGPSTFYAA 273
Cdd:PRK10263   740 PHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQpqyQQPQQPVAPQPQYQQ 819
                           90       100
                   ....*....|....*....|....*..
gi 1207195306  274 ASpgefpAPYAAGPPYYPGQPVYTPSP 300
Cdd:PRK10263   820 PQ-----QPVAPQPQYQQPQQPVAPQP 841
PHA03247 PHA03247
large tegument protein UL36; Provisional
255-473 4.27e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 4.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  255 GPPQQYPVQPTGPSTFYAAASPGEFPAPYAAGPPYYPGQPVYTP----SPPIIVPTPQQPPPAKREKktirirDPNQGGK 330
Cdd:PHA03247   257 PPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVWGAalagAPLALPAPPDPPPPAPAGD------AEEEDDE 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  331 DVTDEILSGVGLSRN--PTPPVGRPSSTPTPPQQLNsqvadhghimynvdssphlpapfNLKADDKPKLEFSLQRTASPG 408
Cdd:PHA03247   331 DGAMEVVSPLPRPRQhyPLGFPKRRRPTWTPPSSLE-----------------------DLSAGRHHPKRASLPTRKRRS 387
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1207195306  409 LRQPDTPLERRDPSSPVQTPSSPPHKPELPPSDSETASSVATAPTPSIPASTEESADAPSPLAEP 473
Cdd:PHA03247   388 ARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPER 452
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
1077-1354 4.44e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.96  E-value: 4.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1077 KPRMDQYFNQMEKIVKERKTSSRIrfmlqDVIDLrlhnWVsRRADQGPKTIEQIHKDAKLEEQEEQRKVHQQLLSKDNKR 1156
Cdd:PTZ00108  1101 KEKVEKLNAELEKKEKELEKLKNT-----TPKDM----WL-EDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKASKLR 1170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1157 RPVVQREETWSTVPMTKNSRTIDPAKIPKFSKSAIDEKIQLGPRAQVNWMKGSSGGAGAKASESDASRPSaslNRYSPLQ 1236
Cdd:PTZ00108  1171 KPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSV---KRLKSKK 1247
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306 1237 PSALQTSSLPSTSPDFDSRRVLGSRG--SSGRERNDKPLSAGPARTGPISLSSSNKETPEELVQ---EVSRRDSNASDTP 1311
Cdd:PTZ00108  1248 NNSSKSSEDNDEFSSDDLSKEGKPKNapKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKkrlEGSLAALKKKKKS 1327
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1207195306 1312 KLLVSTADKS--RLENSQPRESAVKLEALSGPSPDKPALSEEEME 1354
Cdd:PTZ00108  1328 EKKTARKKKSktRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSE 1372
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
288-657 6.94e-03


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.19  E-value: 6.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  288 PYYPGQPVYTPSP---PIIVPTPQQPPPAKREKKTIRIRDPNQGGKDVTDEILSGVGLSRnptppVGRPSSTPTPPqqln 364
Cdd:COG5665    177 IAVPSAPAAPPNAvdySVLVPIAAQDPAASVSTPQAFNASATSGRSQHIVQAAKRVGVEW-----WGDPSLLATPP---- 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  365 sqvadHGHIMYNVDSSPHLPAPFNLKADDKPKLEFSLQRTASPGLR-----QPDTPLE--RRDPSSPVQTPSSPPHKPEL 437
Cdd:COG5665    248 -----ATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTSNTPTSTakaqpQPPTKKQpaKEPPSDTASGNPSAPSVLIN 322
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  438 PPSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLTKAITPEPESSEPEKSSSPPPQSLSGSLTQHEKAVNGLTdvD 517
Cdd:COG5665    323 SDSPTSEDPATASVPTTEETTAFTTPSSVPSTPAEKDTPATDLATPVSPTPPETSVDKKVSPDSATSSTKSEKEGGT--A 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  518 AAPLSEelETQPREASPLLPTSSVPQSEPRPVTPVLEEESDPINMDSPLPPVEDDAGCPDNVSPSLsTSTTAAISTTPPA 597
Cdd:COG5665    401 SSPMPP--NIAIGAKDDVDATDPSQEAKEYTKNAPMTPEADSAPESSVRTEASPSAGSDLEPENTT-LRDPAPNAIPPPE 477
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  598 PPPGLSHPSQVSAALDrrpSNGAEIKETGKENEALPDKRGEPFLQSRKSSNQATSSAPKT 657
Cdd:COG5665    478 DPSTIGRLSSGDKLAN---ETGPPVIRRDSTPSSTADQSIVGVLAFGLDQRTQAEISVEA 534
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
421-604 7.38e-03


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.00  E-value: 7.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  421 PSSPVQTPSSPPhkPELPPSDSETASSVATAPTPSIPASTEESADAPSPLAEPSLtkaitpepessepekssspppqsls 500
Cdd:PRK07994   361 PAAPLPEPEVPP--QSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAV------------------------- 413
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  501 gsltQHEKAVNGLTDVDAAPLSEELETQPREASPllptssVPQSEPRPVTPVLEE--ESDPINMDSPLPPVEDDAGCPDN 578
Cdd:PRK07994   414 ----PLPETTSQLLAARQQLQRAQGATKAKKSEP------AAASRARPVNSALERlaSVRPAPSALEKAPAKKEAYRWKA 483
                          170       180
                   ....*....|....*....|....*.
gi 1207195306  579 VSPSLstsTTAAISTTPPAPPPGLSH 604
Cdd:PRK07994   484 TNPVE---VKKEPVATPKALKKALEH 506
PHA03291 PHA03291
envelope glycoprotein I; Provisional
395-469 8.16e-03


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 40.71  E-value: 8.16e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1207195306  395 PKLEFSLQRTASPGLRQPDTPLERRDPSSPVQTPSSPPHKPELPPSDSETASSVATAPTPSIPASTEESADAPSP 469
Cdd:PHA03291   188 PALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
348-624 8.41e-03


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 8.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  348 PPVGrPSSTPTPPQQLNSQVADHGHimynvdsspHLPApfnlKADDKPKLEFSLQRTASPGLRQPDTPLERRDPSS-PVQ 426
Cdd:PTZ00449   510 PPEG-PEASGLPPKAPGDKEGEEGE---------HEDS----KESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKiPTL 575
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  427 T--PSSP-----PHKPELP--PSDSETASSVATAPTPS------IPASTEESADAPSPLAEPSLTKAITPEPESSEPEKS 491
Cdd:PTZ00449   576 SkkPEFPkdpkhPKDPEEPkkPKRPRSAQRPTRPKSPKlpelldIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIK 655
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1207195306  492 SSPPPQSLSGSL----------TQHEKAVNGLTDVDAAPLSEELETQPREASPLLPTSsvPQSEPRPVTPVLeeesdPIN 561
Cdd:PTZ00449   656 SPKPPKSPKPPFdpkfkekfydDYLDAAAKSKETKTTVVLDESFESILKETLPETPGT--PFTTPRPLPPKL-----PRD 728
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1207195306  562 MDSPL-PPVEDDAGCPDNVSPSLSTSTTAAISTTPPAPPPglshPSQVSAALDRRPSNGAEIKE 624
Cdd:PTZ00449   729 EEFPFePIGDPDAEQPDDIEFFTPPEEERTFFHETPADTP----LPDILAEEFKEEDIHAETGE 788
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
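
The E-value threshold above can be checked against the hit summary lines on this page. The following is a minimal sketch, not part of the CD-Search tooling: the function names `parse_summary` and `hits_below` are hypothetical, the regular expression assumes the "Pssm-ID ... Cd Length ... Bit Score ... E-value" layout shown in the listing, and treating the threshold as a strict upper bound is an assumption.

```python
import re

# Summary lines copied from five of the domain hits listed on this page.
SUMMARY_LINES = [
    "Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.96  E-value: 4.44e-03",
    "Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.19  E-value: 6.94e-03",
    "Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.00  E-value: 7.38e-03",
    "Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 40.71  E-value: 8.16e-03",
    "Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.83  E-value: 8.41e-03",
]

# Assumed layout of a summary line; the optional "[Multi-domain]" tag is skipped.
PATTERN = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = PATTERN.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def hits_below(lines, threshold=0.01):
    """Keep hits whose E-value is strictly below the threshold (an assumption)."""
    parsed = (parse_summary(line) for line in lines)
    return [h for h in parsed if h is not None and h["e_value"] < threshold]


if __name__ == "__main__":
    for hit in hits_below(SUMMARY_LINES):
        print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

All five hits copied here fall under the 0.01 threshold, which is consistent with their appearing in the list; lowering the threshold in `hits_below` shows which of them would drop out first.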

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.