|
Name |
Accession |
Description |
Interval |
E-value |
| MEF2_binding |
pfam09047 |
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ... |
2154-2188 |
9.57e-16 |
|
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.
Pssm-ID: 370261 [Multi-domain] Cd Length: 35 Bit Score: 72.58 E-value: 9.57e-16
10 20 30
....*....|....*....|....*....|....*
gi 1622841216 2154 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2188
Cdd:pfam09047 1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
|
|
| MEF2_binding |
cd13839 |
Myocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ... |
2154-2188 |
5.56e-14 |
|
Myocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the myocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.
Pssm-ID: 260103 [Multi-domain] Cd Length: 35 Bit Score: 67.79 E-value: 5.56e-14
10 20 30
....*....|....*....|....*....|....*
gi 1622841216 2154 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2188
Cdd:cd13839 1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1918-2184 |
7.17e-12 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 71.36 E-value: 7.17e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1918 ASGDTPTTPKHPKDSRENFFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATT--VITCPPSASAsTLDLSKDPG-- 1993
Cdd:PHA03307 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPeeTARAPSSPPA-EPPPSTPPAaa 202
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1994 ---PPRPHRHEATPSM---ASLGPEGEELARVAEGTGFPPQEPRCSAQVKTA---PTSSPAEPHCWPAEAAPGTGTEPTC 2064
Cdd:PHA03307 203 sprPPRRSSPISASASspaPAPGRSAADDAGASSSDSSSSESSGCGWGPENEcplPRPAPITLPTRIWEASGWNGPSSRP 282
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2065 SQEGKLRPEPRREGEAQEAASETQPLSSPPTAASSKAPS--GGSAQPPEGHPGKAEPSRAKSRPLPNMPKLVIPSAATKF 2142
Cdd:PHA03307 283 GPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSreSSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADP 362
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1622841216 2143 PP---EITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVR 2184
Cdd:PHA03307 363 SSprkRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRF 407
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
30-205 |
2.97e-11 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 65.80 E-value: 2.97e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEarllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457 2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457 62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
|
170
....*....|....*.
gi 1622841216 190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457 142 DDADALYNLGIALEKL 157
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
34-199 |
3.67e-10 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 62.72 E-value: 3.67e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 34 AEAFALYHKALDLQkhdrfEESAKAYHELleARLLREAvssGDEKEGLKH--------PGLIlkySTYKNLAQLAAQRED 105
Cdd:COG0457 25 EEAIEDYEKALELD-----PDDAEALYNL--GLAYLRL---GRYEEALADyeqaleldPDDA---EALNNLGLALQALGR 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 106 LETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKAL 185
Cdd:COG0457 92 YEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLE 171
|
170
....*....|....
gi 1622841216 186 EKDCRYSKGLVLKE 199
Cdd:COG0457 172 AAALAALLAAALGE 185
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1910-2153 |
4.15e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 65.73 E-value: 4.15e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1910 LGAAAQRQASGDTPTTPKHPKDSRENFFPVTVAPTAPDPVPADSAqrpsdAHTKPRPALAAATTVITCPPSASASTlDLS 1989
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP-----APAPPAAPAAGPPRRLTRPAVASLSE-SRE 2796
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1990 KDPGPPRPhrheATPSMASLGPEGEELARVAEGTGFPPqePRCSAQVKTAPTSSPAEPHCWPAEA-APGtgteptcsqeG 2068
Cdd:PHA03247 2797 SLPSPWDP----ADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSLPLGGSvAPG----------G 2860
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2069 KL-RPEPRREGEAQEAASETQPLSSPPTAASSKAPSGgSAQPPEGHPGKAEPSrAKSRPLPNMPKLVIPSAATkfPPEIT 2147
Cdd:PHA03247 2861 DVrRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES-FALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQP--PPPPP 2936
|
....*.
gi 1622841216 2148 VTPPTP 2153
Cdd:PHA03247 2937 PRPQPP 2942
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1912-2155 |
4.37e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 65.73 E-value: 4.37e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1912 AAAQRQASGDTPTTPKHPKdsrenffpVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAAttvitcPPSASASTLDLSKD 1991
Cdd:PHA03247 2581 AVTSRARRPDAPPQSARPR--------APVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPS------PAANEPDPHPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1992 PGPPRPhRHEATPSMASLGPEGEELARVAEGTGfPPQEPRCSAQVKTA--------PTSSPAEPHCWPAEAAPGTGTEPT 2063
Cdd:PHA03247 2647 PPPERP-RDDPAPGRVSRPRRARRLGRAAQASS-PPQRPRRRAARPTVgsltsladPPPPPPTPEPAPHALVSATPLPPG 2724
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2064 CSQEGKLRPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPLPNMPKLV--IPSAATK 2141
Cdd:PHA03247 2725 PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESResLPSPWDP 2804
|
250
....*....|....
gi 1622841216 2142 FPPEITVTPPTPTL 2155
Cdd:PHA03247 2805 ADPPAAVLAPAAAL 2818
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1914-2192 |
7.41e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 64.96 E-value: 7.41e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1914 AQRQASGDTPTTPKH--PKDSRENFFPVTVAPTAPDPVPADSAQRPSdahTKPRPALAAATTVITCPPSASASTLDLSKD 1991
Cdd:PHA03247 2544 ASDDAGDPPPPLPPAapPAAPDRSVPPPRPAPRPSEPAVTSRARRPD---APPQSARPRAPVDDRGDPRGPAPPSPLPPD 2620
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1992 PGPPRPHRHEATPSMASLGPEGEELARVAEGTGFPPQEPRCS----AQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQE 2067
Cdd:PHA03247 2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD 2700
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2068 gklRPEPRREGE-AQEAASETQPLSSPPTAASSKAPSGGSAQ----PPEGHPGKAEPSRAKSRPLPNMPklviPSAAtkf 2142
Cdd:PHA03247 2701 ---PPPPPPTPEpAPHALVSATPLPPGPAAARQASPALPAAPappaVPAGPATPGGPARPARPPTTAGP----PAPA--- 2770
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2143 PPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPA 2192
Cdd:PHA03247 2771 PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
|
|
| Spy |
COG3914 |
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ... |
27-188 |
1.53e-09 |
|
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443119 [Multi-domain] Cd Length: 658 Bit Score: 63.09 E-value: 1.53e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 27 QTKEAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEARLLREAvssGDEKEGLKHPGLILK-----YSTYKNLAQLAA 101
Cdd:COG3914 47 LLAALAEAAAAALLALAAGEAAAAAAALLLLAALLELAALLLQAL---GRYEEALALYRRALAlnpdnAEALFNLGNLLL 123
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFI 181
Cdd:COG3914 124 ALGRLEEALAALRRALALNPDFAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEAIAAY 203
|
....*..
gi 1622841216 182 CKALEKD 188
Cdd:COG3914 204 RRALELD 210
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1910-2160 |
3.41e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 62.65 E-value: 3.41e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1910 LGAAAQRQASGDTPTTPKHPKdsrenffpVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASASTLDLS 1989
Cdd:PHA03247 2695 LTSLADPPPPPPTPEPAPHAL--------VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP 2766
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1990 KDPGPPR-----PHRHEATPSMASLGPEGEELARVAEGTGFP-PQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEP- 2062
Cdd:PHA03247 2767 PAPAPPAapaagPPRRLTRPAVASLSESRESLPSPWDPADPPaAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPp 2846
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2063 --TCSQEGKLRP--EPRREGEAQEAASETQPLSSPPTAASSKAPSGGS----AQPPEGHPGKAEPSrAKSRPLPNMPKLV 2134
Cdd:PHA03247 2847 ppSLPLGGSVAPggDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRStesfALPPDQPERPPQPQ-APPPPQPQPQPPP 2925
|
250 260
....*....|....*....|....*.
gi 1622841216 2135 IPSAATKFPPEITVTPPTPTLLSPKG 2160
Cdd:PHA03247 2926 PPQPQPPPPPPPRPQPPLAPTTDPAG 2951
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
30-205 |
4.91e-09 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 59.74 E-value: 4.91e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLE---------ARLLREAVSSGDEKEGLKHPGLILKYS-----TYKN 95
Cdd:COG2956 70 ERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLEldpddaealRLLAEIYEQEGDWEKAIEVLERLLKLGpenahAYCE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 96 LAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYT 175
Cdd:COG2956 150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPE 229
|
170 180 190
....*....|....*....|....*....|
gi 1622841216 176 TCLYFICKALEKDCRYSKGLVLKEKIFEEQ 205
Cdd:COG2956 230 EALELLRKALELDPSDDLLLALADLLERKE 259
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1912-2155 |
6.63e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.88 E-value: 6.63e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1912 AAAQRQASGDTPTTPKHPKDSRENFFPV-TVAPTAPDPVPADSAQRPSDaHTKPRPALAAATTVITCPPSASA-STLDLS 1989
Cdd:PHA03247 2758 ARPPTTAGPPAPAPPAAPAAGPPRRLTRpAVASLSESRESLPSPWDPAD-PPAAVLAPAAALPPAASPAGPLPpPTSAQP 2836
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1990 KDPGPPRPHRHEATPSMASLGPeGEELARVAEgTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQEgk 2069
Cdd:PHA03247 2837 TAPPPPPGPPPPSLPLGGSVAP-GGDVRRRPP-SRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQ-- 2912
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2070 LRPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRP-LPNMPKLVIPSAAtkfPPEITV 2148
Cdd:PHA03247 2913 APPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA---PSREAP 2989
|
....*..
gi 1622841216 2149 TPPTPTL 2155
Cdd:PHA03247 2990 ASSTPPL 2996
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. |
1922-2181 |
9.40e-09 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 60.36 E-value: 9.40e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1922 TPTTPKHPKDSRENFFPVTVAPTAPDPV----PADSAQRPSDAHTKPR------PALAAATTVIT----------CPPSA 1981
Cdd:pfam17823 99 EPATREGAADGAASRALAAAASSSPSSAaqslPAAIAALPSEAFSAPRaaacraNASAAPRAAIAaasaphaaspAPRTA 178
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1982 SASTLDLSKDPGPPRPHRHEATPSMASLGP-EGEELARVAEGTgfpPQEPRCSAQVktaPTSSPAEPHCWPA-----EAA 2055
Cdd:pfam17823 179 ASSTTAASSTTAASSAPTTAASSAPATLTPaRGISTAATATGH---PAAGTALAAV---GNSSPAAGTVTAAvgtvtPAA 252
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2056 PGT-----GTEPTCSQEGKL-RPEPRREGEAQEAASETQPLS-SPPTAASSKAPSG--GSAQPPEGHPGKAEPSRAKSRP 2126
Cdd:pfam17823 253 LATlaaaaGTVASAAGTINMgDPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTL 332
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622841216 2127 LPNMPKLVIPS---------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2181
Cdd:pfam17823 333 EPNTPKSVASTnlavvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1917-2143 |
2.67e-08 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 59.12 E-value: 2.67e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1917 QASGDT-PTTPKHPKDSRENffPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASAstLDLSKDPGPP 1995
Cdd:PRK12323 367 QSGGGAgPATAAAAPVAQPA--PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEA--LAAARQASAR 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1996 RPHRHEATPSMASLGPEGEELARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQEGKLRPEPR 2075
Cdd:PRK12323 443 GPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWV 522
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622841216 2076 REGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRpLPNMPKLVIPSAATKFP 2143
Cdd:PRK12323 523 AESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG-LPDMFDGDWPALAARLP 589
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
33-188 |
5.37e-08 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 53.66 E-value: 5.37e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 33 EAEAFALYHKALDLQKHDRFEESAKAYHELLEArllreavsSGDEKEGlkhpglilkystYKNLAQLAAQREDLETAMEF 112
Cdd:COG4783 1 AACAEALYALAQALLLAGDYDEAEALLEKALEL--------DPDNPEA------------FALLGEILLQLGDLDEAIVL 60
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622841216 113 YLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG4783 61 LHEALELDPDEPEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELD 136
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1941-2182 |
5.90e-08 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 58.32 E-value: 5.90e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1941 VAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASASTldlskdpgPPRPHRHEATPSMASLGPEGEELARVA 2020
Cdd:PRK07003 372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--------AAAATRAEAPPAAPAPPATADRGDDAA 443
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2021 EGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPA-EAAPGTGTEPTCSQEgklrPEPRREGEAQEAASETQPLSSP------ 2093
Cdd:PRK07003 444 DGDAPVPAKANARASADSRCDERDAQPPADSGsASAPASDAPPDAAFE----PAPRAAAPSAATPAAVPDARAPaaasre 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2094 --PTAASSKAPSGGSAQPPEGHPgkaePSRA--------------------KSRPLPNMPKLVIPSAATKFPPEITVTPP 2151
Cdd:PRK07003 520 daPAAAAPPAPEARPPTPAAAAP----AARAggaaaaldvlrnagmrvssdRGARAAAAAKPAAAPAAAPKPAAPRVAVQ 595
|
250 260 270
....*....|....*....|....*....|.
gi 1622841216 2152 TPTLLSPKGSiseeTKQKLKSAILSAQSAAN 2182
Cdd:PRK07003 596 VPTPRARAAT----GDAPPNGAARAEQAAES 622
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1938-2194 |
7.39e-08 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 58.15 E-value: 7.39e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1938 PVTVAPTAPDPvPADSAQRPSDAhtKPRPALAAATTVITCPPSASASTLDLSKDPGPPRPHRHEATPSMASLGPEGEELA 2017
Cdd:PHA03378 598 PVPHPSQTPEP-PTTQSHIPETS--APRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPY 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2018 RVAegtgfpPQEPRCSAQVKTAPTSS---PAEPHCWPAEAAPGTGTEPTCSQEGKLRPeprregeAQEAASETQPLSSPP 2094
Cdd:PHA03378 675 QPS------PTGANTMLPIQWAPGTMqppPRAPTPMRPPAAPPGRAQRPAAATGRARP-------PAAAPGRARPPAAAP 741
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2095 TAASSKAPSGGSAQPPEGHPGKAEPSRAK-SRPLPNMPKLVIPSA--------ATKFPPEItvtPPTPTLLSPKGSISEE 2165
Cdd:PHA03378 742 GRARPPAAAPGRARPPAAAPGRARPPAAApGAPTPQPPPQAPPAPqqrprgapTPQPPPQA---GPTSMQLMPRAAPGQQ 818
|
250 260
....*....|....*....|....*....
gi 1622841216 2166 TKQKLKSAILSAQSAANVRKESLCQPALE 2194
Cdd:PHA03378 819 GPTKQILRQLLTGGVKRGRPSLKKPAALE 847
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1919-2181 |
1.19e-07 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 57.23 E-value: 1.19e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1919 SGDTPTTPK-HPKDS-RENFFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASAStldlskdpgpPR 1996
Cdd:pfam05109 483 SGASPVTPSpSPRDNgTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTT----------PT 552
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1997 PHRHEATPSMASLGPEGE--ELARVAEGTGFPPQEPRCsaqvkTAPTSSPAEPHCWPA-EAAPGTGTEPTCSQEGKLRPE 2073
Cdd:pfam05109 553 PNATSPTPAVTTPTPNATipTLGKTSPTSAVTTPTPNA-----TSPTVGETSPQANTTnHTLGGTSSTPVVTSPPKNATS 627
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2074 PRREGEAQEAASETQPLS----------SPPTA--ASSKAPSGGSAQPPEGH------PGKAEPSR-AKSRPLP---NMP 2131
Cdd:pfam05109 628 AVTTGQHNITSSSTSSMSlrpssisetlSPSTSdnSTSHMPLLTSAHPTGGEnitqvtPASTSTHHvSTSSPAPrpgTTS 707
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1622841216 2132 KLVIP--SAATKFPPEITVTPPTPtllsPKGSISEETKQKLKSAILSAQSAA 2181
Cdd:pfam05109 708 QASGPgnSSTSTKPGEVNVTKGTP----PKNATSPQAPSGQKTAVPTVTSTG 755
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1926-2154 |
2.02e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 56.87 E-value: 2.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1926 PKHPKDSREnffPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPP---------------------SASAS 1984
Cdd:PHA03247 2475 PGAPVYRRP---AEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVgepvhprmltwirgleelasdDAGDP 2551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1985 TLDLSKDPGPPRPHRHEATPSMASLGPEGEELARvAEGTGFPPQ--------EPRCSAQVKTAPTSSPAEPHCWP----- 2051
Cdd:PHA03247 2552 PPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSR-ARRPDAPPQsarprapvDDRGDPRGPAPPSPLPPDTHAPDpppps 2630
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2052 ----AEAAPGTGTEPTCSQE--------GKLRPePRREGEAQEAASETQPLSSP--PTAASSKAPSGGSAQPPEGHPGKA 2117
Cdd:PHA03247 2631 pspaANEPDPHPPPTVPPPErprddpapGRVSR-PRRARRLGRAAQASSPPQRPrrRAARPTVGSLTSLADPPPPPPTPE 2709
|
250 260 270
....*....|....*....|....*....|....*..
gi 1622841216 2118 EPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPT 2154
Cdd:PHA03247 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPA 2746
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1906-2131 |
7.68e-07 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 54.61 E-value: 7.68e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1906 CQVHLGAAAQRQASGDTPTTPKHPKDSRENffpvTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTviTCPPSASAST 1985
Cdd:PRK07764 582 WQVEAVVGPAPGAAGGEGPPAPASSGPPEE----AARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAP--GVAAPEHHPK 655
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1986 LDLSKDPGPPRPHRHEATPSMASLGPEGeelaRVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCS 2065
Cdd:PRK07764 656 HVAVPDASDGGDGWPAKAGGAAPAAPPP----APAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622841216 2066 QEGKLRPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPLPNMP 2131
Cdd:PRK07764 732 SPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. |
1913-2191 |
1.18e-06 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 53.81 E-value: 1.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1913 AAQRQASGDTPTTPKHPKDSRENFFPVTVaPTAP--------------DPVPADSAQRPSDAHTkPRPALAAATTVITCP 1978
Cdd:pfam17823 37 AGKQNASGDAVPRADNKSSEQ*NFCAATA-APAPvtltkgtsaahlnsTEVTAEHTPHGTDLSE-PATREGAADGAASRA 114
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1979 PSASASTldlskdpgppRPHRHEATPSMASLGPEGEELArvaegtgfPPQEPRCSAQVKTAPTSSPAephcwpAEAAPGT 2058
Cdd:pfam17823 115 LAAAASS----------SPSSAAQSLPAAIAALPSEAFS--------APRAAACRANASAAPRAAIA------AASAPHA 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2059 GTeptcsqegklrPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPLPNMPKlVIPSA 2138
Cdd:pfam17823 171 AS-----------PAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGN-SSPAA 238
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622841216 2139 ATKFPPEITVTP------------------------PTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQP 2191
Cdd:pfam17823 239 GTVTAAVGTVTPaalatlaaaagtvasaagtinmgdPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQP 315
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
38-188 |
1.24e-06 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 50.34 E-value: 1.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 38 ALYHKALDLQKHDRFEESAKAYHELLEARLLREAVSSGDEKEGLKHPGLILKYSTYKNLAQLAAQREDLETAMEFYLEAV 117
Cdd:COG5010 2 RALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLEQAL 81
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622841216 118 MLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG5010 82 QLDPNNPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTS 152
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
99-188 |
4.27e-06 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 47.09 E-value: 4.27e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 99 LAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARhAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCL 178
Cdd:COG3063 1 LYLKLGDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79
|
90
....*....|
gi 1622841216 179 YFICKALEKD 188
Cdd:COG3063 80 AYLERALELD 89
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1918-2168 |
5.01e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 51.99 E-value: 5.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1918 ASGDTPTTP--KHPKDSRENFFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVitcPPSASAstldlskdPGPP 1995
Cdd:PHA03378 646 LVFPTPHQPpqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM---RPPAAP--------PGRA 714
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1996 RPHRHEATPSMASLG-PEGEELARVAEGTGFPPQ-EPRCSAQVKTAPTSSPaephcwPAEAAPGTgtePTCSQEGKLRPE 2073
Cdd:PHA03378 715 QRPAAATGRARPPAAaPGRARPPAAAPGRARPPAaAPGRARPPAAAPGRAR------PPAAAPGA---PTPQPPPQAPPA 785
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2074 PRRegEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRplpNMPKLVIPSAATKFPPEItvtpPTP 2153
Cdd:PHA03378 786 PQQ--RPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR---GRPSLKKPAALERQAAAG----PTP 856
|
250
....*....|....*
gi 1622841216 2154 tllSPKGSISEETKQ 2168
Cdd:PHA03378 857 ---SPGSGTSDKIVQ 868
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1890-2186 |
7.68e-06 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 50.92 E-value: 7.68e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1890 IKQVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvaPTAPDPvpadsAQRPSDAHTKPRPALA 1969
Cdd:NF033839 248 IDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP-----GMQPSPQPEKKEVKPE 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1970 AATTVITCPPSASASTLDLSKDPGPPRPhrhEATPSMASLGPEGEELARvAEGTGFPPQEPRCSAQVKTAPTSSPAEPHC 2049
Cdd:NF033839 314 PETPKPEVKPQLEKPKPEVKPQPEKPKP---EVKPQLETPKPEVKPQPE-KPKPEVKPQPEKPKPEVKPQPETPKPEVKP 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2050 WPAEAAPGTGTEP-TCSQEGKLRPE---PRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQ-PPEGHPGKAEPSRAKS 2124
Cdd:NF033839 390 QPEKPKPEVKPQPeKPKPEVKPQPEkpkPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvKPQPETPKPEVKPQPE 469
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1622841216 2125 RPLPNM-PKLVIPSAATKFPPEITVTPPTPTLLSP------KGSISEETKQKLKSAILSAQSAANVRKE 2186
Cdd:NF033839 470 KPKPEVkPQPEKPKPDNSKPQADDKKPSTPNNLSKdkqpsnQASTNEKATNKPKKSLPSTGSISNLALE 538
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1911-2173 |
9.60e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.92 E-value: 9.60e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1911 GAAAQRQASGDTPTTPKHPKDSRENffpvtvaPTAPDPVPADSAQRPSdahTKPRPALAAATTVITCPPSASASTLDLSK 1990
Cdd:pfam03154 319 GQSQQRIHTPPSQSQLQSQQPPREQ-------PLPPAPLSMPHIKPPP---TTPIPQLPNPQSHKHPPHLSGPSPFQMNS 388
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1991 D-PGPP--RP-------HRHEATPSMASLGPEGEELARvaegtgfPPQEPRCSAQVKTAPTSSPAEPhcwpaeaaPGTGT 2060
Cdd:pfam03154 389 NlPPPPalKPlsslsthHPPSAHPPPLQLMPQSQQLPP-------PPAQPPVLTQSQSLPPPAASHP--------PTSGL 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2061 EPTCSQEgklrPEPRREGEAQEAASETQPlSSPPTAASskaPSGGSAQPPeghpgkAEPSRAKSRPLPNMPKLVIPSAAT 2140
Cdd:pfam03154 454 HQVPSQS----PFPQHPFVPGGPPPITPP-SGPPTSTS---SAMPGIQPP------SSASVSSSGPVPAAVSCPLPPVQI 519
|
250 260 270
....*....|....*....|....*....|....*...
gi 1622841216 2141 KFPP-----EITVTPPTPTLLSPKGSISEETKQKLKSA 2173
Cdd:pfam03154 520 KEEAldeaeEPESPPPPPRSPSPEPTVVNTPSHASQSA 557
|
|
| NlpI |
COG4785 |
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis]; |
29-174 |
1.93e-05 |
|
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 443815 [Multi-domain] Cd Length: 223 Bit Score: 47.99 E-value: 1.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 29 KEAQEAEAFALYHKALDLQKHDRFEE-------SAKAYHELLEARLLREAVSSGDEKEGLKHPGLIlkySTYKNLAQLAA 101
Cdd:COG4785 8 LLLALALAAAAASKAAILLAALLFAAvlalaiaLADLALALAAAALAAAALAAERIDRALALPDLA---QLYYERGVAYD 84
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622841216 102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4785 85 SLGDYDLAIADFDQALELDPDLAEAYNNRGLAYLLLGDYDAALEDFDRALELDPDYAYAYLNRGIALYYLGRY 157
|
|
| NrfG |
COG4235 |
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ... |
95-174 |
2.07e-05 |
|
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443378 [Multi-domain] Cd Length: 131 Bit Score: 46.15 E-value: 2.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 95 NLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4235 22 LLGRAYLRLGRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1893-2088 |
3.05e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 49.10 E-value: 3.05e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1893 VDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPK----HPKDSRENFFPVTVAPTAPDPVPADS-AQRPSDAHTKPRPA 1967
Cdd:PRK12323 395 AAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPApealAAARQASARGPGGAPAPAPAPAAAPAaAARPAAAGPRPVAA 474
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1968 LAAATTVITCPPSASASTLD-------LSKDPGPPRPHRHEATPSMASLGPEGEELARVAEGTgFPPQEPRCSAQVKTAP 2040
Cdd:PRK12323 475 AAAAAPARAAPAAAPAPADDdpppweeLPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDA-FETLAPAPAAAPAPRA 553
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2041 T--SSPAEPHCWPAEAAPGTGTEPTCSQEGKLRPEPRReGEAQEAASETQ 2088
Cdd:PRK12323 554 AaaTEPVVAPRPPRASASGLPDMFDGDWPALAARLPVR-GLAQQLARQSE 602
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1912-2086 |
4.08e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 48.83 E-value: 4.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1912 AAAQRQASGDTPTTPKHPKDSRENFFPVTVAPTAPDPVPaDSAQRPSDAHTKPRPALAAATTVITCPPSASASTLDLSKD 1991
Cdd:PRK07764 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1992 PGPPRPHRH------EATPSMASLGPEGEELARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCS 2065
Cdd:PRK07764 701 PAPAPAATPpagqadDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
|
170 180
....*....|....*....|....*..
gi 1622841216 2066 QEGKLRPEPRRE------GEAQEAASE 2086
Cdd:PRK07764 781 EEEEMAEDDAPSmddedrRDAEEVAME 807
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
1-155 |
4.50e-05 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 45.72 E-value: 4.50e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1 MIRIAALNASSTIEDDHEGSFKSHKTQTKEAQEAEAFALYHKALDLQKhdRFEESAKAYHELLEArllreavssgdekeg 80
Cdd:COG5010 21 RTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLG--DFEESLALLEQALQL--------------- 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622841216 81 lkHPGlilKYSTYKNLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNP 155
Cdd:COG5010 84 --DPN---NPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1938-2145 |
4.75e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 48.72 E-value: 4.75e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1938 PVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAAttvitcPPSASASTLDLSKDPGPPRPHRhEATPSMASLGPEGEELA 2017
Cdd:PRK12323 375 ATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAA------APAAAAAARAVAAAPARRSPAP-EALAAARQASARGPGGA 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2018 RVAEGTGFP---PQEPRCSAQVKTAPTSSPAEP--HCWPAEAAPGTGTEPTCSQEGKLRPEPrrEGEAQEAASETQPLSS 2092
Cdd:PRK12323 448 PAPAPAPAAapaAAARPAAAGPRPVAAAAAAAParAAPAAAPAPADDDPPPWEELPPEFASP--APAQPDAAPAGWVAES 525
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1622841216 2093 PPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPL--PNMPKLVIPSAATKFPPE 2145
Cdd:PRK12323 526 IPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVvaPRPPRASASGLPDMFDGD 580
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
2024-2172 |
6.81e-05 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 48.15 E-value: 6.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2024 GFPPQEPRCSAQVKTAP-TSSPAEPHCWPAEAA--PGTGTEPTCSQEGKL--RPEPRREGEAQEAASETQPLSSPPTAAS 2098
Cdd:PTZ00449 508 DEPPEGPEASGLPPKAPgDKEGEEGEHEDSKESdePKEGGKPGETKEGEVgkKPGPAKEHKPSKIPTLSKKPEFPKDPKH 587
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622841216 2099 SKAPsggsaQPPEGHPGKAEPSRAKSRPLPNMPKLV-IPSAATKfpPEITVTPPTPtlLSPKGSISEETKQKLKS 2172
Cdd:PTZ00449 588 PKDP-----EEPKKPKRPRSAQRPTRPKSPKLPELLdIPKSPKR--PESPKSPKRP--PPPQRPSSPERPEGPKI 653
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1912-2151 |
7.75e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 47.92 E-value: 7.75e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1912 AAAQRQASGDTPTTPKHPKDSRENFFPVTVAPTAPDPVP--ADSAQRPSDAH-TKPRPALAAATTVITCPPSASASTLDL 1988
Cdd:PRK07003 395 AVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPatADRGDDAADGDaPVPAKANARASADSRCDERDAQPPADS 474
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1989 SKDPGP----PRPHRHEATPSMASLGPEGEELARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPhcwpAEAAPgtgteptc 2064
Cdd:PRK07003 475 GSASAPasdaPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTP----AAAAP-------- 542
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2065 sqegklrpePRREGEAQEAASETQPLSSPPTAASSKAPsGGSAQPPEGHPgkAEPSRAKSRPLPNMPKLVIPSAATKFPP 2144
Cdd:PRK07003 543 ---------AARAGGAAAALDVLRNAGMRVSSDRGARA-AAAAKPAAAPA--AAPKPAAPRVAVQVPTPRARAATGDAPP 610
|
....*..
gi 1622841216 2145 EITVTPP 2151
Cdd:PRK07003 611 NGAARAE 617
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1992-2110 |
1.00e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 47.67 E-value: 1.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1992 PGPPRPHRHEATPSMASLGPEGEelarvaegtgfPPQEPRCSAQVKTAPTSSPAEPHCwPAEAAPGTGTEPTCSQEGKLR 2071
Cdd:PRK07764 396 AAAPSAAAAAPAAAPAPAAAAPA-----------AAAAPAPAAAPQPAPAPAPAPAPP-SPAGNAPAGGAPSPPPAAAPS 463
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1622841216 2072 PEPRREGEA---QEAASETQPLSSPPTAASSKAPSGGSAQPP 2110
Cdd:PRK07764 464 AQPAPAPAAapePTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
2001-2158 |
1.10e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 47.56 E-value: 1.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2001 EATPSMASLGPEGEELARVAEGTGFPPqeprcsaqvktAPTSSPAEPHCWPAEAAPgtgTEPTCSQEGKLRPEPRREGEA 2080
Cdd:PRK12323 371 GAGPATAAAAPVAQPAPAAAAPAAAAP-----------APAAPPAAPAAAPAAAAA---ARAVAAAPARRSPAPEALAAA 436
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622841216 2081 QEAASETQPLSSPPTAASSKAPSGGSAQPPeghPGKAEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSP 2158
Cdd:PRK12323 437 RQASARGPGGAPAPAPAPAAAPAAAARPAA---AGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAP 511
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1940-2159 |
1.11e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.45 E-value: 1.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1940 TVAPTAPDP----VPADSAQRPSDAHTKPrPALAAAT-TVITCPPSASASTLDLSKDPgPPRPHRHEATPSMASLGPEGE 2014
Cdd:pfam03154 143 STSPSIPSPqdneSDSDSSAQQQILQTQP-PVLQAQSgAASPPSPPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQ 220
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2015 ELARVAEGTgFPPQEPRCSAQVKTAPTS--SPAEPHCWPAEAAPGTGTEPTCSQEGKLRPEPRREGEAQ-EAASETQPLS 2091
Cdd:pfam03154 221 TQSTAAPHT-LIQQTPTLHPQRLPSPHPplQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHmQHPVPPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2092 SPPTAASSKAPSGGSAQ------------PPEGHPGKAEPSRakSRPLPNMPkLVIPSAAtkfPPEITVTPPTPTLLSPK 2159
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAapgqsqqrihtpPSQSQLQSQQPPR--EQPLPPAP-LSMPHIK---PPPTTPIPQLPNPQSHK 373
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1931-2131 |
1.17e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 47.47 E-value: 1.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1931 DSRENFFPVTVA---PTAPDPVPADSAQRPSDAHTKPRPALAAA-----TTVITCPPSASASTLDLSKDPGPpRPHRHEA 2002
Cdd:PHA03307 15 AEGGEFFPRPPAtpgDAADDLLSGSQGQLVSDSAELAAVTVVAGaaacdRFEPPTGPPPGPGTEAPANESRS-TPTWSLS 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2003 TPSMASLGPEGEELARVAEGT-GFPPQEPRCSAQVKTAPTSSPA-----EPHCWPAEAAPGTGTEPTCSQEGKlrPEPRR 2076
Cdd:PHA03307 94 TLAPASPAREGSPTPPGPSSPdPPPPTPPPASPPPSPAPDLSEMlrpvgSPGPPPAASPPAAGASPAAVASDA--ASSRQ 171
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1622841216 2077 EGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKaePSRAKSRPLPNMP 2131
Cdd:PHA03307 172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSS--PISASASSPAPAP 224
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
1940-2159 |
1.28e-04 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 47.13 E-value: 1.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1940 TVAPTAPDPVPADSAQRPSDAHTKPRP--ALAAATTVITCPPSASASTLDLSKDPGPPRPHRHEATPSMASLGPEGEELA 2017
Cdd:PRK14086 91 SAGEPAPPPPHARRTSEPELPRPGRRPyeGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQ 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2018 RVaegtGFPPQEPRCSaqvktaPTSSPAEPHCWPAEAAPGTGTEPTCSQEGKlrpEPRREGEAQEAASETQPLSSPptaa 2097
Cdd:PRK14086 171 RL----GFPPRAPYAS------PASYAPEQERDREPYDAGRPEYDQRRRDYD---HPRPDWDRPRRDRTDRPEPPP---- 233
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622841216 2098 sskapsgGSAQPPEGHPGkaePSRAKSRPLPNmpklVIPSAATKFP--PEITVTPPTPTL-LSPK 2159
Cdd:PRK14086 234 -------GAGHVHRGGPG---PPERDDAPVVP----IRPSAPGPLAaqPAPAPGPGEPTArLNPK 284
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
1938-2044 |
1.34e-04 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 46.01 E-value: 1.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1938 PVTVAPTAPDP-VPADSAQRPSDAHTKPRPALAA--------ATTVITCPPSASASTLDLSKDPGPPRPHRHEATPSMAS 2008
Cdd:PHA02682 76 PSGQSPLAPSPaCAAPAPACPACAPAAPAPAVTCpapapacpPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPT 155
|
90 100 110
....*....|....*....|....*....|....*..
gi 1622841216 2009 LGPEGEELARVAEGTGFPPQEPRCSA-QVKTAPTSSP 2044
Cdd:PHA02682 156 PKPAPAAKPIFLHNQLPPPDYPAASCpTIETAPAASP 192
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1912-2163 |
1.37e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.45 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1912 AAAQRQASGDTPttpkhPKDSRENFFPVTVAPTAPDPVPADSAQRPSDAHTKPrPALAAATT---VITCPPSASASTLDL 1988
Cdd:pfam03154 160 SSAQQQILQTQP-----PVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVP-PQGSPATSqppNQTQSTAAPHTLIQQ 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1989 SKDPGPPR-PHRHEATPSMASLGPEGEELARvaegtgfPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQE 2067
Cdd:pfam03154 234 TPTLHPQRlPSPHPPLQPMTQPPPPSQVSPQ-------PLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQ 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2068 GKLRPEPRREGEAQEAASETQPlssPPTAASSkapsggSAQPPEGHPGKAEP-SRAKSRPLPNMPKLVIPSA-ATKFPPE 2145
Cdd:pfam03154 307 SQVPPGPSPAAPGQSQQRIHTP---PSQSQLQ------SQQPPREQPLPPAPlSMPHIKPPPTTPIPQLPNPqSHKHPPH 377
|
250 260
....*....|....*....|....*.
gi 1622841216 2146 ITVTP--------PTPTLLSPKGSIS 2163
Cdd:pfam03154 378 LSGPSpfqmnsnlPPPPALKPLSSLS 403
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1913-2143 |
1.66e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 1.66e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1913 AAQRQASGDTPT-TPKHPKDSR---ENFFPVTVA------PTAPDPVPADSAQRPSDAHTK----------PRP------ 1966
Cdd:PHA03247 270 ETARGATGPPPPpEAAAPNGAAappDGVWGAALAgaplalPAPPDPPPPAPAGDAEEEDDEdgamevvsplPRPrqhypl 349
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1967 ALAAATTVITCPPSasaSTLDLSKDPGPP----------RPHRHEATPSMAslGPEGEELARVAEGTGFPPQEPRCSAQV 2036
Cdd:PHA03247 350 GFPKRRRPTWTPPS---SLEDLSAGRHHPkraslptrkrRSARHAATPFAR--GPGGDDQTRPAAPVPASVPTPAPTPVP 424
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2037 KTAPtSSPAEPhcwPAEAAPGTGTEPTCSQEGKLRPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGK 2116
Cdd:PHA03247 425 ASAP-PPPATP---LPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPGADLAELLGRHPDT 500
|
250 260 270
....*....|....*....|....*....|....
gi 1622841216 2117 -------AEPSRAKSRPLPNMPKLVIPSAATKFP 2143
Cdd:PHA03247 501 agtvvrlAAREAAIAREVAECSRLTINALRSPFP 534
|
|
| PHA03269 |
PHA03269 |
envelope glycoprotein C; Provisional |
1940-2096 |
1.76e-04 |
|
envelope glycoprotein C; Provisional
Pssm-ID: 165527 [Multi-domain] Cd Length: 566 Bit Score: 46.65 E-value: 1.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1940 TVAPTAPDPVPADSAQrpsdAHTKPRPALAAATTVITCPPSASASTLDLSKDPGP-PRPHrheatpSMASLGPEgeelar 2018
Cdd:PHA03269 34 SAATQKPDPAPAPHQA----ASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPaPAPH------QAASRAPD------ 97
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622841216 2019 vaegtgfppqePRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQEGKlrPEPrregeaqeaASETQPlSSPPTA 2096
Cdd:PHA03269 98 -----------PAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKK--PDP---------AAHTQH-SPPPFA 152
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1914-2119 |
1.86e-04 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 47.09 E-value: 1.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1914 AQRQASGDTPTTPKHPKDSRENFFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASASTLDLSKDPg 1993
Cdd:PHA03307 232 AGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPS- 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1994 PPRPHRHEATPSMASLgPEGEELARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHcwPAEAAPGTGTEPTCSQEGKlRPE 2073
Cdd:PHA03307 311 SPRASSSSSSSRESSS-SSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSS--PRKRPRPSRAPSSPAASAG-RPT 386
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1622841216 2074 PRReGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEP 2119
Cdd:PHA03307 387 RRR-ARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYP 431
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1912-2131 |
2.09e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 46.77 E-value: 2.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1912 AAAQRQASGDTPTTPKHPkdsrenffPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASAstldlskd 1991
Cdd:PRK07003 428 AAPAPPATADRGDDAADG--------DAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFE-------- 491
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1992 pgpPRPHRHEATPSMASLGPEGEELARVAEGTGFPPQEPRcsaqvktAPTSSPAEPHC-WPAEAAPGTGTEPTCSQEGKL 2070
Cdd:PRK07003 492 ---PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPP-------APEARPPTPAAaAPAARAGGAAAALDVLRNAGM 561
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622841216 2071 RPEPRREGEAQEAASETQPLSSPPTAASSKAP-----SGGSAQPPEGHPGKAEP------SRAKSRPLPNMP 2131
Cdd:PRK07003 562 RVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAvqvptPRARAATGDAPPNGAARaeqaaeSRGAPPPWEDIP 633
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1913-2111 |
2.11e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.41 E-value: 2.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1913 AAQRQASGDTPTTPKHPKDSRENFFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASASTldlskdP 1992
Cdd:PRK12323 393 AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPA------A 466
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1993 GPPRPHRHEATPSMASLGPEGeelARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQEGKLRP 2072
Cdd:PRK12323 467 AGPRPVAAAAAAAPARAAPAA---APAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAP 543
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 1622841216 2073 EP-RREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPE 2111
Cdd:PRK12323 544 APaAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPA 583
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1939-2076 |
2.47e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 46.52 E-value: 2.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1939 VTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASAstldlskdPGPPRPHRHEATPSMASLGPEGEELAR 2018
Cdd:PRK07764 385 LGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAP--------QPAPAPAPAPAPPSPAGNAPAGGAPSP 456
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1622841216 2019 VAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQEGKLRPEPRR 2076
Cdd:PRK07764 457 PPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRE 514
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ... |
1987-2131 |
3.70e-04 |
|
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 45.14 E-value: 3.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1987 DLSkDPGPPRPHRHEATPSMASLGPEGEELARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHC------WPAEAAPGTGT 2060
Cdd:NF040712 186 WLI-DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGA 264
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1622841216 2061 EPTCSQEGKLRPEPRRegEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSR-PLPNMP 2131
Cdd:NF040712 265 APAAEPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
92-158 |
6.12e-04 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 40.92 E-value: 6.12e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622841216 92 TYKNLAQLAAQREDLETAMEFyLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHW 158
Cdd:COG3063 28 ALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDYDEALAYLERALELDPSAL 93
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
1936-2063 |
8.43e-04 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 44.32 E-value: 8.43e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1936 FFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASASTLDLSKDPGPPRPHRHEATPSMASLGPEGEE 2015
Cdd:PRK14951 364 FKPAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 1622841216 2016 LARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPT 2063
Cdd:PRK14951 444 AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPT 491
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
1912-2140 |
9.60e-04 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 44.29 E-value: 9.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1912 AAAQRQASGDTPTTPKHPKDSRenffPVTVAPTAPDPVPADSaqrpsdahtkPRPALAAATTvitcpPSASASTLDLSKd 1991
Cdd:pfam03546 250 TPAQAKPALKTPQTKASPRKGT----PITPTSAKVPPVRVGT----------PAPWKAGTVT-----SPACASSPAVAR- 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1992 pGPPRPhrhEATPSMASLGPEGEELARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPA---EAAPGTGTEPTCSQEG 2068
Cdd:pfam03546 310 -GAQRP---EEDSSSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVppgKTGPAVAQVKAEAQED 385
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622841216 2069 K--LRPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPLPnmPKLVIPSAAT 2140
Cdd:pfam03546 386 SesSEEESDSEEAAATPAQVKASGKTPQAKANPAPTKASSAKGAASAPGKVVAAAAQAKQGS--PAKVKPPART 457
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1978-2191 |
1.37e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 43.76 E-value: 1.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1978 PPSASASTlDLSKdPGPPRPHRHEATPSMASLGPEGEELAR---VAEGTGF----PPQEPrcsaqVKTAPTSSPAEPHCW 2050
Cdd:PLN03209 329 PPKESDAA-DGPK-PVPTKPVTPEAPSPPIEEEPPQPKAVVprpLSPYTAYedlkPPTSP-----IPTPPSSSPASSKSV 401
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2051 PAEAAPGTGTEPTCSQEGKLRPEPRregEAQEAASETQPLS------------SP-PTAASSKAPSGGSAQPPEGHPGKA 2117
Cdd:PLN03209 402 DAVAKPAEPDVVPSPGSASNVPEVE---PAQVEAKKTRPLSpyaryedlkpptSPsPTAPTGVSPSVSSTSSVPAVPDTA 478
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622841216 2118 EPSRAKSRPLPNMPKlviPSAATKFPPEITVTPPT-PTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQP 2191
Cdd:PLN03209 479 PATAATDAAAPPPAN---MRPLSPYAVYDDLKPPTsPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
1998-2167 |
1.81e-03 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 43.01 E-value: 1.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1998 HRHEATPSMASLGPEGEELARVAEGTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQEGKLRPEPRRE 2077
Cdd:PTZ00436 169 HRHKARKQELRKREKDRERARREDAAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAA 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2078 GEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPLPnmPKLVIPSAATKFPPEITVTPPTPTLLS 2157
Cdd:PTZ00436 249 APAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAAP--AKAAAAPAKAAAAPAKAAAPPAKAAAP 326
|
170
....*....|
gi 1622841216 2158 PKGSISEETK 2167
Cdd:PTZ00436 327 PAKAATPPAK 336
|
|
| PHA03264 |
PHA03264 |
envelope glycoprotein D; Provisional |
2051-2157 |
1.85e-03 |
|
envelope glycoprotein D; Provisional
Pssm-ID: 223029 [Multi-domain] Cd Length: 416 Bit Score: 43.07 E-value: 1.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2051 PAEAAPGTGTEPTCSQE-GKLRPEPRREGEAQEAASETQPlsspptAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPLPN 2129
Cdd:PHA03264 255 PPYFEESKGYEPPPAPSgGSPAPPGDDRPEAKPEPGPVED------GAPGRETGGEGEGPEPAGRDGAAGGEPKPGPPRP 328
|
90 100 110
....*....|....*....|....*....|...
gi 1622841216 2130 MPKLVIPSA-----ATKFPPEITVTPPTPTLLS 2157
Cdd:PHA03264 329 APDADRPEGwpsleAITFPPPTPATPAVPRARP 361
|
|
| TPR_12 |
pfam13424 |
Tetratricopeptide repeat; |
36-119 |
1.92e-03 |
|
Tetratricopeptide repeat;
Pssm-ID: 315987 [Multi-domain] Cd Length: 77 Bit Score: 38.91 E-value: 1.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 36 AFALYHKALDLQKHDRFEESAKAYHELLEarlLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424 3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70
|
....
gi 1622841216 116 AVML 119
Cdd:pfam13424 71 ALAL 74
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
2065-2173 |
2.38e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 42.63 E-value: 2.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2065 SQEGKLRPEPRREGEAQEAASETQPLSSPPTAASS------KAPSGGSAQPPEGHPGKAEPSRAKSRPLPNM-PKLVIPS 2137
Cdd:PHA03291 167 PAEGTLAAPPLGEGSADGSCDPALPLSAPRLGPADvfvpatPRPTPRTTASPETTPTPSTTTSPPSTTIPAPsTTIAAPQ 246
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1622841216 2138 AATKFPPEITVTPPTP----TLLSPKGSISEETKQKLKSA 2173
Cdd:PHA03291 247 AGTTPEAEGTPAPPTPgggeAPPANATPAPEASRYELTVT 286
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
1911-2153 |
3.55e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 42.36 E-value: 3.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1911 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASASTL 1986
Cdd:COG5180 152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLTGGADHPRPEAASS 231
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1987 DLSKDPGPPRPHRHEAT----PSMASLGP--EGEELARVAEGTGFPPQEPRCSAQ---VKTAPTSSPAEPHCWPAEAAPG 2057
Cdd:COG5180 232 PKVDPPSTSEARSRPATvdaqPEMRPPADakERRRAAIGDTPAAEPPGLPVLEAGsepQSDAPEAETARPIDVKGVASAP 311
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2058 TGTEPTCSQEGKLRPEPRREGEAQEaasetQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSRAKS------RPLPNMP 2131
Cdd:COG5180 312 PATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDGAPFQP 386
|
250 260
....*....|....*....|..
gi 1622841216 2132 KLVIPSAATKFPPeiTVTPPTP 2153
Cdd:COG5180 387 PNGAPQPGLGRRG--APGPPMG 406
|
|
| TPR_21 |
pfam09976 |
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat. |
48-151 |
3.59e-03 |
|
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.
Pssm-ID: 430959 [Multi-domain] Cd Length: 194 Bit Score: 41.03 E-value: 3.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 48 KHDRFEESAKAYHELLEArllreaVSSGDEKEGL--------KHPGlilkySTYKNLAQL-----AAQREDLETAMEfYL 114
Cdd:pfam09976 32 QRSQAEEASALYQQLLEA------VAAGDAAKAQaaaaqlkdEYGG-----TGYAALAALllakaAVEAGDLAAAKA-QL 99
|
90 100 110
....*....|....*....|....*....|....*...
gi 1622841216 115 EAVMLDSTDVNLwykiGHVA-LRLIRIPLARHAFEEGL 151
Cdd:pfam09976 100 EWVADNAKDEAL----KALArLRLARVLLAQGKYDEAL 133
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
34-174 |
3.62e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 41.64 E-value: 3.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 34 AEAFALYHKALDLQKHDRFEESAKAYHELLEarllreavssgdekeglKHPGLIlkySTYKNLAQLAAQREDLETAMEFY 113
Cdd:COG2956 6 AAALGWYFKGLNYLLNGQPDKAIDLLEEALE-----------------LDPETV---EAHLALGNLYRRRGEYDRAIRIH 65
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622841216 114 LEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG2956 66 QKLLERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDW 126
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
1896-2182 |
4.08e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 42.29 E-value: 4.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1896 EAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRENFFPVTV--------APTAPDPVPADSAQRPSDAHTKPRPA 1967
Cdd:PHA03369 345 NEILKTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIpysvparsPMTAYPPVPQFCGDPGLVSPYNPQSP 424
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1968 LAAATT--VITCPPSASAS-----TLDLSKDPGPPRPHRHEATPS---------------MASLGPEGEELARVAEGT-- 2023
Cdd:PHA03369 425 GTSYGPepVGPVPPQPTNPyvmpiSMANMVYPGHPQEHGHERKRKrggelkeelietlklVKKLKEEQESLAKELEATah 504
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2024 -GFPPQEprCSAQVKTAPTSSPA---EPHCWPAEAAPGTGTeptcsqegkLRPEPRREGEA-QEAASETQPLSSP----- 2093
Cdd:PHA03369 505 kSEIKKI--AESEFKNAGAKTAAaniEPNCSADAAAPATKR---------ARPETKTELEAvVRFPYQIRNMESPafvhs 573
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2094 PTAASSKAPSGGSAQPPEGHPGKAEPSRAKSRPLPNMPKLVIPSAAtkfppeitVTPPTPTLLSPKGSISEETKQKLKSA 2173
Cdd:PHA03369 574 FTSTTLAAAAGQGSDTAEALAGAIETLLTQASAQPAGLSLPAPAVP--------VNASTPASTPPPLAPQEPPQPGTSAP 645
|
....*....
gi 1622841216 2174 ILSAQSAAN 2182
Cdd:PHA03369 646 SLETSLPQQ 654
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
2000-2192 |
5.09e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 41.91 E-value: 5.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2000 HEATPSMASLGPEGEELARVAEGT-GFPPQEPRCSA------QVKTAPTSSPAEPHCWPAEAAPgtgtePTCSQEGKLRP 2072
Cdd:PHA03369 344 HNEILKTASLTAPSRVLAAAAKVAvIAAPQTHTGPAdrqrpqRPDGIPYSVPARSPMTAYPPVP-----QFCGDPGLVSP 418
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2073 -EPRREGEAQEAASETQPLSSPPTaaSSKAPSGGSAQPPEGHPGKAEPSRAKSRPLP-NMPKLVIPSAATKFPPEITVTP 2150
Cdd:PHA03369 419 yNPQSPGTSYGPEPVGPVPPQPTN--PYVMPISMANMVYPGHPQEHGHERKRKRGGElKEELIETLKLVKKLKEEQESLA 496
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1622841216 2151 PTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPA 2192
Cdd:PHA03369 497 KELEATAHKSEIKKIAESEFKNAGAKTAAANIEPNCSADAAA 538
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
1936-2012 |
5.99e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 41.48 E-value: 5.99e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622841216 1936 FFPVTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTviTCPPSASASTLDLSKDPGPPRPHRHEATPSMASLGPE 2012
Cdd:PHA03291 203 FVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST--TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE 277
|
|
| PRK12495 |
PRK12495 |
hypothetical protein; Provisional |
2026-2107 |
6.23e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 183558 [Multi-domain] Cd Length: 226 Bit Score: 40.62 E-value: 6.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2026 PPQEPRCSAQVKTAPTSSPAEPHcwPAEAAPGTGTEPTCSQEGKLRPEPRREGEAQEAASETQPLSSPPTAASSKAPSGG 2105
Cdd:PRK12495 96 PDDDAQPAAEAEAADQSAPPEAS--STSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTP 173
|
..
gi 1622841216 2106 SA 2107
Cdd:PRK12495 174 DA 175
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1967-2154 |
6.73e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 41.70 E-value: 6.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1967 ALAAATTVITCPPSASASTLD--LSKDPG--PPRPHRHEATPSMASLGPegeelARVAEGTGFPPQEPR-CSAQVKTAPT 2041
Cdd:PHA03307 13 AAAEGGEFFPRPPATPGDAADdlLSGSQGqlVSDSAELAAVTVVAGAAA-----CDRFEPPTGPPPGPGtEAPANESRST 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2042 SSPAEPhcWPAEAAPGTGTEPTCSQEGKLRPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGSAQPPEGHPGKAEPSR 2121
Cdd:PHA03307 88 PTWSLS--TLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASD 165
|
170 180 190
....*....|....*....|....*....|...
gi 1622841216 2122 AKSRPLPNMPkLVIPSAATKFPPEITVTPPTPT 2154
Cdd:PHA03307 166 AASSRQAALP-LSSPEETARAPSSPPAEPPPST 197
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1895-2128 |
7.43e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 41.51 E-value: 7.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1895 EEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRENFFPVTVAPTAPDPVPAD-SAQRPSDAHTKPRPALAAATT 1973
Cdd:PRK07764 442 PSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAApAAPAGADDAATLRERWPEILA 521
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1974 VItcpPSASASTLDLSKD---PGPPRPHR----HEATPSMASLG-------------------------PEGEELARVAE 2021
Cdd:PRK07764 522 AV---PKRSRKTWAILLPeatVLGVRGDTlvlgFSTGGLARRFAspgnaevlvtalaeelggdwqveavVGPAPGAAGGE 598
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2022 GTGFPPQEPRCSAQVKTAPTSSPAEPHCWPAEAAPGTGTEPTCSQEGKLRPEPRREGEAQEAASETQPLSSPPTAASSKA 2101
Cdd:PRK07764 599 GPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAP 678
|
250 260
....*....|....*....|....*..
gi 1622841216 2102 PSGGSAQPPEGHPGKAEPSRAKSRPLP 2128
Cdd:PRK07764 679 AAPPPAPAPAAPAAPAGAAPAQPAPAP 705
|
|
| PTZ00436 |
PTZ00436 |
60S ribosomal protein L19-like protein; Provisional |
1911-2078 |
7.71e-03 |
|
60S ribosomal protein L19-like protein; Provisional
Pssm-ID: 185616 [Multi-domain] Cd Length: 357 Bit Score: 41.09 E-value: 7.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1911 GAAAQRQASGDTPTTPKHPKDSRenffpvTVAPTAPDPVPADSAQRPSDAHTKPRPALAAATTVITCPPSASASTldlSK 1990
Cdd:PTZ00436 196 AAAAKQKAAAKKAAAPSGKKSAK------AAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAPP---AK 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1991 DPGPPrphRHEATPSMASLGPEGEELARVAEGTGFPPQEPRCSAQVKTAP---TSSPAEPHCWPAEAAPGTGTEPTCSQE 2067
Cdd:PTZ00436 267 AAAPP---AKAAAPPAKAAAPPAKAAAPPAKAAAAPAKAAAAPAKAAAAPakaAAPPAKAAAPPAKAATPPAKAAAPPAK 343
|
170
....*....|.
gi 1622841216 2068 GKLRPEPRREG 2078
Cdd:PTZ00436 344 AAAAPVGKKAG 354
|
|
| PRK14951 |
PRK14951 |
DNA polymerase III subunits gamma and tau; Provisional |
1911-2053 |
8.59e-03 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237865 [Multi-domain] Cd Length: 618 Bit Score: 41.24 E-value: 8.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1911 GAAAQRQASGDTPTTPKH--PKDSREnffPVTVAPTAPDPVP--ADSAQRPSDAHTKPRPALAAATTVITCPPSASASTL 1986
Cdd:PRK14951 369 AAEAAAPAEKKTPARPEAaaPAAAPV---AQAAAAPAPAAAPaaAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAV 445
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622841216 1987 DLSKDPGPPRPHRHEATPsmaslgpegeelARVAEGTGFPPQEPrcsaqvktAPTSSPAEPHCWPAE 2053
Cdd:PRK14951 446 ALAPAPPAQAAPETVAIP------------VRVAPEPAVASAAP--------APAAAPAAARLTPTE 492
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
1965-2131 |
9.06e-03 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 40.71 E-value: 9.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 1965 RPALAAA---TTVITCPPSASASTLDLSKDP-----GPPRPHRH--------EATPSmASLGPEGeeLARV-AEGTGFPP 2027
Cdd:PHA03291 99 RPAVAFTlcrSTRRTQSPAYATLTLDLARQPllrarGAARAVVGlyvlrvwvEGATN-ASLFPLG--LAAFpAEGTLAAP 175
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622841216 2028 QEPRCSAQVKTAPTSSPAEPHCWPAEA-APGTGTEPTCSqegklrPEPRREGEAQEAASETQPLSSPPTAASSKAPSGGS 2106
Cdd:PHA03291 176 PLGEGSADGSCDPALPLSAPRLGPADVfVPATPRPTPRT------TASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGT 249
|
170 180
....*....|....*....|....*
gi 1622841216 2107 AQPPEGHPGKAEPSRAKSRPLPNMP 2131
Cdd:PHA03291 250 TPEAEGTPAPPTPGGGEAPPANATP 274
|
|
|
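Each hit above carries a statistics line of the form `Pssm-ID: … [Multi-domain] Cd Length: … Bit Score: … E-value: …` and a query interval such as `2051-2157`. As a minimal sketch of how those fields could be pulled out of this text dump, the snippet below assumes the line layout seen in the listing; the names `parse_stat_line` and `query_span` are hypothetical helpers introduced for illustration, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears in this listing.
# The field order and labels are assumed from the dump, not from a spec.
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<model_type>[^\]]+)\]\s*)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the hit statistics as a dict, or None if the line does not match."""
    m = STAT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "model_type": m["model_type"],  # e.g. "Multi-domain"; None if absent
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def query_span(interval):
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(x) for x in interval.split("-"))
    return end - start + 1


# Values copied from the PHA03264 hit in the listing.
hit = parse_stat_line(
    "Pssm-ID: 223029 [Multi-domain] Cd Length: 416 "
    "Bit Score: 43.07 E-value: 1.85e-03"
)
span = query_span("2051-2157")
```

Parsed this way, the hits can be sorted by E-value or filtered by how much of the query each interval covers, without relying on the column spacing that was lost in the dump.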