|
Name |
Accession |
Description |
Interval |
E-value |
| MEF2_binding |
pfam09047 |
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ... |
2156-2190 |
1.20e-15 |
|
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.
Pssm-ID: 370261 [Multi-domain] Cd Length: 35 Bit Score: 72.19 E-value: 1.20e-15
10 20 30
....*....|....*....|....*....|....*
gi 313151181 2156 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2190
Cdd:pfam09047 1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
|
|
| MEF2_binding |
cd13839 |
Myocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ... |
2156-2190 |
6.83e-14 |
|
Myocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.
Pssm-ID: 260103 [Multi-domain] Cd Length: 35 Bit Score: 67.41 E-value: 6.83e-14
10 20 30
....*....|....*....|....*....|....*
gi 313151181 2156 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2190
Cdd:cd13839 1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
30-205 |
3.26e-11 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 65.80 E-value: 3.26e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457 2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457 62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
|
170
....*....|....*.
gi 313151181 190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457 142 DDADALYNLGIALEKL 157
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1914-2157 |
4.72e-11 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 68.81 E-value: 4.72e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1986
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1987 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2063
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2142
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
|
250
....*....|....*
gi 313151181 2143 kfPPEITVTPPTPTL 2157
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
1914-2183 |
3.60e-10 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 64.98 E-value: 3.60e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1989
Cdd:pfam17823 120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1990 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2067
Cdd:pfam17823 199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2068 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2139
Cdd:pfam17823 266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 313151181 2140 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2183
Cdd:pfam17823 346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1892-2155 |
7.63e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.84 E-value: 7.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1892 IKQVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvpadsvqrpsdaHTKPRPALA 1971
Cdd:NF033839 248 IDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP------------GMQPSPQPE 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1972 AAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEPRHSPQVKMAPTSSPAEPHC 2051
Cdd:NF033839 307 KK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLETPKPEVKPQPEKPKPEVKP 367
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2052 WPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP---PEGHPGKPE--PSRAKS 2126
Cdd:NF033839 368 QPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPevkPQPEKPKPEvkPQPEKP 438
|
250 260 270
....*....|....*....|....*....|...
gi 313151181 2127 ----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2155
Cdd:NF033839 439 kpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ... |
1993-2133 |
1.69e-04 |
|
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 46.30 E-value: 1.69e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2066
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2067 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2133
Cdd:NF040712 269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
1913-2155 |
1.83e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 43.51 E-value: 1.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1913 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1988
Cdd:COG5180 152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1989 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2054
Cdd:COG5180 227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2055 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2128
Cdd:COG5180 307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
|
250 260
....*....|....*....|....*..
gi 313151181 2129 LPNMPKLVIPSAATKFPPeiTVTPPTP 2155
Cdd:COG5180 382 APFQPPNGAPQPGLGRRG--APGPPMG 406
|
|
| TPR_12 |
pfam13424 |
Tetratricopeptide repeat; |
36-119 |
1.98e-03 |
|
Tetratricopeptide repeat;
Pssm-ID: 315987 [Multi-domain] Cd Length: 77 Bit Score: 38.91 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424 3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70
|
....
gi 313151181 116 AVML 119
Cdd:pfam13424 71 ALAL 74
|
|
| sucB |
TIGR01347 |
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ... |
2015-2125 |
1.98e-03 |
|
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived and differ in having an extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]
Pssm-ID: 273565 [Multi-domain] Cd Length: 403 Bit Score: 42.80 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2015 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2094
Cdd:TIGR01347 68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
|
90 100 110
....*....|....*....|....*....|.
gi 313151181 2095 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2125
Cdd:TIGR01347 143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
2000-2143 |
5.35e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 40.06 E-value: 5.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2000 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2079
Cdd:cd21975 19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 313151181 2080 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2143
Cdd:cd21975 99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
|
|
|
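As a rough illustration of how the fields in each hit above can be read programmatically, the following Python sketch parses one "Pssm-ID" summary line and computes percent identity over the aligned columns of the pfam09047 alignment. The function names `parse_summary` and `percent_identity`, the regular expression, and the choices to skip gap columns and ignore letter case are assumptions made for this example; they are not part of the CD-Search output or of any NCBI tool.

```python
import re

# Assumed layout of the per-hit summary line, taken from the listing above.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Turn one 'Pssm-ID: ...' summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "kind": match["kind"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def percent_identity(query, subject):
    """Percent of identical residues over columns where neither row has a gap.

    Both strings must be the same length (one character per alignment column).
    Case is ignored, which is an assumption of this sketch.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    columns = [(q, s) for q, s in zip(query, subject) if q != "-" and s != "-"]
    if not columns:
        return 0.0
    matches = sum(q.upper() == s.upper() for q, s in columns)
    return 100.0 * matches / len(columns)


# Values copied from the pfam09047 (MEF2_binding) hit in the table above.
summary = parse_summary(
    "Pssm-ID: 370261 [Multi-domain] Cd Length: 35 Bit Score: 72.19 E-value: 1.20e-15"
)
query_row = "TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL"
model_row = "TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL"
identity = percent_identity(query_row, model_row)
print(summary["cd_length"], round(identity, 2))
```

For the pfam09047 hit, the two rows differ at three of 35 columns, so the sketch reports 32 of 35 identical positions (about 91.4%).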
|
Name |
Accession |
Description |
Interval |
E-value |
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1912-2162 |
1.25e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 67.27 E-value: 1.25e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1912 LGAAAQRQASGDTPTTPKHPKDSRenffpVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQS 1991
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPA-----GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1992 KdPGPPRPhrpeATPSMASLGPEGEELARVAEGTSFPPqePRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTcsqegk 2071
Cdd:PHA03247 2798 L-PSPWDP----ADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSL-PLGGSVAPGGDVR------ 2863
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2072 lRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSgSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVT 2151
Cdd:PHA03247 2864 -RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES-FALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
|
250
....*....|.
gi 313151181 2152 PPTPTLLSPKG 2162
Cdd:PHA03247 2941 PPLAPTTDPAG 2951
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1925-2195 |
1.36e-10 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 67.27 E-value: 1.36e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1925 PTTPKHPKDSRENFFPVTVVPTAPDPVPADS-VQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPE 2003
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2004 ATPSMASLGPEGEELARVAEGTSFPPqeprhsPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqeGKLRPEPRRDGEAQ 2083
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPA------LPAAPAPPAVPAGPATPGGPARPARPPTTA----GPPAPAPPAAPAAG 2778
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2084 EAASETQPLSSPPTAASSKAPS-SGSAQPPEGHPGKPEPSRAKSRPLPNMPKlviPSAATKFPPEITVTPPTPTL----- 2157
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSpWDPADPPAAVLAPAAALPPAASPAGPLPP---PTSAQPTAPPPPPGPPPPSLplggs 2855
|
250 260 270
....*....|....*....|....*....|....*...
gi 313151181 2158 LSPKGSISEETKQKLKSAILSAQSAANVRkeSLCQPAL 2195
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVR--RLARPAV 2891
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1920-2186 |
1.89e-10 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 66.73 E-value: 1.89e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1920 ASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATT--IITCPPSASA----STLDQSKD 1993
Cdd:PHA03307 124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPeeTARAPSSPPAepppSTPPAAAS 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSM--ASLGPEGEELARVAEGTSFPPQEPRHSP--QVKMAPTSSPAEPHC--WPAEAALGTGAEPTCS 2067
Cdd:PHA03307 204 PRPPRRSSPISASASspAPAPGRSAADDAGASSSDSSSSESSGCGwgPENECPLPRPAPITLptRIWEASGWNGPSSRPG 283
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2068 QEGKLRPEPRRDGEAQEAASETQPLSSPPT----------AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNmpklvi 2137
Cdd:PHA03307 284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRasssssssreSSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP------ 357
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 313151181 2138 PSAATKFPPE-ITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVR 2186
Cdd:PHA03307 358 PPADPSSPRKrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRF 407
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
34-199 |
5.85e-10 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 61.95 E-value: 5.85e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 34 AEAFALYHKALDLQkhdrfEESAKAYHELleASLLREAvssGDEKEGLKH--------PGLIlkySTYKNLAQLAAQRED 105
Cdd:COG0457 25 EEAIEDYEKALELD-----PDDAEALYNL--GLAYLRL---GRYEEALADyeqaleldPDDA---EALNNLGLALQALGR 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 106 LETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKAL 185
Cdd:COG0457 92 YEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLE 171
|
170
....*....|....
gi 313151181 186 EKDCRYSKGLVLKE 199
Cdd:COG0457 172 AAALAALLAAALGE 185
|
|
| Spy |
COG3914 |
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ... |
30-188 |
2.25e-09 |
|
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443119 [Multi-domain] Cd Length: 658 Bit Score: 62.70 E-value: 2.25e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAsllreavssgdekeglkHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG3914 72 AALLLLAALLELAALLLQALGRYEEALALYRRALAL-----------------NPDN---AEALFNLGNLLLALGRLEEA 131
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 313151181 110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG3914 132 LAALRRALALNPDFAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEAIAAYRRALELD 210
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1921-2194 |
6.41e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.88 E-value: 6.41e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1921 SGDTPttPKHPKDSRENFFPVTVVPTAPDPVPADSVQRpSDAHTKPRPALAAATTIITCPPSASASTLDQSkdPGPPRPH 2000
Cdd:PHA03247 2548 AGDPP--PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVT-SRARRPDAPPQSARPRAPVDDRGDPRGPAPPS--PLPPDTH 2622
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2001 RPEATPSMASlgPEGEELARVAEGTSFPPQEPRHSPQVK-----------------MAPTSSPAEPHCWPAEAALGTGAE 2063
Cdd:PHA03247 2623 APDPPPPSPS--PAANEPDPHPPPTVPPPERPRDDPAPGrvsrprrarrlgraaqaSSPPQRPRRRAARPTVGSLTSLAD 2700
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTcsqegklrPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKP----EPSRAKSRPLPNMPklviPS 2139
Cdd:PHA03247 2701 PP--------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpgGPARPARPPTTAGP----PA 2768
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 313151181 2140 AAtkfPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPA 2194
Cdd:PHA03247 2769 PA---PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
30-205 |
1.02e-08 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 58.59 E-value: 1.02e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAS---------LLREAVSSGDEKEGLKHPGLILKYS-----TYKN 95
Cdd:COG2956 70 ERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDpddaealrlLAEIYEQEGDWEKAIEVLERLLKLGpenahAYCE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 96 LAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYT 175
Cdd:COG2956 150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPE 229
|
170 180 190
....*....|....*....|....*....|
gi 313151181 176 TCLYFICKALEKDCRYSKGLVLKEKIFEEQ 205
Cdd:COG2956 230 EALELLRKALELDPSDDLLLALADLLERKE 259
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
33-188 |
6.39e-08 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 53.66 E-value: 6.39e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 33 EAEAFALYHKALDLQKHDRFEESAKAYHELLEASllreavssGDEKEGlkhpglilkystYKNLAQLAAQREDLETAMEF 112
Cdd:COG4783 1 AACAEALYALAQALLLAGDYDEAEALLEKALELD--------PDNPEA------------FALLGEILLQLGDLDEAIVL 60
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181 113 YLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG4783 61 LHEALELDPDEPEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELD 136
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1921-2183 |
1.22e-07 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 57.23 E-value: 1.22e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1921 SGDTPTTPK-HPKDS-RENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPP- 1997
Cdd:pfam05109 483 SGASPVTPSpSPRDNgTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAv 562
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1998 RPHRPEAT-PSMASLGPEGEELARVAEGTSfpPQEPRHSPQVK-----MAPTSSPAEPHCWP--AEAALGTGAEPTCSQE 2069
Cdd:pfam05109 563 TTPTPNATiPTLGKTSPTSAVTTPTPNATS--PTVGETSPQANttnhtLGGTSSTPVVTSPPknATSAVTTGQHNITSSS 640
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2070 G---KLRPEPRRDG---EAQEAASETQPL--SSPPTAA---SSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMpklviP 2138
Cdd:pfam05109 641 TssmSLRPSSISETlspSTSDNSTSHMPLltSAHPTGGeniTQVTPASTSTHHVSTSSPAPRPGTTSQASGPGN-----S 715
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 313151181 2139 SAATKfPPEITVTPPTPtllsPKGSISEETKQKLKSAILSAQSAA 2183
Cdd:pfam05109 716 STSTK-PGEVNVTKGTP----PKNATSPQAPSGQKTAVPTVTSTG 755
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1945-2184 |
2.56e-07 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 56.01 E-value: 2.56e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1945 PTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTldqskdpgPPRPHRPEATPSMASLGPEGEELARVAEG 2024
Cdd:PRK07003 374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--------AAAATRAEAPPAAPAPPATADRGDDAADG 445
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2025 TSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG-AEPTCSQEgklrPEPRRDGEAQEAASETQPLSSPPTAASSKA 2103
Cdd:PRK07003 446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASdAPPDAAFE----PAPRAAAPSAATPAAVPDARAPAAASREDA 521
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2104 PSSGSAQPPEGHPGKP----EPSRA--------------------KSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLS 2159
Cdd:PRK07003 522 PAAAAPPAPEARPPTPaaaaPAARAggaaaaldvlrnagmrvssdRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRA 601
|
250 260
....*....|....*....|....*
gi 313151181 2160 PKGSiseeTKQKLKSAILSAQSAAN 2184
Cdd:PRK07003 602 RAAT----GDAPPNGAARAEQAAES 622
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1923-2196 |
1.06e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 54.30 E-value: 1.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1923 DTPTTPKHP---KDSRENFFPVTVVPTAP---DPVPADSVQRPSdAHTKPRPALAAATTIITCPPSASAstldQSKDPGP 1996
Cdd:PHA03378 607 EPPTTQSHIpetSAPRQWPMPLRPIPMRPlrmQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGHIPY----QPSPTGA 681
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1997 PRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPaephcwpaeaalgtgaeptcsqeGKLRPep 2076
Cdd:PHA03378 682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAP-----------------------GRARP-- 736
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2077 rrdgeAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSrpLPNMPKLVIPSAATKFPPeiTVTPPTPT 2156
Cdd:PHA03378 737 -----PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ--APPAPQQRPRGAPTPQPP--PQAGPTSM 807
|
250 260 270 280
....*....|....*....|....*....|....*....|
gi 313151181 2157 LLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPALE 2196
Cdd:PHA03378 808 QLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALE 847
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
1942-2161 |
1.26e-06 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 53.68 E-value: 1.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1942 TVVPTAPDPVPADSVQRPSDAHTKPRPAlaaattiitcpPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARV 2021
Cdd:PRK14086 91 SAGEPAPPPPHARRTSEPELPRPGRRPY-----------EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWP 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2022 AEGTSFPPQEPRHSPqvkmaptsSPAEPHCWPAEAAlgTGAEPTCSQEGKLRPE---PRRDGEAQEaasetqPLSSPPTA 2098
Cdd:PRK14086 160 RAADDYGWQQQRLGF--------PPRAPYASPASYA--PEQERDREPYDAGRPEydqRRRDYDHPR------PDWDRPRR 223
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2099 ASSKAP--SSGSAQPPEGHPGKPEPSRAKSRPlpnmpklVIPSAATKFP--PEITVTPPTPTL-LSPK 2161
Cdd:PRK14086 224 DRTDRPepPPGAGHVHRGGPGPPERDDAPVVP-------IRPSAPGPLAaqPAPAPGPGEPTArLNPK 284
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1920-2170 |
1.31e-06 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 53.92 E-value: 1.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1920 ASGDTPTTP--KHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIitcPPSASAstldqskdPGPP 1997
Cdd:PHA03378 646 LVFPTPHQPpqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM---RPPAAP--------PGRA 714
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1998 RPhrPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEGKLRPEPR 2077
Cdd:PHA03378 715 QR--PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG-----RARPPAAAPGAPTPQPPPQAPPAPQ 787
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2078 RdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRplpNMPKLVIPSAATKFPPEItvtpPTPtl 2157
Cdd:PHA03378 788 Q--RPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR---GRPSLKKPAALERQAAAG----PTP-- 856
|
250
....*....|...
gi 313151181 2158 lSPKGSISEETKQ 2170
Cdd:PHA03378 857 -SPGSGTSDKIVQ 868
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1919-2145 |
1.40e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 53.73 E-value: 1.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1919 QASGDT-PTTPKHPKDSRENffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA-STLDQSKDPGP 1996
Cdd:PRK12323 367 QSGGGAgPATAAAAPVAQPA--PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlAAARQASARGP 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1997 PRPHRPEATPSMAslgPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEP 2076
Cdd:PRK12323 445 GGAPAPAPAPAAA---PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 313151181 2077 RRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRpLPNMPKLVIPSAATKFP 2145
Cdd:PRK12323 522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG-LPDMFDGDWPALAARLP 589
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1908-2133 |
1.70e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 53.45 E-value: 1.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1908 CQVHLGAAAQRQASGDTPTTPKHPKDSRENffpvTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAST 1987
Cdd:PRK07764 582 WQVEAVVGPAPGAAGGEGPPAPASSGPPEE----AARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHV 657
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1988 LDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcWPAEAALGTGAEPTCS 2067
Cdd:PRK07764 658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQ-PPQAAQGASAPSPAAD 736
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181 2068 QEGKLRPEPRRDGEAQEAasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMP 2133
Cdd:PRK07764 737 DPVPLPPEPDDPPDPAGA-----PAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1946-2153 |
2.48e-06 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 52.96 E-value: 2.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1946 TAPDPVPADSVQRPSDAHTKPRPALAAATTiitcPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGT 2025
Cdd:PRK12323 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAA----PPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGA 447
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2026 SFPPQEPRHSPQVKMAPTSSPAEPhcwpaEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASEtqpLSSPPTAASSKAPs 2105
Cdd:PRK12323 448 PAPAPAPAAAPAAAARPAAAGPRP-----VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPE---FASPAPAQPDAAP- 518
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 313151181 2106 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPP 2153
Cdd:PRK12323 519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
38-188 |
2.84e-06 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 49.19 E-value: 2.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 38 ALYHKALDLQKHDRFEESAKAYHELLEASLLREAVSSGDEKEGLKHPGLILKYSTYKNLAQLAAQREDLETAMEFYLEAV 117
Cdd:COG5010 2 RALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLEQAL 81
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 313151181 118 MLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG5010 82 QLDPNNPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTS 152
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1913-2133 |
3.00e-06 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 52.87 E-value: 3.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1913 GAAAQRQASGDTPTTPKHPKDSREnfFPVTVVPTAPDPVPADSVQRPSDahtkprPALAAATtiitcPPSASASTLDQSK 1992
Cdd:PHA03307 76 GTEAPANESRSTPTWSLSTLAPAS--PAREGSPTPPGPSSPDPPPPTPP------PASPPPS-----PAPDLSEMLRPVG 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPHRPEATPSMASLGPEGEE-------LARVAEGTSFPPQEPRHSPQVKMAP---TSSPAEPHCWPAEAALGTGA 2062
Cdd:PHA03307 143 SPGPPPAASPPAAGASPAAVASDAAssrqaalPLSSPEETARAPSSPPAEPPPSTPPaaaSPRPPRRSSPISASASSPAP 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2063 EPTCSQEGKLR--------PEPRRDGEAQE-------AASETQP----LSSPPTAASSKAPSSGSAQPPEGHPGKPEPSR 2123
Cdd:PHA03307 223 APGRSAADDAGasssdsssSESSGCGWGPEnecplprPAPITLPtriwEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302
|
250
....*....|
gi 313151181 2124 AKSRPLPNMP 2133
Cdd:PHA03307 303 PGSGPAPSSP 312
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1956-2161 |
4.57e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 52.08 E-value: 4.57e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1956 VQRPSDAHTKPRPALAAATTIITCPPSASASTLD-QSKDPGPPRPHRPEATPSMASLGPEGEELarvaegtsFPPQEPrh 2034
Cdd:pfam03154 174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTL--------HPQRLP-- 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2035 SPQVKMAPTSSPAEPHCWPAEAAlgtgAEPTCSQEGKLRPEPRRDGEAQ-EAASETQPLSSPPTAASSKAPSSGSAQ--- 2110
Cdd:pfam03154 244 SPHPPLQPMTQPPPPSQVSPQPL----PQPSLHGQMPPMPHSLQTGPSHmQHPVPPQPFPLTPQSSQSQVPPGPSPAapg 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2111 ---------PPEGHPGKPEPSRakSRPLPNMPkLVIPSAAtkfPPEITVTPPTPTLLSPK 2161
Cdd:pfam03154 320 qsqqrihtpPSQSQLQSQQPPR--EQPLPPAP-LSMPHIK---PPPTTPIPQLPNPQSHK 373
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
99-188 |
4.66e-06 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 47.09 E-value: 4.66e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 99 LAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARhAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCL 178
Cdd:COG3063 1 LYLKLGDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79
|
90
....*....|
gi 313151181 179 YFICKALEKD 188
Cdd:COG3063 80 AYLERALELD 89
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1913-2175 |
6.40e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 51.69 E-value: 6.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1913 GAAAQRQASGDTPTTPKHPKDSRENffpvtvvPTAPDPVPADSVQRPSdahTKPRPALAAATTIiTCPPSASASTLDQSK 1992
Cdd:pfam03154 319 GQSQQRIHTPPSQSQLQSQQPPREQ-------PLPPAPLSMPHIKPPP---TTPIPQLPNPQSH-KHPPHLSGPSPFQMN 387
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRP-----------HRPEATPSMASLGPEGEELARvaegtsfPPQEPRHSPQVKMAPTSSPAEPHcwpaeaalGTG 2061
Cdd:pfam03154 388 SNLPPPPalkplsslsthHPPSAHPPPLQLMPQSQQLPP-------PPAQPPVLTQSQSLPPPAASHPP--------TSG 452
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2062 AEPTCSQEgklrPEPRRDGEAQEAASETQPlSSPPTAASSKAPSSgsaQPPEghpgkpEPSRAKSRPLPNMPKLVIPSAA 2141
Cdd:pfam03154 453 LHQVPSQS----PFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGI---QPPS------SASVSSSGPVPAAVSCPLPPVQ 518
|
250 260 270
....*....|....*....|....*....|....*....
gi 313151181 2142 TKFPP-----EITVTPPTPTLLSPKGSISEETKQKLKSA 2175
Cdd:pfam03154 519 IKEEAldeaeEPESPPPPPRSPSPEPTVVNTPSHASQSA 557
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
1919-2164 |
9.68e-06 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 51.21 E-value: 9.68e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1919 QASGDTPTTPKHPKDSrenffPVTVVP----TAPDPVPADSVQRPSDAHTKPRPaLAAATTIITCP-------PSASAST 1987
Cdd:PHA03379 407 KASEPTYGTPRPPVEK-----PRPEVPqsleTATSHGSAQVPEPPPVHDLEPGP-LHDQHSMAPCPvaqlppgPLQDLEP 480
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1988 LDQskDPGPPRPHRPEATPSMASLGP---EGEELARVAEGTSFPPQEPRHSP-QVKMAPTSSPAEPHC-WPAEAALGTGA 2062
Cdd:PHA03379 481 GDQ--LPGVVQDGRPACAPVPAPAGPivrPWEASLSQVPGVAFAPVMPQPMPvEPVPVPTVALERPVCpAPPLIAMQGPG 558
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2063 EPTCSQEGKLR---------------PEPRRDGEAQ---EAASETQPLSSPP---TAASSKAPSSGSAQPPEG-HPGKPE 2120
Cdd:PHA03379 559 ETSGIVRVRERwrpapwtpnpprspsQMSVRDRLARlraEAQPYQASVEVQPpqlTQVSPQQPMEYPLEPEQQmFPGSPF 638
|
250 260 270 280
....*....|....*....|....*....|....*....|....
gi 313151181 2121 PSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI 2164
Cdd:PHA03379 639 SQVADVMRAGGVPAMQPQYFDLPLQQPISQGAPLAPLRASMGPV 682
|
|
| NlpI |
COG4785 |
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis]; |
29-174 |
1.57e-05 |
|
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 443815 [Multi-domain] Cd Length: 223 Bit Score: 48.37 E-value: 1.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 29 KEAQEAEAFALYHKALDLQKHDRF-----EESAKAYHELLEASLLREAVSSGDEK--EGLKHPGLIlkySTYKNLAQLAA 101
Cdd:COG4785 8 LLLALALAAAAASKAAILLAALLFaavlaLAIALADLALALAAAALAAAALAAERidRALALPDLA---QLYYERGVAYD 84
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 313151181 102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4785 85 SLGDYDLAIADFDQALELDPDLAEAYNNRGLAYLLLGDYDAALEDFDRALELDPDYAYAYLNRGIALYYLGRY 157
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1914-2167 |
1.95e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 50.17 E-value: 1.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTT-PKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPalaaattiitcPPSASASTLDQSK 1992
Cdd:PHA03307 60 AACDRFEPPTGPPPgPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPD-----------PPPPTPPPASPPP 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPH-----RPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAAlGTGAEPTCs 2067
Cdd:PHA03307 129 SPAPDLSEmlrpvGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP-PAAASPRP- 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2068 qegklrpePRRDGEAQEAASETQPlSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPpe 2147
Cdd:PHA03307 207 --------PRRSSPISASASSPAP-APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW-- 275
|
250 260
....*....|....*....|
gi 313151181 2148 iTVTPPTPTLLSPKGSISEE 2167
Cdd:PHA03307 276 -NGPSSRPGPASSSSSPRER 294
|
|
| NrfG |
COG4235 |
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ... |
95-174 |
2.26e-05 |
|
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443378 [Multi-domain] Cd Length: 131 Bit Score: 46.15 E-value: 2.26e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 95 NLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4235 22 LLGRAYLRLGRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1941-2157 |
2.81e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.60 E-value: 2.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1941 VTVVPtAPDPVPADSVQRPSDAHTKPRPALAAATtiitcPPSASASTlDQSKDPGPPRPHRPEATPSMASLGPEGEELAR 2020
Cdd:PRK07764 584 VEAVV-GPAPGAAGGEGPPAPASSGPPEEAARPA-----APAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKH 656
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2021 VAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwpAEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASETQplsSPPTAAS 2100
Cdd:PRK07764 657 VAVPDASDGGDGWPAKAGGAAPAAPPPAP----APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP---QAAQGAS 729
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2101 SKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTL 2157
Cdd:PRK07764 730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1914-2086 |
3.30e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 49.21 E-value: 3.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPaDSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1993
Cdd:PRK07764 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRP------EATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCS 2067
Cdd:PRK07764 701 PAPAPAATPpagqadDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
|
170 180
....*....|....*....|..
gi 313151181 2068 QEGKLRPEPRR---DGEAQEAA 2086
Cdd:PRK07764 781 EEEEMAEDDAPsmdDEDRRDAE 802
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
1914-2133 |
4.08e-05 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 49.08 E-value: 4.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKdsrenffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1993
Cdd:PRK07003 395 AVPAVTAVTGAAGAALAPK-------AAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSMASLGPEGEELA--------------RVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC-WPAEAAL 2058
Cdd:PRK07003 468 AQPPADSGSASAPASDAPPDAAFEPApraaapsaatpaavPDARAPAAASREDAPAAAAPPAPEARPPTPAAaAPAARAG 547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2059 GTGAEPTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSK---------APSSGSAQPPEghPGKPEPSRAKSR-- 2127
Cdd:PRK07003 548 GAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRvavqvptprARAATGDAPPN--GAARAEQAAESRga 625
|
....*...
gi 313151181 2128 --PLPNMP 2133
Cdd:PRK07003 626 ppPWEDIP 633
|
|
| PHA02682 |
PHA02682 |
ORF080 virion core protein; Provisional |
1940-2046 |
4.68e-05 |
|
ORF080 virion core protein; Provisional
Pssm-ID: 177464 [Multi-domain] Cd Length: 280 Bit Score: 47.55 E-value: 4.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1940 PVTVVPTAPDP-VPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHR--PEAT------PSMAS 2010
Cdd:PHA02682 76 PSGQSPLAPSPaCAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPacPPSTrqcppaPPLPT 155
|
90 100 110
....*....|....*....|....*....|....*..
gi 313151181 2011 LGPEGEELARVAEGTSFPPQEPRHS-PQVKMAPTSSP 2046
Cdd:PHA02682 156 PKPAPAAKPIFLHNQLPPPDYPAAScPTIETAPAASP 192
|
|
| TadD |
COG5010 |
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ... |
1-155 |
5.20e-05 |
|
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444034 [Multi-domain] Cd Length: 155 Bit Score: 45.72 E-value: 5.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1 MIRIAALNASSTIEDDHEGSFKSHKTQTKEAQEAEAFALYHKALDLQKhdRFEESAKAYHELLEAsllreavssgdekeg 80
Cdd:COG5010 21 RTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLG--DFEESLALLEQALQL--------------- 83
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 313151181 81 lkHPGlilKYSTYKNLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNP 155
Cdd:COG5010 84 --DPN---NPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
1940-2128 |
5.31e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.63 E-value: 5.31e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1940 PVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAA-----TTIITCPPSASASTLDQSKDPGPpRPHRPEATPSMASLGPE 2014
Cdd:PHA03307 25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGaaacdRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAPASPARE 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2015 GEELARVAEGTSFPPQ-EPRHSPqvkmAPTSSPAEPHCWPAEAALGTGAEPtcsqegklRPEPRRDGEAQEAASETQPLS 2093
Cdd:PHA03307 104 GSPTPPGPSSPDPPPPtPPPASP----PPSPAPDLSEMLRPVGSPGPPPAA--------SPPAAGASPAAVASDAASSRQ 171
|
170 180 190
....*....|....*....|....*....|....*
gi 313151181 2094 SPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP 2128
Cdd:PHA03307 172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP 206
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1980-2193 |
5.93e-05 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 48.38 E-value: 5.93e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1980 PPSASASTldQSKDPGPPRPHRPEATPSMASLGPEGEELAR---VAEGTSF----PPQEPRHSPQVKMAPTSSPAEPHCW 2052
Cdd:PLN03209 329 PPKESDAA--DGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVprpLSPYTAYedlkPPTSPIPTPPSSSPASSKSVDAVAK 406
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2053 PAEAALGTGAEPTCS-QEGKLRPEPR---RDGEAQEAASETQPLSSP-PTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR 2127
Cdd:PLN03209 407 PAEPDVVPSPGSASNvPEVEPAQVEAkktRPLSPYARYEDLKPPTSPsPTAPTGVSPSVSSTSSVPAVPDTAPATAATDA 486
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2128 PLPNMPKlviPSAATKFPPEITVTPPT-PTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQP 2193
Cdd:PLN03209 487 AAPPPAN---MRPLSPYAVYDDLKPPTsPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
1892-2155 |
7.63e-05 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 47.84 E-value: 7.63e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1892 IKQVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvpadsvqrpsdaHTKPRPALA 1971
Cdd:NF033839 248 IDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP------------GMQPSPQPE 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1972 AAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEPRHSPQVKMAPTSSPAEPHC 2051
Cdd:NF033839 307 KK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLETPKPEVKPQPEKPKPEVKP 367
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2052 WPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP---PEGHPGKPE--PSRAKS 2126
Cdd:NF033839 368 QPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPevkPQPEKPKPEvkPQPEKP 438
|
250 260 270
....*....|....*....|....*....|...
gi 313151181 2127 ----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2155
Cdd:NF033839 439 kpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
1925-2196 |
9.08e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 47.75 E-value: 9.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1925 PTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPHRPEA 2004
Cdd:PHA03378 676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA------RPPAAAPGRARPPAAAPGRARPPAA 749
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2005 TPSMA---SLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPA---EPHCWPAEAALGTGAEPTCSQEGKLRPEPRR 2078
Cdd:PHA03378 750 APGRArppAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTpqpPPQAGPTSMQLMPRAAPGQQGPTKQILRQLL 829
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2079 DGEAQEA-ASETQPLSSPPTAASSKAPSSGSA------QPPEGHPGKPEPSRAKSRplPNMPKLVIPSAATKFPPEIT-- 2149
Cdd:PHA03378 830 TGGVKRGrPSLKKPAALERQAAAGPTPSPGSGtsdkivQAPVFYPPVLQPIQVMRQ--LGSVRAAAASTVTQAPTEYTge 907
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 313151181 2150 ---VTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLC---QPALE 2196
Cdd:PHA03378 908 rrgVGPMHPTDIPPSKRAKTDAYVESQPPHGGQSHSFSVIWENVSqgqQQTLE 960
|
|
| SepH |
NF040712 |
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ... |
1993-2133 |
1.69e-04 |
|
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.
Pssm-ID: 468676 [Multi-domain] Cd Length: 346 Bit Score: 46.30 E-value: 1.69e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2066
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2067 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2133
Cdd:NF040712 269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
|
|
| PHA03291 |
PHA03291 |
envelope glycoprotein I; Provisional |
1938-2161 |
2.49e-04 |
|
envelope glycoprotein I; Provisional
Pssm-ID: 223033 [Multi-domain] Cd Length: 401 Bit Score: 45.72 E-value: 2.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1938 FFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiiTCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEgee 2017
Cdd:PHA03291 203 FVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST--TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE--- 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2018 larvaegtsfppqEPRHspQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRdgeaqeaASETQPLSSPPT 2097
Cdd:PHA03291 278 -------------ASRY--ELTVTQIIQIAIPASIIACVFLGSCACCLHRRCRRRRRRPAR-------IYRPPSPVAPSI 335
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2098 AASSKAPSSGSAQPPEGHPGKPePSRAKSRPLPN-MPKLVIPSAATKFP--PEITVTPPTPTLLSPK 2161
Cdd:PHA03291 336 SAVNEAALARLGDELKRHPPES-PRRSKRRSSQTmVPSLTAISEESEAPavVELSRSPRRPGGPTAR 401
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
1893-2130 |
2.58e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 46.62 E-value: 2.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1893 KQVDEEAALEQAVKfcqvhlGAAAQRQA---SGDTPTTPKHPK-DSRENFFPVT--VVPTAPDPVPADSVQRPSDAHTkP 1966
Cdd:PRK10263 270 KRMDDDEEITYTAR------GVAADPDDvlfSGNRATQPEYDEyDPLLNGAPITepVAVAAAATTATQSWAAPVEPVT-Q 342
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1967 RPALAAATTIITCPpsasasTLDQSKDPGPprpHRPEatPSMASlGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSP 2046
Cdd:PRK10263 343 TPPVASVDVPPAQP------TVAWQPVPGP---QTGE--PVIAP-APEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAP 410
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2047 AEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS 2126
Cdd:PRK10263 411 AAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVV 490
|
....
gi 313151181 2127 RPLP 2130
Cdd:PRK10263 491 EPEP 494
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
2003-2160 |
3.18e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 46.02 E-value: 3.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2003 EATPSMASLGPEGEELARVAEGTSFPPqeprhspqvkmAPTSSPAEPHCWPAEAAlgtGAEPTCSQEGKLRPEPRRDGEA 2082
Cdd:PRK12323 371 GAGPATAAAAPVAQPAPAAAAPAAAAP-----------APAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAA 436
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2083 QEAASETQPLSSPPTAASSKAPSsgSAQPPEGHPGKPEPSRAKSRPLPNMPKLViPSAATKFPPEITVTPPTPTLLSP 2160
Cdd:PRK12323 437 RQASARGPGGAPAPAPAPAAAPA--AAARPAAAGPRPVAAAAAAAPARAAPAAA-PAPADDDPPPWEELPPEFASPAP 511
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1958-2184 |
4.03e-04 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 45.64 E-value: 4.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1958 RPSDAHTKPRPALAAATTIITCPPSASASTldqskdPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEP----R 2033
Cdd:PRK12323 364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPA------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaaR 437
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2034 HSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEgkLRPEPRrdgeaqeAASETQPLSSPPTAAsskAPSSGSAQPPE 2113
Cdd:PRK12323 438 QASARGPGGAPAPA-----PAPAAAPAAAARPAAAG--PRPVAA-------AAAAAPARAAPAAAP---APADDDPPPWE 500
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2114 GHPGK-PEPSRAKSRPLPNM--------PKLVIPSAATKFPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAAN 2184
Cdd:PRK12323 501 ELPPEfASPAPAQPDAAPAGwvaesipdPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
|
|
| PHA03381 |
PHA03381 |
tegument protein VP22; Provisional |
1931-2071 |
5.21e-04 |
|
tegument protein VP22; Provisional
Pssm-ID: 177618 [Multi-domain] Cd Length: 290 Bit Score: 44.23 E-value: 5.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1931 PKDSRENFFPVTVVPTAPDPVPAD-SVQRPSDAHTKPRPALAAAT----------TIITCPPSASASTLDQSKDPGPPRP 1999
Cdd:PHA03381 11 PHGTDEVEADVYYDFISPDASPARvSFEEPADRARRGAGQARGRSqaerrfhhydEARADYPYYTGSSSEDERPADPRPS 90
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181 2000 HRPEATPSM----ASLGPEGEELARVAEGTSFPPqEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTCSQEGK 2071
Cdd:PHA03381 91 RRPHAQPEAsgpgPARGARGPAGSRGRGRRAESP-SPRDPPNPKGASAPRGRKSAC-ADSAALLDAPAPAAPKRQK 164
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
92-158 |
6.56e-04 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 40.92 E-value: 6.56e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 92 TYKNLAQLAAQREDLETAMEFyLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHW 158
Cdd:COG3063 28 ALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDYDEALAYLERALELDPSAL 93
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1949-2160 |
8.07e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.93 E-value: 8.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1949 DPVPADSV-QRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGP-------------- 2013
Cdd:PHA03247 2452 DPFFARTIlGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvgepvhp 2531
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2014 ------EG-EELARVAEGTSFPPQEPRHSPQV--KMAPTSSPAePHcwPAEAALGTGAE----PTCSQEGKLRPEPRRDG 2080
Cdd:PHA03247 2532 rmltwiRGlEELASDDAGDPPPPLPPAAPPAApdRSVPPPRPA-PR--PSEPAVTSRARrpdaPPQSARPRAPVDDRGDP 2608
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2081 EAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHP-GKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTP--PTPTL 2157
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPpTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRA 2687
|
...
gi 313151181 2158 LSP 2160
Cdd:PHA03247 2688 ARP 2690
|
|
| PHA03325 |
PHA03325 |
nuclear-egress-membrane-like protein; Provisional |
1983-2152 |
9.49e-04 |
|
nuclear-egress-membrane-like protein; Provisional
Pssm-ID: 223044 Cd Length: 418 Bit Score: 44.10 E-value: 9.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1983 ASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQ-----VKMAPTSSPAEPhcwPAEAA 2057
Cdd:PHA03325 259 SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPppvrrPRVKHPEAGKEE---PDGAR 335
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2058 LGTGAEPTCSQEGKLRPeprrdgeAQEAASETQPLSSPPTAASSKApSSGSAQPPEGHPGKPEPSRAKSRPLPnmpklvi 2137
Cdd:PHA03325 336 NAEAKEPAQPATSTSSK-------GSSSAQNKDSGSTGPGSSLAAA-SSFLEDDDFGSPPLDLTTSLRHMPSP------- 400
|
170
....*....|....*
gi 313151181 2138 PSAATKFPPEITVTP 2152
Cdd:PHA03325 401 SVTSAPEPPSIPLTY 415
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1994-2131 |
1.07e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 44.21 E-value: 1.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSMASLGPEGEelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPtcsqegklr 2073
Cdd:PRK07764 396 AAAPSAAAAAPAAAPAPAAAAPA-----------AAAAPAPAAAPQPAPAPAPAPAPP-SPAGNAPAGGAP--------- 454
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2074 pePRRDGEAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHPGKPEPSRAKSRPLPN 2131
Cdd:PRK07764 455 --SPPPAAAPSAQPAPAPAAAPEPTAAP-APAPPAAPAPAAAPAAPAAPAAPAGADDA 509
|
|
| PLN03209 |
PLN03209 |
translocon at the inner envelope of chloroplast subunit 62; Provisional |
1928-2160 |
1.33e-03 |
|
translocon at the inner envelope of chloroplast subunit 62; Provisional
Pssm-ID: 178748 [Multi-domain] Cd Length: 576 Bit Score: 43.76 E-value: 1.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1928 PKHPkDSRENFFPVTVVPTAPD-PVPADSVQRPSDAHTKPRPA--------LAAATTIITCPPS---ASASTLDQSKDPG 1995
Cdd:PLN03209 330 PKES-DAADGPKPVPTKPVTPEaPSPPIEEEPPQPKAVVPRPLspytayedLKPPTSPIPTPPSsspASSKSVDAVAKPA 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1996 PPRPH-RPEATPSMASLGPEGEELARVAEGTSF-------PPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCS 2067
Cdd:PLN03209 409 EPDVVpSPGSASNVPEVEPAQVEAKKTRPLSPYaryedlkPPTSPSPTAPTGVSPSVSST-----SSVPAVPDTAPATAA 483
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2068 QEGKLRPEPRrdgeaqeaaseTQPLSSPPTAASSKAPSSGSaqppeghPGKPEPSRAKSRPlPNMPKLVIPSAATKFPPE 2147
Cdd:PLN03209 484 TDAAAPPPAN-----------MRPLSPYAVYDDLKPPTSPS-------PAAPVGKVAPSST-NEVVKVGNSAPPTALADE 544
|
250
....*....|...
gi 313151181 2148 ITVTPPTPTLLSP 2160
Cdd:PLN03209 545 QHHAQPKPRPLSP 557
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1915-2157 |
1.35e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.16 E-value: 1.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1915 AAQRQASGDTPT-TPKHPKDSRENFFPV---------TVVPTAPDPVPADSVQRPSDAHTKPRpalaAATTIITCP-PSA 1983
Cdd:PHA03247 270 ETARGATGPPPPpEAAAPNGAAAPPDGVwgaalagapLALPAPPDPPPPAPAGDAEEEDDEDG----AMEVVSPLPrPRQ 345
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1984 SASTldqskdpGPPRPHRPEATP--SMASLGpEGEELARVAEgtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG 2061
Cdd:PHA03247 346 HYPL-------GFPKRRRPTWTPpsSLEDLS-AGRHHPKRAS----LPTRKRRSARHAATPFARGPGGDDQTRPAAPVPA 413
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2062 AEPTCSQEGKLRPEPrrdgeaqeaasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKlVIPSAA 2141
Cdd:PHA03247 414 SVPTPAPTPVPASAP--------------PPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRK-ALDALR 478
|
250
....*....|....*.
gi 313151181 2142 TKFPPEitvtPPTPTL 2157
Cdd:PHA03247 479 ERRPPE----PPGADL 490
|
|
| PBP1 |
COG5180 |
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ... |
1913-2155 |
1.83e-03 |
|
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];
Pssm-ID: 444064 [Multi-domain] Cd Length: 548 Bit Score: 43.51 E-value: 1.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1913 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1988
Cdd:COG5180 152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1989 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2054
Cdd:COG5180 227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2055 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2128
Cdd:COG5180 307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
|
250 260
....*....|....*....|....*..
gi 313151181 2129 LPNMPKLVIPSAATKFPPeiTVTPPTP 2155
Cdd:COG5180 382 APFQPPNGAPQPGLGRRG--APGPPMG 406
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
1994-2112 |
1.86e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 43.44 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSMASLGPEgeelarvAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwPAEAALGTGAEPTCSQEGKLR 2073
Cdd:PRK07764 394 PAAAAPSAAAAAPAAAPAPAA-------AAPAAAAAPAPAAAPQPAPAPAPAPAPP---SPAGNAPAGGAPSPPPAAAPS 463
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 313151181 2074 PEPRR---DGEAQEAASETQPLSSPPTAASSKAPSSGSAQPP 2112
Cdd:PRK07764 464 AQPAPapaAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
|
|
| TPR_12 |
pfam13424 |
Tetratricopeptide repeat; |
36-119 |
1.98e-03 |
|
Tetratricopeptide repeat;
Pssm-ID: 315987 [Multi-domain] Cd Length: 77 Bit Score: 38.91 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424 3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70
|
....
gi 313151181 116 AVML 119
Cdd:pfam13424 71 ALAL 74
|
|
| sucB |
TIGR01347 |
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ... |
2015-2125 |
1.98e-03 |
|
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived and differ in having an extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]
Pssm-ID: 273565 [Multi-domain] Cd Length: 403 Bit Score: 42.80 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2015 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2094
Cdd:TIGR01347 68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
|
90 100 110
....*....|....*....|....*....|.
gi 313151181 2095 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2125
Cdd:TIGR01347 143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
|
|
| Treacle |
pfam03546 |
Treacher Collins syndrome protein Treacle; |
1952-2186 |
2.32e-03 |
|
Treacher Collins syndrome protein Treacle;
Pssm-ID: 460967 [Multi-domain] Cd Length: 531 Bit Score: 43.14 E-value: 2.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1952 PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARvaEGTSFPPQE 2031
Cdd:pfam03546 168 DSESSSEESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSE--ESSDSEEEA 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2032 PRHSPQVKMAPTSSPAEPHCWPAEaalGTGAEPTCSQEGKLR---PEPRRDGEAQEAASetqpLSSPPTAASSKAP---S 2105
Cdd:pfam03546 246 PAAATPAQAKPALKTPQTKASPRK---GTPITPTSAKVPPVRvgtPAPWKAGTVTSPAC----ASSPAVARGAQRPeedS 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2106 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI-----------SEETKQKLKS 2174
Cdd:pfam03546 319 SSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAVAQvkaeaqedsesSEEESDSEEA 398
|
250
....*....|..
gi 313151181 2175 AILSAQSAANVR 2186
Cdd:pfam03546 399 AATPAQVKASGK 410
|
|
| PHA03321 |
PHA03321 |
tegument protein VP11/12; Provisional |
1918-2162 |
2.39e-03 |
|
tegument protein VP11/12; Provisional
Pssm-ID: 223041 [Multi-domain] Cd Length: 694 Bit Score: 43.02 E-value: 2.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1918 RQASGDTPTTPKHPKDSRENFFPVT-----------VVPTAPDPVPAdSVQRPSDAHTK---PRPAlaaattiitcPPSA 1983
Cdd:PHA03321 447 RARPGSTPACARRARAQRARDAGPEyvdplgalrrlPAGAAPPPEPA-AAPSPATYYTRmggGPPR----------LPPR 515
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1984 SASTLDQSKDPGPPRPHRPEATPSmASLGPEGEELARVAEGTSFPPQEPRHSPqvkmAPTSSPaephcwPAEaALGTGAE 2063
Cdd:PHA03321 516 NRATETLRPDWGPPAAAPPEQMED-PYLEPDDDRFDRRDGAAAAATSHPREAP----APDDDP------IYE-GVSDSEE 583
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTCSQegklRPEPR----RDGEAQEAASETQPLSSPptaassKAPSSGSAQPPEGHPGKP--EPSRAKSRPLPnmpklvi 2137
Cdd:PHA03321 584 PVYEE----IPTPRvyqnPLPRPMEGAGEPPDLDAP------TSPWVEEENPIYGWGDSPlfSPPPAARFPPP------- 646
|
250 260
....*....|....*....|....*
gi 313151181 2138 PSAATKFPPEITVTPPTPTLLSPKG 2162
Cdd:PHA03321 647 DPALSPEPPALPAHRPRPGALAPDG 671
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
1895-2129 |
3.22e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.56 E-value: 3.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1895 VDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPkhpkdsrenffpvtvvptAPDPVPADSVQRPSDAHTKPRPALA-AA 1973
Cdd:PRK12323 395 AAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP------------------APEALAAARQASARGPGGAPAPAPApAA 456
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1974 TTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLG-PEGEELarvaegtsfpPQEPrhspqvkmaPTSSPAEPHCW 2052
Cdd:PRK12323 457 APAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpPPWEEL----------PPEF---------ASPAPAQPDAA 517
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2053 PAEAALGTGAEPTCSQEGKLRPEPRrdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKpEPSRAKSRPL 2129
Cdd:PRK12323 518 PAGWVAESIPDPATADPDDAFETLA---PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD-WPALAARLPV 590
|
|
| TPR_21 |
pfam09976 |
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat. |
48-151 |
3.59e-03 |
|
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.
Pssm-ID: 430959 [Multi-domain] Cd Length: 194 Bit Score: 41.03 E-value: 3.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 48 KHDRFEESAKAYHELLEAsllreaVSSGDEKEGL--------KHPGlilkySTYKNLAQL-----AAQREDLETAMEfYL 114
Cdd:pfam09976 32 QRSQAEEASALYQQLLEA------VAAGDAAKAQaaaaqlkdEYGG-----TGYAALAALllakaAVEAGDLAAAKA-QL 99
|
90 100 110
....*....|....*....|....*....|....*...
gi 313151181 115 EAVMLDSTDVNLwykiGHVA-LRLIRIPLARHAFEEGL 151
Cdd:pfam09976 100 EWVADNAKDEAL----KALArLRLARVLLAQGKYDEAL 133
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
34-174 |
4.07e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 41.64 E-value: 4.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 34 AEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLIlkySTYKNLAQLAAQREDLETAMEFY 113
Cdd:COG2956 6 AAALGWYFKGLNYLLNGQPDKAIDLLEEALE-----------------LDPETV---EAHLALGNLYRRRGEYDRAIRIH 65
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 313151181 114 LEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG2956 66 QKLLERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDW 126
|
|
| PRK12727 |
PRK12727 |
flagellar biosynthesis protein FlhF; |
1941-2155 |
4.32e-03 |
|
flagellar biosynthesis protein FlhF;
Pssm-ID: 237182 [Multi-domain] Cd Length: 559 Bit Score: 42.28 E-value: 4.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1941 VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKdpgpprPHRPEATPSMASLGpegeelAR 2020
Cdd:PRK12727 62 TPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMA------LRQPVSVPRQAPAA------AP 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2021 VAEGTSFPPQEPRHSPQVKMapTSSPAEPHCWPAEAALGTGAEPTCSQegklRPEPRRDGEAQEAASETqPLSSPPTAAS 2100
Cdd:PRK12727 130 VRAASIPSPAAQALAHAAAV--RTAPRQEHALSAVPEQLFADFLTTAP----VPRAPVQAPVVAAPAPV-PAIAAALAAH 202
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 313151181 2101 SKAPSSGSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVTPPTP 2155
Cdd:PRK12727 203 AAYAQDDDEQLDDDGFDLDDAL-PQILPPAALPPIVVAPAAPAALAAVAAAAPAP 256
|
|
| PHA03369 |
PHA03369 |
capsid maturational protease; Provisional |
2007-2194 |
4.34e-03 |
|
capsid maturational protease; Provisional
Pssm-ID: 223061 [Multi-domain] Cd Length: 663 Bit Score: 42.29 E-value: 4.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2007 SMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALgTGAEPTCSQEGKLRPEPRRDGEAQEAA 2086
Cdd:PHA03369 349 KTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPM-TAYPPVPQFCGDPGLVSPYNPQSPGTS 427
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2087 SETQPLSS-PPT-AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP-NMPKLVIPSAATKFPPEITVTPPTPTLLSPKGS 2163
Cdd:PHA03369 428 YGPEPVGPvPPQpTNPYVMPISMANMVYPGHPQEHGHERKRKRGGElKEELIETLKLVKKLKEEQESLAKELEATAHKSE 507
|
170 180 190
....*....|....*....|....*....|.
gi 313151181 2164 ISEETKQKLKSAILSAQSAANVRKESLCQPA 2194
Cdd:PHA03369 508 IKKIAESEFKNAGAKTAAANIEPNCSADAAA 538
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
1916-2048 |
4.61e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 42.07 E-value: 4.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1916 AQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA---------- 1985
Cdd:PRK14971 360 AQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdppaavpvnp 439
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181 1986 -STLDQSKDPGPPRPHRPEATPSMASLGPegeelarvaeGTSFPPQEPRHSPQ--VKMAPTSSPAE 2048
Cdd:PRK14971 440 pSTAPQAVRPAQFKEEKKIPVSKVSSLGP----------STLRPIQEKAEQATgnIKEAPTGTQKE 495
|
|
| KLF9_13_N-like |
cd21975 |
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ... |
2000-2143 |
5.35e-03 |
|
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.
Pssm-ID: 409240 [Multi-domain] Cd Length: 163 Bit Score: 40.06 E-value: 5.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2000 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2079
Cdd:cd21975 19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 313151181 2080 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2143
Cdd:cd21975 99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
1998-2189 |
9.51e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 41.00 E-value: 9.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1998 RPHRPEATPSM---ASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEgklRP 2074
Cdd:PRK07994 360 HPAAPLPEPEVppqSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG---AT 436
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2075 EPRRDGEAqeAASETQPLSSPPTAASSKAPssgSAQPPEGHPGKPEPSRAKSR-PLPNMPKLVIPSAATKFPPEITVTPP 2153
Cdd:PRK07994 437 KAKKSEPA--AASRARPVNSALERLASVRP---APSALEKAPAKKEAYRWKATnPVEVKKEPVATPKALKKALEHEKTPE 511
|
170 180 190
....*....|....*....|....*....|....*....
gi 313151181 2154 TPTLLSPKGSISE---ETKQKLKSAILSAQSAANVRKES 2189
Cdd:PRK07994 512 LAAKLAAEAIERDpwaALVSQLGLPGLVEQLALNAWKEE 550
|
|
|