NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|313151181|ref|NP_001186210|]
View 

calcineurin-binding protein cabin-1 isoform a [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MEF2_binding pfam09047
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ...
2156-2190 1.20e-15

MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.


:

Pssm-ID: 370261 [Multi-domain]  Cd Length: 35  Bit Score: 72.19  E-value: 1.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 313151181  2156 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2190
Cdd:pfam09047    1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
30-205 3.26e-11

Tetratricopeptide (TPR) repeat [General function prediction only];


:

Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 65.80  E-value: 3.26e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457    62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
                         170
                  ....*....|....*.
gi 313151181  190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457   142 DDADALYNLGIALEKL 157
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1914-2157 4.72e-11

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 4.72e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1986
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1987 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2063
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2142
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
                         250
                  ....*....|....*
gi 313151181 2143 kfPPEITVTPPTPTL 2157
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
 
Name Accession Description Interval E-value
MEF2_binding pfam09047
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ...
2156-2190 1.20e-15

MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.


Pssm-ID: 370261 [Multi-domain]  Cd Length: 35  Bit Score: 72.19  E-value: 1.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 313151181  2156 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2190
Cdd:pfam09047    1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
MEF2_binding cd13839
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ...
2156-2190 6.83e-14

Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.


Pssm-ID: 260103 [Multi-domain]  Cd Length: 35  Bit Score: 67.41  E-value: 6.83e-14
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 313151181 2156 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2190
Cdd:cd13839     1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
30-205 3.26e-11

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 65.80  E-value: 3.26e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457    62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
                         170
                  ....*....|....*.
gi 313151181  190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457   142 DDADALYNLGIALEKL 157
PHA03247 PHA03247
large tegument protein UL36; Provisional
1914-2157 4.72e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 4.72e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1986
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1987 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2063
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2142
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
                         250
                  ....*....|....*
gi 313151181 2143 kfPPEITVTPPTPTL 2157
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1914-2183 3.60e-10

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 64.98  E-value: 3.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1914 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1989
Cdd:pfam17823  120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1990 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2067
Cdd:pfam17823  199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2068 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2139
Cdd:pfam17823  266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 313151181  2140 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2183
Cdd:pfam17823  346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1892-2155 7.63e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 7.63e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1892 IKQVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvpadsvqrpsdaHTKPRPALA 1971
Cdd:NF033839  248 IDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP------------GMQPSPQPE 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1972 AAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEPRHSPQVKMAPTSSPAEPHC 2051
Cdd:NF033839  307 KK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLETPKPEVKPQPEKPKPEVKP 367
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2052 WPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP---PEGHPGKPE--PSRAKS 2126
Cdd:NF033839  368 QPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPevkPQPEKPKPEvkPQPEKP 438
                         250       260       270
                  ....*....|....*....|....*....|...
gi 313151181 2127 ----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2155
Cdd:NF033839  439 kpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
1993-2133 1.69e-04

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 46.30  E-value: 1.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2066
Cdd:NF040712  189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2067 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2133
Cdd:NF040712  269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1913-2155 1.83e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 43.51  E-value: 1.83e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1913 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1988
Cdd:COG5180   152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1989 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2054
Cdd:COG5180   227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2055 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2128
Cdd:COG5180   307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
                         250       260
                  ....*....|....*....|....*..
gi 313151181 2129 LPNMPKLVIPSAATKFPPeiTVTPPTP 2155
Cdd:COG5180   382 APFQPPNGAPQPGLGRRG--APGPPMG 406
TPR_12 pfam13424
Tetratricopeptide repeat;
36-119 1.98e-03

Tetratricopeptide repeat;


Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 38.91  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181    36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424    3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70

                   ....
gi 313151181   116 AVML 119
Cdd:pfam13424   71 ALAL 74
sucB TIGR01347
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ...
2015-2125 1.98e-03

2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]


Pssm-ID: 273565 [Multi-domain]  Cd Length: 403  Bit Score: 42.80  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2015 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2094
Cdd:TIGR01347   68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
                           90       100       110
                   ....*....|....*....|....*....|.
gi 313151181  2095 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2125
Cdd:TIGR01347  143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
2000-2143 5.35e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 40.06  E-value: 5.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2000 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2079
Cdd:cd21975    19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 313151181 2080 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2143
Cdd:cd21975    99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
 
Name Accession Description Interval E-value
MEF2_binding pfam09047
MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the ...
2156-2190 1.20e-15

MEF2 binding; The myocyte enhancer factor-2 (MEF2) binding domain, predominantly found in the calcineurin-binding protein CABIN 1, adopts an amphipathic alpha-helical structure, which allows it to bind a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription.


Pssm-ID: 370261 [Multi-domain]  Cd Length: 35  Bit Score: 72.19  E-value: 1.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 313151181  2156 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2190
Cdd:pfam09047    1 TLLSPKGSISEETKQKLKNAILSAQSAANVKKDSL 35
MEF2_binding cd13839
Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; ...
2156-2190 6.83e-14

Mycocyte enhancer factor-2 (MEF2) binding domain of the calcineurin-binding protein cabin-1; The myocyte enhancer factor-2 (MEF2) binding domain, as found in the calcineurin-binding protein cabin-1, adopts an amphipathic alpha-helical structure, which allows it to bind to a hydrophobic groove on the MEF2S domain, forming a triple-helical interaction. Interaction of this domain with MEF2 causes repression of transcription. Cabin-1 inhibits calcineurin-mediated signal transduction in T-cell receptor-mediated signalling pathways, by binding to the activated form of calcineurin. Cabin-1 acts as a co-repressor of MEF2, the mycocyte enhancer factor-2, which regulates transcription in a calcium-dependent manner and plays vital roles in T-cell development and function.


Pssm-ID: 260103 [Multi-domain]  Cd Length: 35  Bit Score: 67.41  E-value: 6.83e-14
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 313151181 2156 TLLSPKGSISEETKQKLKSAILSAQSAANVRKESL 2190
Cdd:cd13839     1 TLLSPKGSISEETKQKLKNAILSSQSAANVKKDTL 35
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
30-205 3.26e-11

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 65.80  E-value: 3.26e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG0457     2 ELDPDDAEAYNNLGLAYRRLGRYEEAIEDYEKALE-----------------LDPDD---AEALYNLGLAYLRLGRYEEA 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKDC 189
Cdd:COG0457    62 LADYEQALELDPDDAEALNNLGLALQALGRYEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDP 141
                         170
                  ....*....|....*.
gi 313151181  190 RYSKGLVLKEKIFEEQ 205
Cdd:COG0457   142 DDADALYNLGIALEKL 157
PHA03247 PHA03247
large tegument protein UL36; Provisional
1914-2157 4.72e-11

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 68.81  E-value: 4.72e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKDSRENFFP----VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAS--- 1986
Cdd:PHA03247 2752 GGPARPARPPTTAGPPAPAPPAAPAAGpprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLppp 2831
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1987 TLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFP-PQEPRHSPQVKMA--PTSSPAEPHCWPAEaalgtgaE 2063
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAkPAAPARPPVRRLArpAVSRSTESFALPPD-------Q 2904
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP-LPNMPKLVIPSAAt 2142
Cdd:PHA03247 2905 PERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPgRVAVPRFRVPQPA- 2983
                         250
                  ....*....|....*
gi 313151181 2143 kfPPEITVTPPTPTL 2157
Cdd:PHA03247 2984 --PSREAPASSTPPL 2996
PHA03247 PHA03247
large tegument protein UL36; Provisional
1912-2162 1.25e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 1.25e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1912 LGAAAQRQASGDTPTTPKHPKDSRenffpVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQS 1991
Cdd:PHA03247 2723 PGPAAARQASPALPAAPAPPAVPA-----GPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1992 KdPGPPRPhrpeATPSMASLGPEGEELARVAEGTSFPPqePRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTcsqegk 2071
Cdd:PHA03247 2798 L-PSPWDP----ADPPAAVLAPAAALPPAASPAGPLPP--PTSAQPTAPPPPPGPPPPSL-PLGGSVAPGGDVR------ 2863
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2072 lRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSgSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVT 2151
Cdd:PHA03247 2864 -RRPPSRSPAAKPAAPARPPVRRLARPAVSRSTES-FALPPDQPERPPQPQ-APPPPQPQPQPPPPPQPQPPPPPPPRPQ 2940
                         250
                  ....*....|.
gi 313151181 2152 PPTPTLLSPKG 2162
Cdd:PHA03247 2941 PPLAPTTDPAG 2951
PHA03247 PHA03247
large tegument protein UL36; Provisional
1925-2195 1.36e-10

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 67.27  E-value: 1.36e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1925 PTTPKHPKDSRENFFPVTVVPTAPDPVPADS-VQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPE 2003
Cdd:PHA03247 2629 PSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTP 2708
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2004 ATPSMASLGPEGEELARVAEGTSFPPqeprhsPQVKMAPTSSPAEPHCWPAEAALGTGAEPTcsqeGKLRPEPRRDGEAQ 2083
Cdd:PHA03247 2709 EPAPHALVSATPLPPGPAAARQASPA------LPAAPAPPAVPAGPATPGGPARPARPPTTA----GPPAPAPPAAPAAG 2778
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2084 EAASETQPLSSPPTAASSKAPS-SGSAQPPEGHPGKPEPSRAKSRPLPNMPKlviPSAATKFPPEITVTPPTPTL----- 2157
Cdd:PHA03247 2779 PPRRLTRPAVASLSESRESLPSpWDPADPPAAVLAPAAALPPAASPAGPLPP---PTSAQPTAPPPPPGPPPPSLplggs 2855
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 313151181 2158 LSPKGSISEETKQKLKSAILSAQSAANVRkeSLCQPAL 2195
Cdd:PHA03247 2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVR--RLARPAV 2891
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1920-2186 1.89e-10

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 66.73  E-value: 1.89e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1920 ASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATT--IITCPPSASA----STLDQSKD 1993
Cdd:PHA03307  124 ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPeeTARAPSSPPAepppSTPPAAAS 203
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSM--ASLGPEGEELARVAEGTSFPPQEPRHSP--QVKMAPTSSPAEPHC--WPAEAALGTGAEPTCS 2067
Cdd:PHA03307  204 PRPPRRSSPISASASspAPAPGRSAADDAGASSSDSSSSESSGCGwgPENECPLPRPAPITLptRIWEASGWNGPSSRPG 283
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2068 QEGKLRPEPRRDGEAQEAASETQPLSSPPT----------AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNmpklvi 2137
Cdd:PHA03307  284 PASSSSSPRERSPSPSPSSPGSGPAPSSPRasssssssreSSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP------ 357
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 313151181 2138 PSAATKFPPE-ITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVR 2186
Cdd:PHA03307  358 PPADPSSPRKrPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRF 407
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1914-2183 3.60e-10

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 64.98  E-value: 3.60e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1914 AAAQRQASGDTPTTPKHPKdSRENFFPVTVVPTAPDPV----PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLD 1989
Cdd:pfam17823  120 SSSPSSAAQSLPAAIAALP-SEAFSAPRAAACRANASAapraAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTA 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1990 QSKDPGPPRPHRPEATPSMASLGPE-GEELARVaeGTSFPpqeprhspqvkMAPTSSPAEPHCWPAE-AALGTGAEPTCS 2067
Cdd:pfam17823  199 ASSAPATLTPARGISTAATATGHPAaGTALAAV--GNSSP-----------AAGTVTAAVGTVTPAAlATLAAAAGTVAS 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2068 QEGKLR---PEPRRDGEAQEAASETQPLS-SPPTAASSKAPSS--GSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPS-- 2139
Cdd:pfam17823  266 AAGTINmgdPHARRLSPAKHMPSDTMARNpAAPMGAQAQGPIIqvSTDQPVHNTAGEPTPSPSNTTLEPNTPKSVASTnl 345
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 313151181  2140 -------AATKFPPEITVtPPTPTLLSPKGSISEETKQklKSAILSAQSAA 2183
Cdd:pfam17823  346 avvtttkAQAKEPSASPV-PVLHTSMIPEVEATSPTTQ--PSPLLPTQGAA 393
TPR COG0457
Tetratricopeptide (TPR) repeat [General function prediction only];
34-199 5.85e-10

Tetratricopeptide (TPR) repeat [General function prediction only];


Pssm-ID: 440225 [Multi-domain]  Cd Length: 245  Bit Score: 61.95  E-value: 5.85e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   34 AEAFALYHKALDLQkhdrfEESAKAYHELleASLLREAvssGDEKEGLKH--------PGLIlkySTYKNLAQLAAQRED 105
Cdd:COG0457    25 EEAIEDYEKALELD-----PDDAEALYNL--GLAYLRL---GRYEEALADyeqaleldPDDA---EALNNLGLALQALGR 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  106 LETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKAL 185
Cdd:COG0457    92 YEEALEDYDKALELDPDDAEALYNLGLALLELGRYDEAIEAYERALELDPDDADALYNLGIALEKLGRYEEALELLEKLE 171
                         170
                  ....*....|....
gi 313151181  186 EKDCRYSKGLVLKE 199
Cdd:COG0457   172 AAALAALLAAALGE 185
Spy COG3914
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational ...
30-188 2.25e-09

Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443119 [Multi-domain]  Cd Length: 658  Bit Score: 62.70  E-value: 2.25e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAsllreavssgdekeglkHPGLilkYSTYKNLAQLAAQREDLETA 109
Cdd:COG3914    72 AALLLLAALLELAALLLQALGRYEEALALYRRALAL-----------------NPDN---AEALFNLGNLLLALGRLEEA 131
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 313151181  110 MEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG3914   132 LAALRRALALNPDFAEAYLNLGEALRRLGRLEEAIAALRRALELDPDNAEALNNLGNALQDLGRLEEAIAAYRRALELD 210
PHA03247 PHA03247
large tegument protein UL36; Provisional
1921-2194 6.41e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 6.41e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1921 SGDTPttPKHPKDSRENFFPVTVVPTAPDPVPADSVQRpSDAHTKPRPALAAATTIITCPPSASASTLDQSkdPGPPRPH 2000
Cdd:PHA03247 2548 AGDPP--PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVT-SRARRPDAPPQSARPRAPVDDRGDPRGPAPPS--PLPPDTH 2622
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2001 RPEATPSMASlgPEGEELARVAEGTSFPPQEPRHSPQVK-----------------MAPTSSPAEPHCWPAEAALGTGAE 2063
Cdd:PHA03247 2623 APDPPPPSPS--PAANEPDPHPPPTVPPPERPRDDPAPGrvsrprrarrlgraaqaSSPPQRPRRRAARPTVGSLTSLAD 2700
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTcsqegklrPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKP----EPSRAKSRPLPNMPklviPS 2139
Cdd:PHA03247 2701 PP--------PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPatpgGPARPARPPTTAGP----PA 2768
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 313151181 2140 AAtkfPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPA 2194
Cdd:PHA03247 2769 PA---PPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
30-205 1.02e-08

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 58.59  E-value: 1.02e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   30 EAQEAEAFALYHKALDLQKHDRFEESAKAYHELLEAS---------LLREAVSSGDEKEGLKHPGLILKYS-----TYKN 95
Cdd:COG2956    70 ERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDpddaealrlLAEIYEQEGDWEKAIEVLERLLKLGpenahAYCE 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   96 LAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYT 175
Cdd:COG2956   150 LAELYLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLGDPE 229
                         170       180       190
                  ....*....|....*....|....*....|
gi 313151181  176 TCLYFICKALEKDCRYSKGLVLKEKIFEEQ 205
Cdd:COG2956   230 EALELLRKALELDPSDDLLLALADLLERKE 259
BepA COG4783
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ...
33-188 6.39e-08

Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443813 [Multi-domain]  Cd Length: 139  Bit Score: 53.66  E-value: 6.39e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   33 EAEAFALYHKALDLQKHDRFEESAKAYHELLEASllreavssGDEKEGlkhpglilkystYKNLAQLAAQREDLETAMEF 112
Cdd:COG4783     1 AACAEALYALAQALLLAGDYDEAEALLEKALELD--------PDNPEA------------FALLGEILLQLGDLDEAIVL 60
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181  113 YLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG4783    61 LHEALELDPDEPEARLNLGLALLKAGDYDEALALLEKALKLDPEHPEAYLRLARAYRALGRPDEAIAALEKALELD 136
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1921-2183 1.22e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 57.23  E-value: 1.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1921 SGDTPTTPK-HPKDS-RENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPP- 1997
Cdd:pfam05109  483 SGASPVTPSpSPRDNgTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAv 562
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1998 RPHRPEAT-PSMASLGPEGEELARVAEGTSfpPQEPRHSPQVK-----MAPTSSPAEPHCWP--AEAALGTGAEPTCSQE 2069
Cdd:pfam05109  563 TTPTPNATiPTLGKTSPTSAVTTPTPNATS--PTVGETSPQANttnhtLGGTSSTPVVTSPPknATSAVTTGQHNITSSS 640
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2070 G---KLRPEPRRDG---EAQEAASETQPL--SSPPTAA---SSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMpklviP 2138
Cdd:pfam05109  641 TssmSLRPSSISETlspSTSDNSTSHMPLltSAHPTGGeniTQVTPASTSTHHVSTSSPAPRPGTTSQASGPGN-----S 715
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 313151181  2139 SAATKfPPEITVTPPTPtllsPKGSISEETKQKLKSAILSAQSAA 2183
Cdd:pfam05109  716 STSTK-PGEVNVTKGTP----PKNATSPQAPSGQKTAVPTVTSTG 755
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1945-2184 2.56e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 56.01  E-value: 2.56e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1945 PTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTldqskdpgPPRPHRPEATPSMASLGPEGEELARVAEG 2024
Cdd:PRK07003  374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA--------AAAATRAEAPPAAPAPPATADRGDDAADG 445
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2025 TSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG-AEPTCSQEgklrPEPRRDGEAQEAASETQPLSSPPTAASSKA 2103
Cdd:PRK07003  446 DAPVPAKANARASADSRCDERDAQPPADSGSASAPASdAPPDAAFE----PAPRAAAPSAATPAAVPDARAPAAASREDA 521
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2104 PSSGSAQPPEGHPGKP----EPSRA--------------------KSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLS 2159
Cdd:PRK07003  522 PAAAAPPAPEARPPTPaaaaPAARAggaaaaldvlrnagmrvssdRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRA 601
                         250       260
                  ....*....|....*....|....*
gi 313151181 2160 PKGSiseeTKQKLKSAILSAQSAAN 2184
Cdd:PRK07003  602 RAAT----GDAPPNGAARAEQAAES 622
PHA03378 PHA03378
EBNA-3B; Provisional
1923-2196 1.06e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 54.30  E-value: 1.06e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1923 DTPTTPKHP---KDSRENFFPVTVVPTAP---DPVPADSVQRPSdAHTKPRPALAAATTIITCPPSASAstldQSKDPGP 1996
Cdd:PHA03378  607 EPPTTQSHIpetSAPRQWPMPLRPIPMRPlrmQPITFNVLVFPT-PHQPPQVEITPYKPTWTQIGHIPY----QPSPTGA 681
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1997 PRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPaephcwpaeaalgtgaeptcsqeGKLRPep 2076
Cdd:PHA03378  682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAP-----------------------GRARP-- 736
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2077 rrdgeAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSrpLPNMPKLVIPSAATKFPPeiTVTPPTPT 2156
Cdd:PHA03378  737 -----PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQ--APPAPQQRPRGAPTPQPP--PQAGPTSM 807
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 313151181 2157 LLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQPALE 2196
Cdd:PHA03378  808 QLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKKPAALE 847
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1942-2161 1.26e-06

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 53.68  E-value: 1.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1942 TVVPTAPDPVPADSVQRPSDAHTKPRPAlaaattiitcpPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARV 2021
Cdd:PRK14086   91 SAGEPAPPPPHARRTSEPELPRPGRRPY-----------EGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWP 159
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2022 AEGTSFPPQEPRHSPqvkmaptsSPAEPHCWPAEAAlgTGAEPTCSQEGKLRPE---PRRDGEAQEaasetqPLSSPPTA 2098
Cdd:PRK14086  160 RAADDYGWQQQRLGF--------PPRAPYASPASYA--PEQERDREPYDAGRPEydqRRRDYDHPR------PDWDRPRR 223
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2099 ASSKAP--SSGSAQPPEGHPGKPEPSRAKSRPlpnmpklVIPSAATKFP--PEITVTPPTPTL-LSPK 2161
Cdd:PRK14086  224 DRTDRPepPPGAGHVHRGGPGPPERDDAPVVP-------IRPSAPGPLAaqPAPAPGPGEPTArLNPK 284
PHA03378 PHA03378
EBNA-3B; Provisional
1920-2170 1.31e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.92  E-value: 1.31e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1920 ASGDTPTTP--KHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIitcPPSASAstldqskdPGPP 1997
Cdd:PHA03378  646 LVFPTPHQPpqVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPM---RPPAAP--------PGRA 714
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1998 RPhrPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEGKLRPEPR 2077
Cdd:PHA03378  715 QR--PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPG-----RARPPAAAPGAPTPQPPPQAPPAPQ 787
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2078 RdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRplpNMPKLVIPSAATKFPPEItvtpPTPtl 2157
Cdd:PHA03378  788 Q--RPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR---GRPSLKKPAALERQAAAG----PTP-- 856
                         250
                  ....*....|...
gi 313151181 2158 lSPKGSISEETKQ 2170
Cdd:PHA03378  857 -SPGSGTSDKIVQ 868
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1919-2145 1.40e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 53.73  E-value: 1.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1919 QASGDT-PTTPKHPKDSRENffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA-STLDQSKDPGP 1996
Cdd:PRK12323  367 QSGGGAgPATAAAAPVAQPA--PAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlAAARQASARGP 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1997 PRPHRPEATPSMAslgPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEP 2076
Cdd:PRK12323  445 GGAPAPAPAPAAA---PAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 313151181 2077 RRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRpLPNMPKLVIPSAATKFP 2145
Cdd:PRK12323  522 VAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASG-LPDMFDGDWPALAARLP 589
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1908-2133 1.70e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.45  E-value: 1.70e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1908 CQVHLGAAAQRQASGDTPTTPKHPKDSRENffpvTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASAST 1987
Cdd:PRK07764  582 WQVEAVVGPAPGAAGGEGPPAPASSGPPEE----AARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHV 657
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1988 LDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcWPAEAALGTGAEPTCS 2067
Cdd:PRK07764  658 AVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQ-PPQAAQGASAPSPAAD 736
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181 2068 QEGKLRPEPRRDGEAQEAasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMP 2133
Cdd:PRK07764  737 DPVPLPPEPDDPPDPAGA-----PAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1946-2153 2.48e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 52.96  E-value: 2.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1946 TAPDPVPADSVQRPSDAHTKPRPALAAATTiitcPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGT 2025
Cdd:PRK12323  372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAA----PPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGA 447
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2026 SFPPQEPRHSPQVKMAPTSSPAEPhcwpaEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASEtqpLSSPPTAASSKAPs 2105
Cdd:PRK12323  448 PAPAPAPAAAPAAAARPAAAGPRP-----VAAAAAAAPARAAPAAAPAPADDDPPPWEELPPE---FASPAPAQPDAAP- 518
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 313151181 2106 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPP 2153
Cdd:PRK12323  519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP 566
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
38-188 2.84e-06

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 49.19  E-value: 2.84e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   38 ALYHKALDLQKHDRFEESAKAYHELLEASLLREAVSSGDEKEGLKHPGLILKYSTYKNLAQLAAQREDLETAMEFYLEAV 117
Cdd:COG5010     2 RALEGFDRLPLYLLLLTKLRTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLGDFEESLALLEQAL 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 313151181  118 MLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCLYFICKALEKD 188
Cdd:COG5010    82 QLDPNNPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTS 152
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1913-2133 3.00e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.87  E-value: 3.00e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1913 GAAAQRQASGDTPTTPKHPKDSREnfFPVTVVPTAPDPVPADSVQRPSDahtkprPALAAATtiitcPPSASASTLDQSK 1992
Cdd:PHA03307   76 GTEAPANESRSTPTWSLSTLAPAS--PAREGSPTPPGPSSPDPPPPTPP------PASPPPS-----PAPDLSEMLRPVG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPHRPEATPSMASLGPEGEE-------LARVAEGTSFPPQEPRHSPQVKMAP---TSSPAEPHCWPAEAALGTGA 2062
Cdd:PHA03307  143 SPGPPPAASPPAAGASPAAVASDAAssrqaalPLSSPEETARAPSSPPAEPPPSTPPaaaSPRPPRRSSPISASASSPAP 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2063 EPTCSQEGKLR--------PEPRRDGEAQE-------AASETQP----LSSPPTAASSKAPSSGSAQPPEGHPGKPEPSR 2123
Cdd:PHA03307  223 APGRSAADDAGasssdsssSESSGCGWGPEnecplprPAPITLPtriwEASGWNGPSSRPGPASSSSSPRERSPSPSPSS 302
                         250
                  ....*....|
gi 313151181 2124 AKSRPLPNMP 2133
Cdd:PHA03307  303 PGSGPAPSSP 312
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1956-2161 4.57e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.08  E-value: 4.57e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1956 VQRPSDAHTKPRPALAAATTIITCPPSASASTLD-QSKDPGPPRPHRPEATPSMASLGPEGEELarvaegtsFPPQEPrh 2034
Cdd:pfam03154  174 LQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpQGSPATSQPPNQTQSTAAPHTLIQQTPTL--------HPQRLP-- 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2035 SPQVKMAPTSSPAEPHCWPAEAAlgtgAEPTCSQEGKLRPEPRRDGEAQ-EAASETQPLSSPPTAASSKAPSSGSAQ--- 2110
Cdd:pfam03154  244 SPHPPLQPMTQPPPPSQVSPQPL----PQPSLHGQMPPMPHSLQTGPSHmQHPVPPQPFPLTPQSSQSQVPPGPSPAapg 319
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2111 ---------PPEGHPGKPEPSRakSRPLPNMPkLVIPSAAtkfPPEITVTPPTPTLLSPK 2161
Cdd:pfam03154  320 qsqqrihtpPSQSQLQSQQPPR--EQPLPPAP-LSMPHIK---PPPTTPIPQLPNPQSHK 373
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
99-188 4.66e-06

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 47.09  E-value: 4.66e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   99 LAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARhAFEEGLRCNPDHWPCLDNLITVLYTLSDYTTCL 178
Cdd:COG3063     1 LYLKLGDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAI-ALEKALKLDPNNAEALLNLAELLLELGDYDEAL 79
                          90
                  ....*....|
gi 313151181  179 YFICKALEKD 188
Cdd:COG3063    80 AYLERALELD 89
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1913-2175 6.40e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 6.40e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1913 GAAAQRQASGDTPTTPKHPKDSRENffpvtvvPTAPDPVPADSVQRPSdahTKPRPALAAATTIiTCPPSASASTLDQSK 1992
Cdd:pfam03154  319 GQSQQRIHTPPSQSQLQSQQPPREQ-------PLPPAPLSMPHIKPPP---TTPIPQLPNPQSH-KHPPHLSGPSPFQMN 387
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1993 DPGPPRP-----------HRPEATPSMASLGPEGEELARvaegtsfPPQEPRHSPQVKMAPTSSPAEPHcwpaeaalGTG 2061
Cdd:pfam03154  388 SNLPPPPalkplsslsthHPPSAHPPPLQLMPQSQQLPP-------PPAQPPVLTQSQSLPPPAASHPP--------TSG 452
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2062 AEPTCSQEgklrPEPRRDGEAQEAASETQPlSSPPTAASSKAPSSgsaQPPEghpgkpEPSRAKSRPLPNMPKLVIPSAA 2141
Cdd:pfam03154  453 LHQVPSQS----PFPQHPFVPGGPPPITPP-SGPPTSTSSAMPGI---QPPS------SASVSSSGPVPAAVSCPLPPVQ 518
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 313151181  2142 TKFPP-----EITVTPPTPTLLSPKGSISEETKQKLKSA 2175
Cdd:pfam03154  519 IKEEAldeaeEPESPPPPPRSPSPEPTVVNTPSHASQSA 557
PHA03379 PHA03379
EBNA-3A; Provisional
1919-2164 9.68e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.21  E-value: 9.68e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1919 QASGDTPTTPKHPKDSrenffPVTVVP----TAPDPVPADSVQRPSDAHTKPRPaLAAATTIITCP-------PSASAST 1987
Cdd:PHA03379  407 KASEPTYGTPRPPVEK-----PRPEVPqsleTATSHGSAQVPEPPPVHDLEPGP-LHDQHSMAPCPvaqlppgPLQDLEP 480
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1988 LDQskDPGPPRPHRPEATPSMASLGP---EGEELARVAEGTSFPPQEPRHSP-QVKMAPTSSPAEPHC-WPAEAALGTGA 2062
Cdd:PHA03379  481 GDQ--LPGVVQDGRPACAPVPAPAGPivrPWEASLSQVPGVAFAPVMPQPMPvEPVPVPTVALERPVCpAPPLIAMQGPG 558
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2063 EPTCSQEGKLR---------------PEPRRDGEAQ---EAASETQPLSSPP---TAASSKAPSSGSAQPPEG-HPGKPE 2120
Cdd:PHA03379  559 ETSGIVRVRERwrpapwtpnpprspsQMSVRDRLARlraEAQPYQASVEVQPpqlTQVSPQQPMEYPLEPEQQmFPGSPF 638
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....
gi 313151181 2121 PSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI 2164
Cdd:PHA03379  639 SQVADVMRAGGVPAMQPQYFDLPLQQPISQGAPLAPLRASMGPV 682
NlpI COG4785
Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];
29-174 1.57e-05

Lipoprotein NlpI, contains TPR repeats [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 443815 [Multi-domain]  Cd Length: 223  Bit Score: 48.37  E-value: 1.57e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   29 KEAQEAEAFALYHKALDLQKHDRF-----EESAKAYHELLEASLLREAVSSGDEK--EGLKHPGLIlkySTYKNLAQLAA 101
Cdd:COG4785     8 LLLALALAAAAASKAAILLAALLFaavlaLAIALADLALALAAAALAAAALAAERidRALALPDLA---QLYYERGVAYD 84
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 313151181  102 QREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4785    85 SLGDYDLAIADFDQALELDPDLAEAYNNRGLAYLLLGDYDAALEDFDRALELDPDYAYAYLNRGIALYYLGRY 157
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1914-2167 1.95e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 50.17  E-value: 1.95e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTT-PKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPalaaattiitcPPSASASTLDQSK 1992
Cdd:PHA03307   60 AACDRFEPPTGPPPgPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPD-----------PPPPTPPPASPPP 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPH-----RPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAAlGTGAEPTCs 2067
Cdd:PHA03307  129 SPAPDLSEmlrpvGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTP-PAAASPRP- 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2068 qegklrpePRRDGEAQEAASETQPlSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPpe 2147
Cdd:PHA03307  207 --------PRRSSPISASASSPAP-APGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGW-- 275
                         250       260
                  ....*....|....*....|
gi 313151181 2148 iTVTPPTPTLLSPKGSISEE 2167
Cdd:PHA03307  276 -NGPSSRPGPASSSSSPRER 294
NrfG COG4235
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ...
95-174 2.26e-05

Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 443378 [Multi-domain]  Cd Length: 131  Bit Score: 46.15  E-value: 2.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   95 NLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG4235    22 LLGRAYLRLGRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1941-2157 2.81e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.60  E-value: 2.81e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1941 VTVVPtAPDPVPADSVQRPSDAHTKPRPALAAATtiitcPPSASASTlDQSKDPGPPRPHRPEATPSMASLGPEGEELAR 2020
Cdd:PRK07764  584 VEAVV-GPAPGAAGGEGPPAPASSGPPEEAARPA-----APAAPAAP-AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKH 656
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2021 VAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwpAEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASETQplsSPPTAAS 2100
Cdd:PRK07764  657 VAVPDASDGGDGWPAKAGGAAPAAPPPAP----APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPP---QAAQGAS 729
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2101 SKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTL 2157
Cdd:PRK07764  730 APSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMA 786
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1914-2086 3.30e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 49.21  E-value: 3.30e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPaDSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1993
Cdd:PRK07764  622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVP-DASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRP------EATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCS 2067
Cdd:PRK07764  701 PAPAPAATPpagqadDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
                         170       180
                  ....*....|....*....|..
gi 313151181 2068 QEGKLRPEPRR---DGEAQEAA 2086
Cdd:PRK07764  781 EEEEMAEDDAPsmdDEDRRDAE 802
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1914-2133 4.08e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 49.08  E-value: 4.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1914 AAAQRQASGDTPTTPKHPKdsrenffPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKD 1993
Cdd:PRK07003  395 AVPAVTAVTGAAGAALAPK-------AAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERD 467
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSMASLGPEGEELA--------------RVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC-WPAEAAL 2058
Cdd:PRK07003  468 AQPPADSGSASAPASDAPPDAAFEPApraaapsaatpaavPDARAPAAASREDAPAAAAPPAPEARPPTPAAaAPAARAG 547
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2059 GTGAEPTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSK---------APSSGSAQPPEghPGKPEPSRAKSR-- 2127
Cdd:PRK07003  548 GAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRvavqvptprARAATGDAPPN--GAARAEQAAESRga 625

                  ....*...
gi 313151181 2128 --PLPNMP 2133
Cdd:PRK07003  626 ppPWEDIP 633
PHA02682 PHA02682
ORF080 virion core protein; Provisional
1940-2046 4.68e-05

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 47.55  E-value: 4.68e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1940 PVTVVPTAPDP-VPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHR--PEAT------PSMAS 2010
Cdd:PHA02682   76 PSGQSPLAPSPaCAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPacPPSTrqcppaPPLPT 155
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 313151181 2011 LGPEGEELARVAEGTSFPPQEPRHS-PQVKMAPTSSP 2046
Cdd:PHA02682  156 PKPAPAAKPIFLHNQLPPPDYPAAScPTIETAPAASP 192
TadD COG5010
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, ...
1-155 5.20e-05

Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444034 [Multi-domain]  Cd Length: 155  Bit Score: 45.72  E-value: 5.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181    1 MIRIAALNASSTIEDDHEGSFKSHKTQTKEAQEAEAFALYHKALDLQKhdRFEESAKAYHELLEAsllreavssgdekeg 80
Cdd:COG5010    21 RTLVEKYEAALAGANNTKEDELAAAGRDKLAKAFAIESPSDNLYNKLG--DFEESLALLEQALQL--------------- 83
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 313151181   81 lkHPGlilKYSTYKNLAQLAAQREDLETAMEFYLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNP 155
Cdd:COG5010    84 --DPN---NPELYYNLALLYSRSGDKDEAKEYYEKALALSPDNPNAYSNLAALLLSLGQDDEAKAALQRALGTSP 153
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1940-2128 5.31e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.63  E-value: 5.31e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1940 PVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAA-----TTIITCPPSASASTLDQSKDPGPpRPHRPEATPSMASLGPE 2014
Cdd:PHA03307   25 PATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGaaacdRFEPPTGPPPGPGTEAPANESRS-TPTWSLSTLAPASPARE 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2015 GEELARVAEGTSFPPQ-EPRHSPqvkmAPTSSPAEPHCWPAEAALGTGAEPtcsqegklRPEPRRDGEAQEAASETQPLS 2093
Cdd:PHA03307  104 GSPTPPGPSSPDPPPPtPPPASP----PPSPAPDLSEMLRPVGSPGPPPAA--------SPPAAGASPAAVASDAASSRQ 171
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 313151181 2094 SPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRP 2128
Cdd:PHA03307  172 AALPLSSPEETARAPSSPPAEPPPSTPPAAASPRP 206
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
1980-2193 5.93e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 48.38  E-value: 5.93e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1980 PPSASASTldQSKDPGPPRPHRPEATPSMASLGPEGEELAR---VAEGTSF----PPQEPRHSPQVKMAPTSSPAEPHCW 2052
Cdd:PLN03209  329 PPKESDAA--DGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVprpLSPYTAYedlkPPTSPIPTPPSSSPASSKSVDAVAK 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2053 PAEAALGTGAEPTCS-QEGKLRPEPR---RDGEAQEAASETQPLSSP-PTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR 2127
Cdd:PLN03209  407 PAEPDVVPSPGSASNvPEVEPAQVEAkktRPLSPYARYEDLKPPTSPsPTAPTGVSPSVSSTSSVPAVPDTAPATAATDA 486
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2128 PLPNMPKlviPSAATKFPPEITVTPPT-PTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLCQP 2193
Cdd:PLN03209  487 AAPPPAN---MRPLSPYAVYDDLKPPTsPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1892-2155 7.63e-05

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 47.84  E-value: 7.63e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1892 IKQVDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPKHPKDSRenffpvtvvPTAPDPvpadsvqrpsdaHTKPRPALA 1971
Cdd:NF033839  248 IDNVNTKVEIENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKK---------PSAPKP------------GMQPSPQPE 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1972 AAttiitcPPSASASTLDQSKDPGPPRPhRPEATPSmaslgPEGEElarvaegTSFPPQEPRHSPQVKMAPTSSPAEPHC 2051
Cdd:NF033839  307 KK------EVKPEPETPKPEVKPQLEKP-KPEVKPQ-----PEKPK-------PEVKPQLETPKPEVKPQPEKPKPEVKP 367
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2052 WPAEAALGTGAEPTcsqegklRPEPRRDGEAQEAASETQPlsSPPTAASSKAPSSGSAQP---PEGHPGKPE--PSRAKS 2126
Cdd:NF033839  368 QPEKPKPEVKPQPE-------TPKPEVKPQPEKPKPEVKP--QPEKPKPEVKPQPEKPKPevkPQPEKPKPEvkPQPEKP 438
                         250       260       270
                  ....*....|....*....|....*....|...
gi 313151181 2127 ----RPLPNMPKLVIPSAATKFPPEITVTPPTP 2155
Cdd:NF033839  439 kpevKPQPEKPKPEVKPQPETPKPEVKPQPEKP 471
PHA03378 PHA03378
EBNA-3B; Provisional
1925-2196 9.08e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 47.75  E-value: 9.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1925 PTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAAttiitcPPSASASTLDQSKDPGPPRPHRPEA 2004
Cdd:PHA03378  676 PSPTGANTMLPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRA------RPPAAAPGRARPPAAAPGRARPPAA 749
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2005 TPSMA---SLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPA---EPHCWPAEAALGTGAEPTCSQEGKLRPEPRR 2078
Cdd:PHA03378  750 APGRArppAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRGAPTpqpPPQAGPTSMQLMPRAAPGQQGPTKQILRQLL 829
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2079 DGEAQEA-ASETQPLSSPPTAASSKAPSSGSA------QPPEGHPGKPEPSRAKSRplPNMPKLVIPSAATKFPPEIT-- 2149
Cdd:PHA03378  830 TGGVKRGrPSLKKPAALERQAAAGPTPSPGSGtsdkivQAPVFYPPVLQPIQVMRQ--LGSVRAAAASTVTQAPTEYTge 907
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 313151181 2150 ---VTPPTPTLLSPKGSISEETKQKLKSAILSAQSAANVRKESLC---QPALE 2196
Cdd:PHA03378  908 rrgVGPMHPTDIPPSKRAKTDAYVESQPPHGGQSHSFSVIWENVSqgqQQTLE 960
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
1993-2133 1.69e-04

septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains a N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 46.30  E-value: 1.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1993 DPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHC------WPAEAALGTGAEPTC 2066
Cdd:NF040712  189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRrragveQPEDEPVGPGAAPAA 268
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2067 SQEGKLRPEPRRdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSR-PLPNMP 2133
Cdd:NF040712  269 EPDEATRDAGEP--PAPGAAETPEAAEPPAPAPAAPAAPAAPEAEEPARPEPPPAPKPKRRrRRASVP 334
PHA03291 PHA03291
envelope glycoprotein I; Provisional
1938-2161 2.49e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 45.72  E-value: 2.49e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1938 FFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiiTCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEgee 2017
Cdd:PHA03291  203 FVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPST--TIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPE--- 277
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2018 larvaegtsfppqEPRHspQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRdgeaqeaASETQPLSSPPT 2097
Cdd:PHA03291  278 -------------ASRY--ELTVTQIIQIAIPASIIACVFLGSCACCLHRRCRRRRRRPAR-------IYRPPSPVAPSI 335
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2098 AASSKAPSSGSAQPPEGHPGKPePSRAKSRPLPN-MPKLVIPSAATKFP--PEITVTPPTPTLLSPK 2161
Cdd:PHA03291  336 SAVNEAALARLGDELKRHPPES-PRRSKRRSSQTmVPSLTAISEESEAPavVELSRSPRRPGGPTAR 401
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1893-2130 2.58e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.62  E-value: 2.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1893 KQVDEEAALEQAVKfcqvhlGAAAQRQA---SGDTPTTPKHPK-DSRENFFPVT--VVPTAPDPVPADSVQRPSDAHTkP 1966
Cdd:PRK10263  270 KRMDDDEEITYTAR------GVAADPDDvlfSGNRATQPEYDEyDPLLNGAPITepVAVAAAATTATQSWAAPVEPVT-Q 342
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1967 RPALAAATTIITCPpsasasTLDQSKDPGPprpHRPEatPSMASlGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSP 2046
Cdd:PRK10263  343 TPPVASVDVPPAQP------TVAWQPVPGP---QTGE--PVIAP-APEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAP 410
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2047 AEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRDGEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS 2126
Cdd:PRK10263  411 AAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVV 490

                  ....
gi 313151181 2127 RPLP 2130
Cdd:PRK10263  491 EPEP 494
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
2003-2160 3.18e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.02  E-value: 3.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2003 EATPSMASLGPEGEELARVAEGTSFPPqeprhspqvkmAPTSSPAEPHCWPAEAAlgtGAEPTCSQEGKLRPEPRRDGEA 2082
Cdd:PRK12323  371 GAGPATAAAAPVAQPAPAAAAPAAAAP-----------APAAPPAAPAAAPAAAA---AARAVAAAPARRSPAPEALAAA 436
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2083 QEAASETQPLSSPPTAASSKAPSsgSAQPPEGHPGKPEPSRAKSRPLPNMPKLViPSAATKFPPEITVTPPTPTLLSP 2160
Cdd:PRK12323  437 RQASARGPGGAPAPAPAPAAAPA--AAARPAAAGPRPVAAAAAAAPARAAPAAA-PAPADDDPPPWEELPPEFASPAP 511
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1958-2184 4.03e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 45.64  E-value: 4.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1958 RPSDAHTKPRPALAAATTIITCPPSASASTldqskdPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEP----R 2033
Cdd:PRK12323  364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPA------AAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaaR 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2034 HSPQVKMAPTSSPAephcwPAEAALGTGAEPTCSQEgkLRPEPRrdgeaqeAASETQPLSSPPTAAsskAPSSGSAQPPE 2113
Cdd:PRK12323  438 QASARGPGGAPAPA-----PAPAAAPAAAARPAAAG--PRPVAA-------AAAAAPARAAPAAAP---APADDDPPPWE 500
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2114 GHPGK-PEPSRAKSRPLPNM--------PKLVIPSAATKFPPEITVTPPTPTLLSPKGSISEETKQKLKSAILSAQSAAN 2184
Cdd:PRK12323  501 ELPPEfASPAPAQPDAAPAGwvaesipdPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD 580
PHA03381 PHA03381
tegument protein VP22; Provisional
1931-2071 5.21e-04

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 44.23  E-value: 5.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1931 PKDSRENFFPVTVVPTAPDPVPAD-SVQRPSDAHTKPRPALAAAT----------TIITCPPSASASTLDQSKDPGPPRP 1999
Cdd:PHA03381   11 PHGTDEVEADVYYDFISPDASPARvSFEEPADRARRGAGQARGRSqaerrfhhydEARADYPYYTGSSSEDERPADPRPS 90
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181 2000 HRPEATPSM----ASLGPEGEELARVAEGTSFPPqEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPTCSQEGK 2071
Cdd:PHA03381   91 RRPHAQPEAsgpgPARGARGPAGSRGRGRRAESP-SPRDPPNPKGASAPRGRKSAC-ADSAALLDAPAPAAPKRQK 164
PilF COG3063
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
92-158 6.56e-04

Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];


Pssm-ID: 442297 [Multi-domain]  Cd Length: 94  Bit Score: 40.92  E-value: 6.56e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181   92 TYKNLAQLAAQREDLETAMEFyLEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHW 158
Cdd:COG3063    28 ALNNLGLLLLEQGRYDEAIAL-EKALKLDPNNAEALLNLAELLLELGDYDEALAYLERALELDPSAL 93
PHA03247 PHA03247
large tegument protein UL36; Provisional
1949-2160 8.07e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 8.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1949 DPVPADSV-QRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGP-------------- 2013
Cdd:PHA03247 2452 DPFFARTIlGAPFSLSLLLGELFPGAPVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPailpdepvgepvhp 2531
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2014 ------EG-EELARVAEGTSFPPQEPRHSPQV--KMAPTSSPAePHcwPAEAALGTGAE----PTCSQEGKLRPEPRRDG 2080
Cdd:PHA03247 2532 rmltwiRGlEELASDDAGDPPPPLPPAAPPAApdRSVPPPRPA-PR--PSEPAVTSRARrpdaPPQSARPRAPVDDRGDP 2608
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2081 EAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHP-GKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTP--PTPTL 2157
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPpTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqrPRRRA 2687

                  ...
gi 313151181 2158 LSP 2160
Cdd:PHA03247 2688 ARP 2690
PHA03325 PHA03325
nuclear-egress-membrane-like protein; Provisional
1983-2152 9.49e-04

nuclear-egress-membrane-like protein; Provisional


Pssm-ID: 223044  Cd Length: 418  Bit Score: 44.10  E-value: 9.49e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1983 ASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQ-----VKMAPTSSPAEPhcwPAEAA 2057
Cdd:PHA03325  259 SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPppvrrPRVKHPEAGKEE---PDGAR 335
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2058 LGTGAEPTCSQEGKLRPeprrdgeAQEAASETQPLSSPPTAASSKApSSGSAQPPEGHPGKPEPSRAKSRPLPnmpklvi 2137
Cdd:PHA03325  336 NAEAKEPAQPATSTSSK-------GSSSAQNKDSGSTGPGSSLAAA-SSFLEDDDFGSPPLDLTTSLRHMPSP------- 400
                         170
                  ....*....|....*
gi 313151181 2138 PSAATKFPPEITVTP 2152
Cdd:PHA03325  401 SVTSAPEPPSIPLTY 415
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1994-2131 1.07e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.21  E-value: 1.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSMASLGPEGEelarvaegtsfPPQEPRHSPQVKMAPTSSPAEPHCwPAEAALGTGAEPtcsqegklr 2073
Cdd:PRK07764  396 AAAPSAAAAAPAAAPAPAAAAPA-----------AAAAPAPAAAPQPAPAPAPAPAPP-SPAGNAPAGGAP--------- 454
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 313151181 2074 pePRRDGEAQEAASETQPLSSPPTAASSkAPSSGSAQPPEGHPGKPEPSRAKSRPLPN 2131
Cdd:PRK07764  455 --SPPPAAAPSAQPAPAPAAAPEPTAAP-APAPPAAPAPAAAPAAPAAPAAPAGADDA 509
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
1928-2160 1.33e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.76  E-value: 1.33e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1928 PKHPkDSRENFFPVTVVPTAPD-PVPADSVQRPSDAHTKPRPA--------LAAATTIITCPPS---ASASTLDQSKDPG 1995
Cdd:PLN03209  330 PKES-DAADGPKPVPTKPVTPEaPSPPIEEEPPQPKAVVPRPLspytayedLKPPTSPIPTPPSsspASSKSVDAVAKPA 408
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1996 PPRPH-RPEATPSMASLGPEGEELARVAEGTSF-------PPQEPRHSPQVKMAPTSSPAephcwPAEAALGTGAEPTCS 2067
Cdd:PLN03209  409 EPDVVpSPGSASNVPEVEPAQVEAKKTRPLSPYaryedlkPPTSPSPTAPTGVSPSVSST-----SSVPAVPDTAPATAA 483
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2068 QEGKLRPEPRrdgeaqeaaseTQPLSSPPTAASSKAPSSGSaqppeghPGKPEPSRAKSRPlPNMPKLVIPSAATKFPPE 2147
Cdd:PLN03209  484 TDAAAPPPAN-----------MRPLSPYAVYDDLKPPTSPS-------PAAPVGKVAPSST-NEVVKVGNSAPPTALADE 544
                         250
                  ....*....|...
gi 313151181 2148 ITVTPPTPTLLSP 2160
Cdd:PLN03209  545 QHHAQPKPRPLSP 557
PHA03247 PHA03247
large tegument protein UL36; Provisional
1915-2157 1.35e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 1.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1915 AAQRQASGDTPT-TPKHPKDSRENFFPV---------TVVPTAPDPVPADSVQRPSDAHTKPRpalaAATTIITCP-PSA 1983
Cdd:PHA03247  270 ETARGATGPPPPpEAAAPNGAAAPPDGVwgaalagapLALPAPPDPPPPAPAGDAEEEDDEDG----AMEVVSPLPrPRQ 345
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1984 SASTldqskdpGPPRPHRPEATP--SMASLGpEGEELARVAEgtsfPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTG 2061
Cdd:PHA03247  346 HYPL-------GFPKRRRPTWTPpsSLEDLS-AGRHHPKRAS----LPTRKRRSARHAATPFARGPGGDDQTRPAAPVPA 413
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2062 AEPTCSQEGKLRPEPrrdgeaqeaasetqPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLPNMPKlVIPSAA 2141
Cdd:PHA03247  414 SVPTPAPTPVPASAP--------------PPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRK-ALDALR 478
                         250
                  ....*....|....*.
gi 313151181 2142 TKFPPEitvtPPTPTL 2157
Cdd:PHA03247  479 ERRPPE----PPGADL 490
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
1913-2155 1.83e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 43.51  E-value: 1.83e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1913 GAAAQRQASGDTPTTPKH----PKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTiitcpPSASASTL 1988
Cdd:COG5180   152 AALLQRSDPILAKDPDGDsastLPPPAEKLDKVLTEPRDALKDSPEKLDRPKVEVKDEAQEEPPDLT-----GGADHPRP 226
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1989 DQSKDPGPPRPHRPEATPSMASLGPEGEEL-------ARVAEGTSFPPQEPRHSPQ-------VKMAPTSSPAEPHCWPA 2054
Cdd:COG5180   227 EAASSPKVDPPSTSEARSRPATVDAQPEMRppadakeRRRAAIGDTPAAEPPGLPVleagsepQSDAPEAETARPIDVKG 306
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2055 EAALGTGAEPTCSQEGKLRPEPRRDGEAQEaasetQPLSSPPTAASSKAPSSGSAQPPEGHPGKPEPSRAKS------RP 2128
Cdd:COG5180   307 VASAPPATRPVRPPGGARDPGTPRPGQPTE-----RPAGVPEAASDAGQPPSAYPPAEEAVPGKPLEQGAPRpgssggDG 381
                         250       260
                  ....*....|....*....|....*..
gi 313151181 2129 LPNMPKLVIPSAATKFPPeiTVTPPTP 2155
Cdd:COG5180   382 APFQPPNGAPQPGLGRRG--APGPPMG 406
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1994-2112 1.86e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 1.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1994 PGPPRPHRPEATPSMASLGPEgeelarvAEGTSFPPQEPRHSPQVKMAPTSSPAEPhcwPAEAALGTGAEPTCSQEGKLR 2073
Cdd:PRK07764  394 PAAAAPSAAAAAPAAAPAPAA-------AAPAAAAAPAPAAAPQPAPAPAPAPAPP---SPAGNAPAGGAPSPPPAAAPS 463
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 313151181 2074 PEPRR---DGEAQEAASETQPLSSPPTAASSKAPSSGSAQPP 2112
Cdd:PRK07764  464 AQPAPapaAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
TPR_12 pfam13424
Tetratricopeptide repeat;
36-119 1.98e-03

Tetratricopeptide repeat;


Pssm-ID: 315987 [Multi-domain]  Cd Length: 77  Bit Score: 38.91  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181    36 AFALYHKALDLQKHDRFEESAKAYHELLEaslLREAVSSGDekeglkHPGLILkysTYKNLAQLAAQREDLETAMEFYLE 115
Cdd:pfam13424    3 ATALNNLAAVLRRLGRYDEALELLEKALE---IARRLLGPD------HPLTAT---TLLNLGRLYLELGRYEEALELLER 70

                   ....
gi 313151181   116 AVML 119
Cdd:pfam13424   71 ALAL 74
sucB TIGR01347
2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This ...
2015-2125 1.98e-03

2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component); This model describes the TCA cycle 2-oxoglutarate system E2 component, dihydrolipoamide succinyltransferase. It is closely related to the pyruvate dehydrogenase E2 component, dihydrolipoamide acetyltransferase. The seed for this model includes mitochondrial and Gram-negative bacterial forms. Mycobacterial candidates are highly derived, differ in having and extra copy of the lipoyl-binding domain at the N-terminus. They score below the trusted cutoff, but above the noise cutoff and above all examples of dihydrolipoamide acetyltransferase. [Energy metabolism, TCA cycle]


Pssm-ID: 273565 [Multi-domain]  Cd Length: 403  Bit Score: 42.80  E-value: 1.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2015 GEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHcwPAEAALGTGAEPTCSQEGKlrpEPRRDGEAQEAASETQPLSS 2094
Cdd:TIGR01347   68 GQVLAILEEGNDATAAPPAKSGEEKEETPAASAAAA--PTAAANRPSLSPAARRLAK---EHGIDLSAVPGTGVTGRVTK 142
                           90       100       110
                   ....*....|....*....|....*....|.
gi 313151181  2095 PPTAASSKAPSsgSAQPPEGHPGKPEPSRAK 2125
Cdd:TIGR01347  143 EDIIKKTEAPA--SAQPPAAAAAAAAPAAAT 171
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
1952-2186 2.32e-03

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 43.14  E-value: 2.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  1952 PADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLGPEGEELARvaEGTSFPPQE 2031
Cdd:pfam03546  168 DSESSSEESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSE--ESSDSEEEA 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2032 PRHSPQVKMAPTSSPAEPHCWPAEaalGTGAEPTCSQEGKLR---PEPRRDGEAQEAASetqpLSSPPTAASSKAP---S 2105
Cdd:pfam03546  246 PAAATPAQAKPALKTPQTKASPRK---GTPITPTSAKVPPVRvgtPAPWKAGTVTSPAC----ASSPAVARGAQRPeedS 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181  2106 SGSAQPPEGHPGKPEPSRAKSRPLPNMPKLVIPSAATKFPPEITVTPPTPTLLSPKGSI-----------SEETKQKLKS 2174
Cdd:pfam03546  319 SSSEESESEEETAPAAAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVPPGKTGPAVAQvkaeaqedsesSEEESDSEEA 398
                          250
                   ....*....|..
gi 313151181  2175 AILSAQSAANVR 2186
Cdd:pfam03546  399 AATPAQVKASGK 410
PHA03321 PHA03321
tegument protein VP11/12; Provisional
1918-2162 2.39e-03

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 43.02  E-value: 2.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1918 RQASGDTPTTPKHPKDSRENFFPVT-----------VVPTAPDPVPAdSVQRPSDAHTK---PRPAlaaattiitcPPSA 1983
Cdd:PHA03321  447 RARPGSTPACARRARAQRARDAGPEyvdplgalrrlPAGAAPPPEPA-AAPSPATYYTRmggGPPR----------LPPR 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1984 SASTLDQSKDPGPPRPHRPEATPSmASLGPEGEELARVAEGTSFPPQEPRHSPqvkmAPTSSPaephcwPAEaALGTGAE 2063
Cdd:PHA03321  516 NRATETLRPDWGPPAAAPPEQMED-PYLEPDDDRFDRRDGAAAAATSHPREAP----APDDDP------IYE-GVSDSEE 583
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2064 PTCSQegklRPEPR----RDGEAQEAASETQPLSSPptaassKAPSSGSAQPPEGHPGKP--EPSRAKSRPLPnmpklvi 2137
Cdd:PHA03321  584 PVYEE----IPTPRvyqnPLPRPMEGAGEPPDLDAP------TSPWVEEENPIYGWGDSPlfSPPPAARFPPP------- 646
                         250       260
                  ....*....|....*....|....*
gi 313151181 2138 PSAATKFPPEITVTPPTPTLLSPKG 2162
Cdd:PHA03321  647 DPALSPEPPALPAHRPRPGALAPDG 671
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1895-2129 3.22e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.56  E-value: 3.22e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1895 VDEEAALEQAVKFCQVHLGAAAQRQASGDTPTTPkhpkdsrenffpvtvvptAPDPVPADSVQRPSDAHTKPRPALA-AA 1973
Cdd:PRK12323  395 AAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSP------------------APEALAAARQASARGPGGAPAPAPApAA 456
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1974 TTIITCPPSASASTLDQSKDPGPPRPHRPEATPSMASLG-PEGEELarvaegtsfpPQEPrhspqvkmaPTSSPAEPHCW 2052
Cdd:PRK12323  457 APAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDpPPWEEL----------PPEF---------ASPAPAQPDAA 517
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 313151181 2053 PAEAALGTGAEPTCSQEGKLRPEPRrdgEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGKpEPSRAKSRPL 2129
Cdd:PRK12323  518 PAGWVAESIPDPATADPDDAFETLA---PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGD-WPALAARLPV 590
TPR_21 pfam09976
Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.
48-151 3.59e-03

Tetratricopeptide repeat-like domain; This family resembles a single unit of a TPR repeat.


Pssm-ID: 430959 [Multi-domain]  Cd Length: 194  Bit Score: 41.03  E-value: 3.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181    48 KHDRFEESAKAYHELLEAsllreaVSSGDEKEGL--------KHPGlilkySTYKNLAQL-----AAQREDLETAMEfYL 114
Cdd:pfam09976   32 QRSQAEEASALYQQLLEA------VAAGDAAKAQaaaaqlkdEYGG-----TGYAALAALllakaAVEAGDLAAAKA-QL 99
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 313151181   115 EAVMLDSTDVNLwykiGHVA-LRLIRIPLARHAFEEGL 151
Cdd:pfam09976  100 EWVADNAKDEAL----KALArLRLARVLLAQGKYDEAL 133
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
34-174 4.07e-03

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 41.64  E-value: 4.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181   34 AEAFALYHKALDLQKHDRFEESAKAYHELLEasllreavssgdekeglKHPGLIlkySTYKNLAQLAAQREDLETAMEFY 113
Cdd:COG2956     6 AAALGWYFKGLNYLLNGQPDKAIDLLEEALE-----------------LDPETV---EAHLALGNLYRRRGEYDRAIRIH 65
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 313151181  114 LEAVMLDSTDVNLWYKIGHVALRLIRIPLARHAFEEGLRCNPDHWPCLDNLITVLYTLSDY 174
Cdd:COG2956    66 QKLLERDPDRAEALLELAQDYLKAGLLDRAEELLEKLLELDPDDAEALRLLAEIYEQEGDW 126
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
1941-2155 4.32e-03

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 42.28  E-value: 4.32e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1941 VTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASASTLDQSKdpgpprPHRPEATPSMASLGpegeelAR 2020
Cdd:PRK12727   62 TPATAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMA------LRQPVSVPRQAPAA------AP 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2021 VAEGTSFPPQEPRHSPQVKMapTSSPAEPHCWPAEAALGTGAEPTCSQegklRPEPRRDGEAQEAASETqPLSSPPTAAS 2100
Cdd:PRK12727  130 VRAASIPSPAAQALAHAAAV--RTAPRQEHALSAVPEQLFADFLTTAP----VPRAPVQAPVVAAPAPV-PAIAAALAAH 202
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 313151181 2101 SKAPSSGSAQPPEGHPGKPEPSrAKSRPLPNMPKLVIPSAATKFPPEITVTPPTP 2155
Cdd:PRK12727  203 AAYAQDDDEQLDDDGFDLDDAL-PQILPPAALPPIVVAPAAPAALAAVAAAAPAP 256
PHA03369 PHA03369
capsid maturational protease; Provisional
2007-2194 4.34e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 42.29  E-value: 4.34e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2007 SMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALgTGAEPTCSQEGKLRPEPRRDGEAQEAA 2086
Cdd:PHA03369  349 KTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPM-TAYPPVPQFCGDPGLVSPYNPQSPGTS 427
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2087 SETQPLSS-PPT-AASSKAPSSGSAQPPEGHPGKPEPSRAKSRPLP-NMPKLVIPSAATKFPPEITVTPPTPTLLSPKGS 2163
Cdd:PHA03369  428 YGPEPVGPvPPQpTNPYVMPISMANMVYPGHPQEHGHERKRKRGGElKEELIETLKLVKKLKEEQESLAKELEATAHKSE 507
                         170       180       190
                  ....*....|....*....|....*....|.
gi 313151181 2164 ISEETKQKLKSAILSAQSAANVRKESLCQPA 2194
Cdd:PHA03369  508 IKKIAESEFKNAGAKTAAANIEPNCSADAAA 538
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1916-2048 4.61e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.07  E-value: 4.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1916 AQRQASGDTPTTPKHPKDSRENFFPVTVVPTAPDPVPADSVQRPSDAHTKPRPALAAATTIITCPPSASA---------- 1985
Cdd:PRK14971  360 AQLTQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdppaavpvnp 439
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 313151181 1986 -STLDQSKDPGPPRPHRPEATPSMASLGPegeelarvaeGTSFPPQEPRHSPQ--VKMAPTSSPAE 2048
Cdd:PRK14971  440 pSTAPQAVRPAQFKEEKKIPVSKVSSLGP----------STLRPIQEKAEQATgnIKEAPTGTQKE 495
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
2000-2143 5.35e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 40.06  E-value: 5.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2000 HRPEATPSMASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEGKLRPEPRRD 2079
Cdd:cd21975    19 HGVRPDPEGAGLAAGLDVRATREVAKGPGPPGPAWKPDGADSPGLVTAAPHLLAANVLAPLRGPSVEGSSLESGDADMGS 98
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 313151181 2080 GEAQEAASETQPLSSPPTAASSKAPSSGSAQPPEGHPGkPEPSRAKSRPLPNMPKLVIPSAATK 2143
Cdd:cd21975    99 DSDVAPASGAAASTSPESSSDAASSPSPLSLLHPGEAG-LEPERPRPRVRRGVRRRGVTPAAKR 161
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
1998-2189 9.51e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.00  E-value: 9.51e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 1998 RPHRPEATPSM---ASLGPEGEELARVAEGTSFPPQEPRHSPQVKMAPTSSPAEPHCWPAEAALGTGAEPTCSQEgklRP 2074
Cdd:PRK07994  360 HPAAPLPEPEVppqSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG---AT 436
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 313151181 2075 EPRRDGEAqeAASETQPLSSPPTAASSKAPssgSAQPPEGHPGKPEPSRAKSR-PLPNMPKLVIPSAATKFPPEITVTPP 2153
Cdd:PRK07994  437 KAKKSEPA--AASRARPVNSALERLASVRP---APSALEKAPAKKEAYRWKATnPVEVKKEPVATPKALKKALEHEKTPE 511
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 313151181 2154 TPTLLSPKGSISE---ETKQKLKSAILSAQSAANVRKES 2189
Cdd:PRK07994  512 LAAKLAAEAIERDpwaALVSQLGLPGLVEQLALNAWKEE 550
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH