Conserved domains on  [gi|755526783|ref|XP_011246688|]

polycystin-1-like protein 3 isoform X11 [Mus musculus]

List of domain hits

Name Accession Description Interval E-value
PLAT_polycystin cd01752
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane ...
1151-1269 1.16e-56

PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of multi-domain membrane proteins found in fish, invertebrates, and mammals, including humans. They are widely expressed in various cell types, and their biological functions remain poorly defined. In humans, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to cause autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238850  Cd Length: 120  Bit Score: 191.72  E-value: 1.16e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783 1151 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1230
Cdd:cd01752     2 LYLVTVFTGWRRGAGTTAKVTITLYGAEGESEPHHLRDPEKPIFERGSVDSFLLTTPFPLGELQSIRLWHDNSGLSPSWY 81
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 755526783 1231 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1269
Cdd:cd01752    82 LSRVIVRDLQTGKKWFFLCNDWLSVEEGDGTVERTFPVA 120
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
325-652 2.09e-21

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 101.53  E-value: 2.09e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   325 PASSSPpqVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSP-PQGTSETPASNSP-PQGTSET 402
Cdd:pfam05109  461 PASTGP--TVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPtPNATSPTPAVTTPtPNATSPT 538
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   403 PGFSSPPQ-VTTATLVSSSP-PQVTSETPASSSPT--------QVTSETPASSSPTqVTSDTPASNSPPQ---GTSDTPG 469
Cdd:pfam05109  539 LGKTSPTSaVTTPTPNATSPtPAVTTPTPNATIPTlgktsptsAVTTPTPNATSPT-VGETSPQANTTNHtlgGTSSTPV 617
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASS 549
Cdd:pfam05109  618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTS 697
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   550 SPPQVTSETPASSSPTNmtSDTPASSSPTNMTSDTPasssPTNMTS-DTPASSSPPWPVITEVTRPESTIPAGRSLANIT 628
Cdd:pfam05109  698 SPAPRPGTTSQASGPGN--SSTSTKPGEVNVTKGTP----PKNATSpQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHG 771
                          330       340
                   ....*....|....*....|....*.
gi 755526783   629 SKAQED--SPLGVISTHPQMSFQSST 652
Cdd:pfam05109  772 ARTSTEptTDYGGDSTTPRTRYNATT 797
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
37-142 3.21e-07

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. 
Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 50.31  E-value: 3.21e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   37 SCYQLNRLFCDFQEADNYCHAQRGRLAHTWNPKLRGFLKSFL---NEETVW-------------WVRGNLTLPGSHPGIN 100
Cdd:cd00037     1 SCYKFSTEKLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLkksSSSDVWiglndlssegtwkWSDGSPLVDYTNWAPG 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 755526783  101 QTGGDDvlrnqkPGECpsvVTHSNAVFSRWN--LCIEKHHFICQ 142
Cdd:cd00037    81 EPNPGG------SEDC---VVLSSSSDGKWNdvSCSSKLPFICE 115
GPS pfam01825
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
1041-1079 7.48e-07

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs and is the site of autoproteolysis, hence the name GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. Most, if not all, cell-adhesion GPCRs undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GAIN (GPCR-Autoproteolysis INducing) domain. The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from Tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


Pssm-ID: 460350  Cd Length: 44  Bit Score: 46.92  E-value: 7.48e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 755526783  1041 QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS 1079
Cdd:pfam01825    2 QCVFWDFTNSTtgrWSTEGCTTVSLND-THTVCSCNHLTSFA 42
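
The pairwise rows above can be summarized numerically. The following is a minimal Python sketch, not part of the CD-Search output: it assumes that uppercase letters mark residues aligned to the domain model, while lowercase letters and '-' mark unaligned residues and gaps. The function name aligned_identity is hypothetical.

```python
from typing import Tuple


def aligned_identity(query_row: str, model_row: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for one alignment row pair.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase residue; lowercase residues and '-' gaps are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# Sequence portions of the GPS (pfam01825) rows shown above.
query = "QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS"
model = "QCVFWDFTNSTtgrWSTEGCTTVSLND-THTVCSCNHLTSFA"
identical, aligned = aligned_identity(query, model)
print(f"{identical}/{aligned} aligned columns identical")
```

For the GPS rows this counts 17 identical residues over 38 aligned columns. Multi-block alignments would need their rows concatenated before calling the function.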
Chi1 super family cl43877
Chitinase [Carbohydrate transport and metabolism];
224-422 1.08e-05


The actual alignment was detected with superfamily member COG3469:

Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.75  E-value: 1.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:COG3469    12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  304 PPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS--SSPPQGTSDTPASSS 381
Cdd:COG3469    92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSgtETATGGTTTTSTTTT 171
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 755526783  382 PPQGTSETPASNSPPQGTSETPGFSSPP-QVTTATLVSSSPP 422
Cdd:COG3469   172 TTSASTTPSATTTATATTASGATTPSATtTATTTGPPTPGLP 213
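
Each hit above carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value). As a minimal sketch for readers who want to tabulate these values, the Python snippet below parses that line with a regular expression. The names HitStats and parse_stats are hypothetical, and the pattern is inferred only from the lines shown on this page, not from any NCBI format specification.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern inferred from lines such as:
#   Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 101.53  E-value: 2.09e-21
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<length>\d+)"
    r"\s+Bit Score:\s*(?P<bits>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_stats(line: str) -> Optional[HitStats]:
    """Parse one statistics line; return None if the line does not match."""
    match = _STATS_RE.search(line)
    if match is None:
        return None
    return HitStats(
        pssm_id=int(match["pssm"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


stats = parse_stats(
    "Pssm-ID: 238850  Cd Length: 120  Bit Score: 191.72  E-value: 1.16e-56"
)
print(stats)
```

Sorting the parsed records by e_value would reproduce the ordering used in the hit lists.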
 
Name Accession Description Interval E-value
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
1152-1267 3.97e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows that this domain binds to procolipase (pfam01114), which mediates membrane association. It therefore appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.58  E-value: 3.97e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  1152 YLIQVYTGYRRRAATTAKVVITLYGSEGHS--EPHHLCDPEktvFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSW 1229
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESaqLEITLDNPD---FERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEW 77
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 755526783  1230 YVSQVIV-SDMTTRKKWHFQCNCWLAVDLGNcERDRVFT 1267
Cdd:pfam01477   78 FLKSITVeVPGETGGKYTFPCNSWVYGSKKY-KETRVFF 115
PHA03247 PHA03247
large tegument protein UL36; Provisional
139-661 4.34e-21


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 101.17  E-value: 4.34e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  139 FICQAAAFPPQGASIWRNEFGPGPllPMKRRGAETERHMI------PGNGPPLAMCHQPAPPELFETLCFPIDPASSAPP 212
Cdd:PHA03247 2454 FFARTILGAPFSLSLLLGELFPGA--PVYRRPAEARFPFAagaapdPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHP 2531
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  213 kathRMtITSLTGRPQVTSDTLASSSPPQGTSDTPA----SSSPPQVTSATSASSSPPQGTS-DTPASSSPPQVTSATSA 287
Cdd:PHA03247 2532 ----RM-LTWIRGLEELASDDAGDPPPPLPPAAPPAapdrSVPPPRPAPRPSEPAVTSRARRpDAPPQSARPRAPVDDRG 2606
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  288 SSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSP----PQGTSDTPASSSPPQGTLD- 362
Cdd:PHA03247 2607 DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrARRLGRAAQASSPPQRPRRr 2686
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  363 ---------TPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSP-----PQVTSET 428
Cdd:PHA03247 2687 aarptvgslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParparPPTTAGP 2766
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSP----PQVTSDTPASSSPPQVTS 504
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPagplPPPTSAQPTAPPPPPGPP 2846
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  505 DTP-------------ASSSPPQVTSETPASSSPPQVTSDTSASISPP----------QVISDTPASSSPPQVTSETPAS 561
Cdd:PHA03247 2847 PPSlplggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRStesfalppdqPERPPQPQAPPPPQPQPQPPPP 2926
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  562 SSPTNMTSDTPASSSPTNMTSDTPASSSPtnmtsdtpaSSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVIS 641
Cdd:PHA03247 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAGEP---------SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLT 2997
                         570       580
                  ....*....|....*....|
gi 755526783  642 THPQMSFQSSTSQQALDETA 661
Cdd:PHA03247 2998 GHSLSRVSSWASSLALHEET 3017
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
373-609 8.70e-14


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 75.94  E-value: 8.70e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  373 TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:COG3469     1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  453 DTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPpqVTSDT 532
Cdd:COG3469    81 TATAAAAAA---------TSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSA--GSTTT 149
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755526783  533 SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSsptnmtsdTPASSSPPWPVIT 609
Cdd:COG3469   150 TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT--------GPPTPGLPKHVLV 218
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
1151-1256 3.20e-12


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 64.20  E-value: 3.20e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   1151 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNsgDSPSWY 1230
Cdd:smart00308    2 KYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDYLFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEH--RHPEWF 79
                            90       100
                    ....*....|....*....|....*.
gi 755526783   1231 VSQVIVSDMTTRKKWHFQCNCWLAVD 1256
Cdd:smart00308   80 LKSITVKDLPTGGKYHFPCNSWVYPD 105
2A1904 TIGR00927
K+-dependent Na+/Ca2+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
347-677 9.74e-09

K+-dependent Na+/Ca2+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 60.39  E-value: 9.74e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   347 TSDTPASSSPPQGTLDTPSSSSPPQGTS------DTPassSPPQGTSE----TPASNSPPQGT------SETPgfSSPPQ 410
Cdd:TIGR00927   76 SSDPPKSSSEMEGEMLAPQATVGRDEATpsiameNTP---SPPRRTAKitptTPKNNYSPTAAgtervkEDTP--ATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   411 VTTATLVSSSPPQVTSETPA------SSSPTQVTSE----TPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTAT 480
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKvrkyTPSPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITAT 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   481 --LVSSSPPQ----VTSDTP----ASSSPPQVTSDTPASS-SPPQVTSETPASSSPPQVTSDTSA---------SISPPQ 540
Cdd:TIGR00927  231 ykMLETNPSKrtagKTTPTPlkgmTDNTPTFLTREVETDLlTSPRSVVEKNTLTTPRRVESNSSTnhwglvgknNLTTPQ 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   541 --VISDTPASSSPpQVTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTsdtpaSSSPPWPVITEVTRPESTI 618
Cdd:TIGR00927  311 gtVLEHTPATSEG-QVTISIMTGSSPA----ETKASTAAWKIRNPLSRTSAPAVRI-----ASATFRGLEKNPSTAPSTP 380
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783   619 PAGRSLANITSKAQE---DSPLGVISTHPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEF 677
Cdd:TIGR00927  381 ATPRVRAVLTTQVHHcvvVKPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDLHPKAEY 442
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
240-589 1.52e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.99  E-value: 1.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  240 PQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQ 319
Cdd:NF033609  558 PEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSA 637
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  320 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGT 399
Cdd:NF033609  638 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 717
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  400 SETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTT 478
Cdd:NF033609  718 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD 797
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:NF033609  798 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNS 877
                         330       340       350
                  ....*....|....*....|....*....|..
gi 755526783  559 PASSSPTNMTSDTPASSSPTNMT-SDTPASSS 589
Cdd:NF033609  878 PKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
270-603 8.18e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 8.18e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  270 SDT-PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTS 348
Cdd:NF033609  561 SDSdPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 640
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  349 DTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609  641 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 720
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PA-SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:NF033609  721 DSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 800
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:NF033609  801 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKN 880
                         330
                  ....*....|....*.
gi 755526783  588 SSPTNMTSDTPASSSP 603
Cdd:NF033609  881 GTNASNKNEAKDSKEP 896
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
1040-1079 9.86e-06

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 43.92  E-value: 9.86e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 755526783   1040 TQCYFWDRYNRTWKSDGCQVGPKS-TIlkTQCLCDHLTFFS 1079
Cdd:smart00303    3 PICVFWDESSGEWSTRGCELLETNgTH--TTCSCNHLTTFA 41
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
224-422 1.08e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.75  E-value: 1.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:COG3469    12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  304 PPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS--SSPPQGTSDTPASSS 381
Cdd:COG3469    92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSgtETATGGTTTTSTTTT 171
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 755526783  382 PPQGTSETPASNSPPQGTSETPGFSSPP-QVTTATLVSSSPP 422
Cdd:COG3469   172 TTSASTTPSATTTATATTASGATTPSATtTATTTGPPTPGLP 213
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
349-601 1.39e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, which are shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall-anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.39e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  349 DTPASSSPPqgtlDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:NF033609  540 DKPVVPEQP----DEPGEIEPiPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASD 615
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  428 TPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:NF033609  616 SDSASDSDSASDSDSASDSDSASDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 692
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASS-SPTNMTSDTPA 586
Cdd:NF033609  693 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDS 772
                         250
                  ....*....|....*.
gi 755526783  587 -SSSPTNMTSDTPASS 601
Cdd:NF033609  773 dSDSDSDSDSDSDSDS 788
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
439-631 1.92e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, which are shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall-anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.52  E-value: 1.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  439 SETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQvtSDTPASSSPPQVTSE 518
Cdd:NF033609   33 SSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQ--QETTQSASTNATTEE 110
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  519 TPASSSPPQVTSDTSASISPPQViSDTPASSSPPQVTSETpaSSSPTNMTSDTpasSSPTN--------MTSDTPASSSP 590
Cdd:NF033609  111 TPVTGEATTTATNQANTPATTQS-SNTNAEELVNQTSNET--TSNDTNTVSSV---NSPQNstnaenvsTTQDTSTEATP 184
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 755526783  591 TNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKA 631
Cdd:NF033609  185 SNNESAPQSTDASNKDVVNQAVNTSAPRMRAFSLAAVAADA 225
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
231-576 1.78e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, which are shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall-anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 1.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  231 SDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSA 310
Cdd:NF033609  571 SDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDS 650
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  311 TSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETP 390
Cdd:NF033609  651 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 730
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  391 ASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPG 469
Cdd:NF033609  731 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDS 810
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  470 FSSPTQVTTATLVSSSPPQVTSDTPASSSppqvtSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPqvisDTPASS 549
Cdd:NF033609  811 DSDSDSDSDSDSDSDSDSDSDSDSDSDSD-----SDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP----NSPKNG 881
                         330       340
                  ....*....|....*....|....*...
gi 755526783  550 SPPQVTSETPASSSPTNMT-SDTPASSS 576
Cdd:NF033609  882 TNASNKNEAKDSKEPLPDTgSEDEANTS 909
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
204-410 5.80e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 5.80e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  204 IDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvts 283
Cdd:PRK07764  588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD--- 664
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  284 atSASSSPPQGTSDTPASSSPPQVTsatsasssppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDT 363
Cdd:PRK07764  665 --GGDGWPAKAGGAAPAAPPPAPAP-----------AAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 755526783  364 PSSSS---PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQ 410
Cdd:PRK07764  732 SPAADdpvPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
555-603 8.08e-04

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates endometrial decidualization by transcriptionally repressing Nur77 expression; decidualization is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide, R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 42.30  E-value: 8.08e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 755526783  555 TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:cd21441    65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVP 113
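As an illustration of the consensus motif NNRCRCCYY quoted in the KLF12 description above, the following Python sketch expands the IUPAC codes (N = any nucleotide, R = A/G, Y = C/T) into a regular expression and scans a DNA string for strict matches. The function names and the test sequence are hypothetical and are not part of the CDD record. Because the description calls the consensus loose, a strict matcher like this one is only a rough approximation of binding-site recognition.

```python
import re

# IUPAC codes as defined in the description: N = any nucleotide, R = A/G, Y = C/T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "N": "[ACGT]"}


def motif_to_regex(motif):
    """Translate an IUPAC motif string into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in motif.upper()))


def find_motif(motif, sequence):
    """Return 0-based start positions of all matches, including overlapping ones."""
    # A lookahead lets the scan report matches that overlap each other.
    lookahead = re.compile("(?=" + motif_to_regex(motif).pattern + ")")
    return [m.start() for m in lookahead.finditer(sequence.upper())]


# Hypothetical test sequence, written by hand to contain a single strict match.
example = "TTCCACACCCTGG"
print(find_motif("NNRCRCCYY", example))  # prints [2]
```

Only the forward strand is scanned here; a fuller search would also check the reverse complement.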
CLECT smart00034
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ...
35-142 3.80e-03

C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.


Pssm-ID: 214480 [Multi-domain]  Cd Length: 124  Bit Score: 38.73  E-value: 3.80e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783     35 GNSCYQLNRLFCDFQEADNYCHAQRGRLA-------HTWnpkLRGFLKSFLNEETVW------WVRGNLTlpgshpGINQ 101
Cdd:smart00034    9 GGKCYKFSTEKKTWEDAQAFCQSLGGHLAsihseaeNDF---VASLLKNSGSSDYYWiglsdpDSNGSWQ------WSDG 79
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*..
gi 755526783    102 TGGDDVLrNQKPGEcPSVVTHSNAVFS----RWNL--CIEKHHFICQ 142
Cdd:smart00034   80 SGPVSYS-NWAPGE-PNNSSGDCVVLStsggKWNDvsCTSKLPFVCE 124
 
PLAT_repeat cd01756
PLAT/LH2 domain repeats of family of proteins with unknown function. In general, PLAT/LH2 ...
1152-1269 3.73e-29

PLAT/LH2 domain repeats of a family of proteins with unknown function. In general, PLAT/LH2 consists of an eight-stranded beta-barrel, and its proposed function is to mediate interaction with lipids or membrane-bound proteins.


Pssm-ID: 238854  Cd Length: 120  Bit Score: 113.03  E-value: 3.73e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783 1152 YLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTV-FERGALDVFLLSTGSwLGDLHGLRLWHDNSGDSPSWY 1230
Cdd:cd01756     3 YEVTVKTGDVKGAGTDANVFITLYGENGDTGKRKLKKSNNKNkFERGQTDKFTVEAVD-LGKLKKIRIGHDNSGLGAGWF 81
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 755526783 1231 VSQVIVSDMTTRKKWHFQCNCWLAVDLGNCERDRVFTPA 1269
Cdd:cd01756    82 LDKVEIREPGTGDEYTFPCNRWLDKDEDDGQIVRELYPS 120
PLAT pfam01477
PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. ...
1152-1267 3.97e-23

PLAT/LH2 domain; This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows that this domain binds to procolipase (pfam01114), which mediates membrane association. It therefore appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.


Pssm-ID: 396180  Cd Length: 115  Bit Score: 95.58  E-value: 3.97e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  1152 YLIQVYTGYRRRAATTAKVVITLYGSEGHS--EPHHLCDPEktvFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSW 1229
Cdd:pfam01477    1 YQVKVVTGDELGAGTDADVYISLYGKVGESaqLEITLDNPD---FERGAEDSFEIDTDWDVGAILKINLHWDNNGLSDEW 77
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 755526783  1230 YVSQVIV-SDMTTRKKWHFQCNCWLAVDLGNcERDRVFT 1267
Cdd:pfam01477   78 FLKSITVeVPGETGGKYTFPCNSWVYGSKKY-KETRVFF 115
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
325-652 2.09e-21

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 101.53  E-value: 2.09e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   325 PASSSPpqVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSP-PQGTSETPASNSP-PQGTSET 402
Cdd:pfam05109  461 PASTGP--TVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPtPNATSPTPAVTTPtPNATSPT 538
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   403 PGFSSPPQ-VTTATLVSSSP-PQVTSETPASSSPT--------QVTSETPASSSPTqVTSDTPASNSPPQ---GTSDTPG 469
Cdd:pfam05109  539 LGKTSPTSaVTTPTPNATSPtPAVTTPTPNATIPTlgktsptsAVTTPTPNATSPT-VGETSPQANTTNHtlgGTSSTPV 617
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASS 549
Cdd:pfam05109  618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTS 697
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   550 SPPQVTSETPASSSPTNmtSDTPASSSPTNMTSDTPasssPTNMTS-DTPASSSPPWPVITEVTRPESTIPAGRSLANIT 628
Cdd:pfam05109  698 SPAPRPGTTSQASGPGN--SSTSTKPGEVNVTKGTP----PKNATSpQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHG 771
                          330       340
                   ....*....|....*....|....*.
gi 755526783   629 SKAQED--SPLGVISTHPQMSFQSST 652
Cdd:pfam05109  772 ARTSTEptTDYGGDSTTPRTRYNATT 797
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
348-697 2.37e-21

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 101.15  E-value: 2.37e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   348 SDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSP-PQGTSETPGFSSP-PQVTTATLVSSSPPQ-V 424
Cdd:pfam05109  469 STADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPtPNATSPTPAVTTPtPNATSPTLGKTSPTSaV 548
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   425 TSETPASSSPT-QVTSETPASSSPT-QVTSDTPASNSP-PQGTSDTPGFSSPTQVTTATLVSSsppqvTSDTPASSSPPQ 501
Cdd:pfam05109  549 TTPTPNATSPTpAVTTPTPNATIPTlGKTSPTSAVTTPtPNATSPTVGETSPQANTTNHTLGG-----TSSTPVVTSPPK 623
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   502 VTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSP-----PQVTSETPASSSPTNMTSDTPASSS 576
Cdd:pfam05109  624 NATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAhptggENITQVTPASTSTHHVSTSSPAPRP 703
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   577 PTNMTSDTPASSSPT------NMTSDTPA--SSSPPWPVITEVTRPESTIPAGRslANITSKAQEDSPLGV-ISTHPQMS 647
Cdd:pfam05109  704 GTTSQASGPGNSSTStkpgevNVTKGTPPknATSPQAPSGQKTAVPTVTSTGGK--ANSTTGGKHTTGHGArTSTEPTTD 781
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 755526783   648 FQSSTSQQALDETAGERVPTIPDFQAHSEFQKACAILQRLRDFLPTSPTS 697
Cdd:pfam05109  782 YGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTS 831
PHA03247 PHA03247
large tegument protein UL36; Provisional
139-661 4.34e-21

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 101.17  E-value: 4.34e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  139 FICQAAAFPPQGASIWRNEFGPGPllPMKRRGAETERHMI------PGNGPPLAMCHQPAPPELFETLCFPIDPASSAPP 212
Cdd:PHA03247 2454 FFARTILGAPFSLSLLLGELFPGA--PVYRRPAEARFPFAagaapdPGGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHP 2531
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  213 kathRMtITSLTGRPQVTSDTLASSSPPQGTSDTPA----SSSPPQVTSATSASSSPPQGTS-DTPASSSPPQVTSATSA 287
Cdd:PHA03247 2532 ----RM-LTWIRGLEELASDDAGDPPPPLPPAAPPAapdrSVPPPRPAPRPSEPAVTSRARRpDAPPQSARPRAPVDDRG 2606
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  288 SSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSP----PQGTSDTPASSSPPQGTLD- 362
Cdd:PHA03247 2607 DPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSrprrARRLGRAAQASSPPQRPRRr 2686
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  363 ---------TPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSP-----PQVTSET 428
Cdd:PHA03247 2687 aarptvgslTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGParparPPTTAGP 2766
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSP----PQVTSDTPASSSPPQVTS 504
Cdd:PHA03247 2767 PAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPagplPPPTSAQPTAPPPPPGPP 2846
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  505 DTP-------------ASSSPPQVTSETPASSSPPQVTSDTSASISPP----------QVISDTPASSSPPQVTSETPAS 561
Cdd:PHA03247 2847 PPSlplggsvapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRStesfalppdqPERPPQPQAPPPPQPQPQPPPP 2926
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  562 SSPTNMTSDTPASSSPTNMTSDTPASSSPtnmtsdtpaSSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVIS 641
Cdd:PHA03247 2927 PQPQPPPPPPPRPQPPLAPTTDPAGAGEP---------SGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLT 2997
                         570       580
                  ....*....|....*....|
gi 755526783  642 THPQMSFQSSTSQQALDETA 661
Cdd:PHA03247 2998 GHSLSRVSSWASSLALHEET 3017
PLAT cd00113
PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. ...
1151-1253 3.79e-19

PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology 2) domain. It consists of an eight-stranded beta-barrel. The domain can be found in various domain architectures, as in lipoxygenases, alpha toxin, lipases and polycystin, but also as a single domain or as repeats. The putative function of this domain is to facilitate access to sequestered membrane or micelle bound substrates.


Pssm-ID: 238061  Cd Length: 116  Bit Score: 84.31  E-value: 3.79e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783 1151 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHhLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWY 1230
Cdd:cd00113     2 RYTVTIKTGDKKGAGTDSNISLALYGENGNSSDI-PILDGPGSFERGSTDTFQIDLKLDIGDITKVYLRRDGSGLSDGWY 80
                          90       100
                  ....*....|....*....|...
gi 755526783 1231 VSQVIVSDMTTRKKWHFQCNCWL 1253
Cdd:cd00113    81 CESITVQALGTKKVYTFPVNRWV 103
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
192-604 9.50e-19

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 92.93  E-value: 9.50e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  192 APPELFETLCFPIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPqvtsATSASSSPPQGTSD 271
Cdd:PHA03307   53 VTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSP----DPPPPTPPPASPPP 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  272 TPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQG-TSDTPASSSPPQVTSATSASSSPPQGTSdt 350
Cdd:PHA03307  129 SPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSsPEETARAPSSPPAEPPPSTPPAAASPRP-- 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  351 PASSSPPQGTLDTPSSSSPPQGTSDTPASSSppqGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPA 430
Cdd:PHA03307  207 PRRSSPISASASSPAPAPGRSAADDAGASSS---DSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPG 283
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  431 SSSPtqVTSETPASSSPTQVTSDTPASNSPPQGTSDtpGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASS 510
Cdd:PHA03307  284 PASS--SSSPRERSPSPSPSSPGSGPAPSSPRASSS--SSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPP 359
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  511 SPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 590
Cdd:PHA03307  360 ADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP 439
                         410
                  ....*....|....
gi 755526783  591 tnmtsdTPASSSPP 604
Cdd:PHA03307  440 ------WPGSPPPP 447
PHA03247 PHA03247
large tegument protein UL36; Provisional
142-629 3.40e-18

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 91.54  E-value: 3.40e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  142 QAAAFPPQGASIWRNEFGPGPllpmkrRGAETERHMIPGNGPPLAMCHQPAPPELF---ETLCFPIDPASSAPPKATHRM 218
Cdd:PHA03247 2613 PPSPLPPDTHAPDPPPPSPSP------AANEPDPHPPPTVPPPERPRDDPAPGRVSrprRARRLGRAAQASSPPQRPRRR 2686
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  219 TItsltgRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSA----SSSPPQG 294
Cdd:PHA03247 2687 AA-----RPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpARPARPP 2761
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  295 TSDTPASSSPPQVtsatsasssppqgtsdtPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPqgtS 374
Cdd:PHA03247 2762 TTAGPPAPAPPAA-----------------PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP---A 2821
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  375 DTPASSSPPQgTSETPASNSPPQG---TSETPGFSSPPqvtTATLVSSSPPQVTSETPASSSPTQVTS-ETPASSSPTQV 450
Cdd:PHA03247 2822 ASPAGPLPPP-TSAQPTAPPPPPGpppPSLPLGGSVAP---GGDVRRRPPSRSPAAKPAAPARPPVRRlARPAVSRSTES 2897
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  451 TSDTPASNSPPQgtsdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PHA03247 2898 FALPPDQPERPP----QPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVA 2973
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  531 DTSASISPPQVISDTPASSSPPQVTSETP-----ASSSPTNMTSDTPASSSPTNM--TSDTPASSSPTNMTSDTPASSS- 602
Cdd:PHA03247 2974 VPRFRVPQPAPSREAPASSTPPLTGHSLSrvsswASSLALHEETDPPPVSLKQTLwpPDDTEDSDADSLFDSDSERSDLe 3053
                         490       500       510
                  ....*....|....*....|....*....|..
gi 755526783  603 -----PPWPVITEVTRPESTIPAGRSLANITS 629
Cdd:PHA03247 3054 aldplPPEPHDPFAHEPDPATPEAGARESPSS 3085
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
237-636 1.59e-17

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 89.07  E-value: 1.59e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  237 SSPPQGTSDTPASSSPPQVtsatsasssppqGTSDTPASSSPPQVTSATSASSSPPQGtsdtPASSSPPQVTSATSASSS 316
Cdd:PHA03307   25 PATPGDAADDLLSGSQGQL------------VSDSAELAAVTVVAGAAACDRFEPPTG----PPPGPGTEAPANESRSTP 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  317 PPQGTSDTPASSSPPQvtsatsaSSSPPQGTSDTPA-SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSP 395
Cdd:PHA03307   89 TWSLSTLAPASPAREG-------SPTPPGPSSPDPPpPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAA 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  396 PQGTSETPGFSSPP-QVTTATLVSSSPPQvtSETPASSSPTQVTSETPASSSPTQVTSDTPASnSPPQGTSDTPGFSSPT 474
Cdd:PHA03307  162 VASDAASSRQAALPlSSPEETARAPSSPP--AEPPPSTPPAAASPRPPRRSSPISASASSPAP-APGRSAADDAGASSSD 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  475 QVTTATLVSSSPPQvtSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPqVISDTPASSSPPQV 554
Cdd:PHA03307  239 SSSSESSGCGWGPE--NECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPS-SPGSGPAPSSPRAS 315
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  555 TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPP--------WPVITEVTRPESTIPAGRSLAN 626
Cdd:PHA03307  316 SSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPrkrprpsrAPSSPAASAGRPTRRRARAAVA 395
                         410
                  ....*....|
gi 755526783  627 ITSKAQEDSP 636
Cdd:PHA03307  396 GRARRRDATG 405
PHA03247 PHA03247
large tegument protein UL36; Provisional
190-607 2.32e-17

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 88.84  E-value: 2.32e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  190 QPAPPELFETLCFPIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSasssppqgt 269
Cdd:PHA03247 2704 PPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAA--------- 2774
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  270 sdtPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPpqgTSDTPASSSPPQvtsatsasssppqgTSD 349
Cdd:PHA03247 2775 ---PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP---PAASPAGPLPPP--------------TSA 2834
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  350 TPASSSPPQGtldtPSSSSPPQGTSDTP----ASSSPPQGTSETPASNS-PPQGTSETPGFSSPPQvttaTLVSSSPPQV 424
Cdd:PHA03247 2835 QPTAPPPPPG----PPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPArPPVRRLARPAVSRSTE----SFALPPDQPE 2906
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  425 TSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS 504
Cdd:PHA03247 2907 RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSR 2986
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  505 DTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDT---PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMT 581
Cdd:PHA03247 2987 EAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQTlwpPDDTEDSDADSLFDSDSERSDLEALDPLPPEPHDPF 3066
                         410       420
                  ....*....|....*....|....*.
gi 755526783  582 SDTPASSSPTNMTSDTPASSSPPWPV 607
Cdd:PHA03247 3067 AHEPDPATPEAGARESPSSQFGPPPL 3092
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
171-619 2.57e-15

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 81.50  E-value: 2.57e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   171 AETERHMIPGNGPPLAMCHQPAppelFETLCFPIDPASSAPPKATH---RMTITSLTGRPQVTSDTlaSSSPPQGTSDTP 247
Cdd:pfam05109  412 ATTTTHKVIFSKAPESTTTSPT----LNTTGFAAPNTTTGLPSSTHvptNLTAPASTGPTVSTADV--TSPTPAGTTSGA 485
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   248 ASSSPPQVTSATSASSSPPQGTSDTPASSSPpqvtsatsasssPPQGTSDTPASSSPpqvtsatsasssPPQGTSDTPAS 327
Cdd:pfam05109  486 SPVTPSPSPRDNGTESKAPDMTSPTSAVTTP------------TPNATSPTPAVTTP------------TPNATSPTLGK 541
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   328 SSPpqvtsATSASSSPPQGTSDTPASSSP-PQGTLDTPSSSSPPQG-TSDTPASSSPPQGTSETPA--SNSPPQGTSETP 403
Cdd:pfam05109  542 TSP-----TSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTVGETSPQAntTNHTLGGTSSTP 616
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   404 GFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSsptqvTSDTPASNSPPQGTSDTPGFSSPTQVTTAtlvS 483
Cdd:pfam05109  617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPS-----TSDNSTSHMPLLTSAHPTGGENITQVTPA---S 688
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   484 SSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASS-SPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASS 562
Cdd:pfam05109  689 TSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKgTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTT 768
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 755526783   563 SPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT-EVTRPESTIP 619
Cdd:pfam05109  769 GHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRPRWTFTSpPVTTAQATVP 826
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
345-623 5.64e-15

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 79.62  E-value: 5.64e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   345 QGTSDTPASSSPPqgtldTPSSSSPPQGTSDTPASSS-PPQGTSETPASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQ 423
Cdd:pfam17823  104 EGAADGAASRALA-----AAASSSPSSAAQSLPAAIAaLPSEAFSAPRAAACRANASAAP--RAAIAAASAPHAASPAPR 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   424 V--------TSETPASSSPTQVTSETPASSSPTQVTSD------TPAS--------NSPPQGTSDTPGFSSPTQVTTATL 481
Cdd:pfam17823  177 TaassttaaSSTTAASSAPTTAASSAPATLTPARGISTaatatgHPAAgtalaavgNSSPAAGTVTAAVGTVTPAALATL 256
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   482 VSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSasisppQVISDTPASSSPPQVTSeTPAS 561
Cdd:pfam17823  257 AAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPII------QVSTDQPVHNTAGEPTP-SPSN 329
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783   562 SSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASS-SPPWPVITEVTRPESTIPAGRS 623
Cdd:pfam17823  330 TTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSmIPEVEATSPTTQPSPLLPTQGA 392
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
373-609 8.70e-14

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 75.94  E-value: 8.70e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  373 TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:COG3469     1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  453 DTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPpqVTSDT 532
Cdd:COG3469    81 TATAAAAAA---------TSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSA--GSTTT 149
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755526783  533 SASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSsptnmtsdTPASSSPPWPVIT 609
Cdd:COG3469   150 TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT--------GPPTPGLPKHVLV 218
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
206-564 1.02e-13

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 76.49  E-value: 1.02e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   206 PASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDtpaSSSPPQVTSATSASSSPPQGTSDTPASSSPP------ 279
Cdd:pfam05109  461 PASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTE---SKAPDMTSPTSAVTTPTPNATSPTPAVTTPTpnatsp 537
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   280 ---QVTSATSASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQG-TSDTPASSSPP--QVTSATSASSSPPQGTSDTPA 352
Cdd:pfam05109  538 tlgKTSPTSAVTTPTPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPTPNATSPTvgETSPQANTTNHTLGGTSSTPV 617
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   353 SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSP-----PQVTTATLVSSSPPQVTSE 427
Cdd:pfam05109  618 VTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAhptggENITQVTPASTSTHHVSTS 697
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   428 TPASSSPTQVTSETPASSSPT------QVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQ 501
Cdd:pfam05109  698 SPAPRPGTTSQASGPGNSSTStkpgevNVTKGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTE 777
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783   502 VTSDTPASSSPPQvTSETPASSSPPQvtsdTSASISPPQVISDTPASSSppQVTSETPASSSP 564
Cdd:pfam05109  778 PTTDYGGDSTTPR-TRYNATTYLPPS----TSSKLRPRWTFTSPPVTTA--QATVPVPPTSQP 833
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
191-576 1.93e-13

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 75.57  E-value: 1.93e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   191 PAPPELFETLCFPIDPASSAPPKATHRMT--ITSLTGRPQVTSDTLASSSPP--QGTSDTPASSSPPQVTSATSASsspp 266
Cdd:pfam03154  197 AGPTPSAPSVPPQGSPATSQPPNQTQSTAapHTLIQQTPTLHPQRLPSPHPPlqPMTQPPPPSQVSPQPLPQPSLH---- 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   267 qgtSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvtsatsasssppqg 346
Cdd:pfam03154  273 ---GQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQ-------------- 335
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   347 tSDTPASSSP-PQGTLDTPSSSSPPQgtsdTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:pfam03154  336 -SQQPPREQPlPPAPLSMPHIKPPPT----TPIPQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSA 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   426 SETPASSSPTQVTSETPASSSP--TQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVT 503
Cdd:pfam03154  411 HPPPLQLMPQSQQLPPPPAQPPvlTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMP 490
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783   504 SDTPASSSPPQVTSETPASssppqvtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSS 576
Cdd:pfam03154  491 GIQPPSSASVSSSGPVPAA---------VSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
LH2 smart00308
Lipoxygenase homology 2 (beta barrel) domain;
1151-1256 3.20e-12

Lipoxygenase homology 2 (beta barrel) domain;


Pssm-ID: 214608 [Multi-domain]  Cd Length: 105  Bit Score: 64.20  E-value: 3.20e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   1151 HYLIQVYTGYRRRAATTAKVVITLYGSEGHSEPHHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNsgDSPSWY 1230
Cdd:smart00308    2 KYKVTVTTGGLDFAGTTASVSLSLVGAEGDGKESKLDYLFKGIFARGSTYEFTFDVDEDFGELGAVKIKNEH--RHPEWF 79
                            90       100
                    ....*....|....*....|....*.
gi 755526783   1231 VSQVIVSDMTTRKKWHFQCNCWLAVD 1256
Cdd:smart00308   80 LKSITVKDLPTGGKYHFPCNSWVYPD 105
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
206-578 4.07e-12

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 70.37  E-value: 4.07e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   206 PASSAPPKATHRMTITSLTGRPQVTSDTLASSSPP--QGTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSPPQVTS 283
Cdd:pfam17823   67 PAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPAtrEGAADGAASRAL-----AAAASSSPSSAAQSLPAAIAALPSEA 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   284 ATSASSSPPQgtsdTPASSSPpqvtsatsasSSPPQGTSDTPASSSPPQVTSATSASssppqGTSDTPASSSPPQGTLDT 363
Cdd:pfam17823  142 FSAPRAAACR----ANASAAP----------RAAIAAASAPHAASPAPRTAASSTTA-----ASSTTAASSAPTTAASSA 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   364 PSSSSPPQGTSDTPASSSPPQ---GTSETPASNSPPQGTSETPGFSSPPQV-----------TTATLVSSSPPQVTSETP 429
Cdd:pfam17823  203 PATLTPARGISTAATATGHPAagtALAAVGNSSPAAGTVTAAVGTVTPAALatlaaaagtvaSAAGTINMGDPHARRLSP 282
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   430 ASSSPTQVTSETPASSS-------PTQVTSDTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:pfam17823  283 AKHMPSDTMARNPAAPMgaqaqgpIIQVSTDQPVHNTAGEPTP-SPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSAS 361
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783   503 TSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQViSDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:pfam17823  362 PVPVLHTSMIPEVEATSPTTQPSPLLPTQGAAGPGILLA-PEQVATEATAGTASAGPTPRSSGDPKTLAMASCQLS 436
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
351-605 7.01e-12

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 70.26  E-value: 7.01e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  351 PASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQ---VTSE 427
Cdd:PRK07003  362 VTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPAtadRGDD 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  428 TPASSSPTQVTSETPASSSPT-QVTSDTPASNSPPQG--TSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK07003  442 AADGDAPVPAKANARASADSRcDERDAQPPADSGSASapASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDA 521
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  505 DTPASSSPPQVTSETPASSSPPQVTSDTSASI-----SPPQVISD------TPASSSPPQVTSETPASSSPtnmTSDTPA 573
Cdd:PRK07003  522 PAAAAPPAPEARPPTPAAAAPAARAGGAAAALdvlrnAGMRVSSDrgaraaAAAKPAAAPAAAPKPAAPRV---AVQVPT 598
                         250       260       270
                  ....*....|....*....|....*....|..
gi 755526783  574 SSSPTNMTSDTPASSSPTNMTSDTPaSSSPPW 605
Cdd:PRK07003  599 PRARAATGDAPPNGAARAEQAAESR-GAPPPW 629
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
211-608 8.69e-12

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 70.18  E-value: 8.69e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   211 PPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTpASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSS 290
Cdd:pfam03154   94 PERATAKKSKTQEISRPNSPSEGEGESSDGRSVNDE-GSSDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQILQTQPP 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   291 PPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPP------QVTSATSASSSPPQGTSDTPASSSPPQGTLDTP 364
Cdd:pfam03154  173 VLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPAtsqppnQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPM 252
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   365 SSSSPPQGTSDTPASSS------------------------PPQG---TSETPASNSPPQGTSETPGFSSPPQVTTATLV 417
Cdd:pfam03154  253 TQPPPPSQVSPQPLPQPslhgqmppmphslqtgpshmqhpvPPQPfplTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQS 332
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   418 SSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgFSSPTQVTT-------ATLVSSSPP--- 487
Cdd:pfam03154  333 QLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSP-FQMNSNLPPppalkplSSLSTHHPPsah 411
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   488 ----QVTSDTPASSSPP-------QVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:pfam03154  412 ppplQLMPQSQQLPPPPaqppvltQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPG 491
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783   557 ETPASSSPTNMTSDTPASSS----PTNMTSDTPASSSPTNMTSDTPASSSPPWPVI 608
Cdd:pfam03154  492 IQPPSSASVSSSGPVPAAVScplpPVQIKEEALDEAEEPESPPPPPRSPSPEPTVV 547
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
232-643 4.91e-11

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 67.70  E-value: 4.91e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  232 DTLASSSPPQGTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSAT 311
Cdd:PRK07764  379 ERLERRLGVAGGAGAPAAAAPSA--------------AAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSP 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  312 SASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPA-SSSPPQ------ 384
Cdd:PRK07764  445 AGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATlRERWPEilaavp 524
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  385 -------GTSETPASNSPPQGTSETPGFSSPP------QVTTATLVSSSPPQVTSET----------PASSSPTQVTSET 441
Cdd:PRK07764  525 krsrktwAILLPEATVLGVRGDTLVLGFSTGGlarrfaSPGNAEVLVTALAEELGGDwqveavvgpaPGAAGGEGPPAPA 604
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  442 PASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK07764  605 SSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPA 684
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  522 SSSPPQVtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP---------ASSSPTN 592
Cdd:PRK07764  685 PAPAAPA---APAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPddppdpagaPAQPPPP 761
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|.
gi 755526783  593 MTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTH 643
Cdd:PRK07764  762 PAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEE 812
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
227-658 3.50e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 64.79  E-value: 3.50e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   227 PQVTSDTLASSSPPQGTSDTPASSSPPqvtsatsaSSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQ 306
Cdd:pfam03154  187 PPPGTTQAATAGPTPSAPSVPPQGSPA--------TSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPP 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   307 vtsatsasssppqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGT 386
Cdd:pfam03154  259 --------------SQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQR 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   387 SETPASNSPPQgtsetpgfssppqvttatlvsssPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSD 466
Cdd:pfam03154  325 IHTPPSQSQLQ-----------------------SQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGP 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   467 TPgFSSPtqvttatlvSSSPPqvtsdTPASSSPPQVTSDTPASSSPP--QVTSETPASSSPPQVTSDTSASISPPQVISD 544
Cdd:pfam03154  382 SP-FQMN---------SNLPP-----PPALKPLSSLSTHHPPSAHPPplQLMPQSQQLPPPPAQPPVLTQSQSLPPPAAS 446
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   545 TPASSSPPQVTSETP-------ASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT-------E 610
Cdd:pfam03154  447 HPPTSGLHQVPSQSPfpqhpfvPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQikeealdE 526
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*...
gi 755526783   611 VTRPESTIPAGRSlanitskaqeDSPLGVISTHPQMSFQSSTSQQALD 658
Cdd:pfam03154  527 AEEPESPPPPPRS----------PSPEPTVVNTPSHASQSARFYKHLD 564
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
165-513 3.66e-10

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 64.81  E-value: 3.66e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  165 PMKRRGAETERHMIPGNGPPlAMCHQPAPPELFETLCFPIDPASSAPPKATHRMTITSLTGRPQVTSDT-------LASS 237
Cdd:PHA03307   99 SPAREGSPTPPGPSSPDPPP-PTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAassrqaaLPLS 177
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  238 SPPQG--TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP---------------PQVTSATSASSSPPQGTSDTPA 300
Cdd:PHA03307  178 SPEETarAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPapapgrsaaddagasSSDSSSSESSGCGWGPENECPL 257
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  301 SSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassSPPQGTSDTPASSSPPQGTLDTPSS-----SSPPQGTSD 375
Cdd:PHA03307  258 PRPAPITLPTRIWEASGWNGPSSRPGPASSS----------SSPRERSPSPSPSSPGSGPAPSSPRassssSSSRESSSS 327
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  376 TPASSSPPqgtSETPASnSPPQGTSETPGFSSPPqvttatlvSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTP 455
Cdd:PHA03307  328 STSSSSES---SRGAAV-SPGPSPSRSPSPSRPP--------PPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVA 395
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 755526783  456 ASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtsdTPASSSPP 513
Cdd:PHA03307  396 GRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEP------WPGSPPPP 447
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
160-511 4.96e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 64.40  E-value: 4.96e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   160 PGPLLPMKRRGAETERHMIPGNGPPLAMCHQPAPPelfetlcfPIDPASSAPPkATHRMTITSLTGRPQVTSDTLASSSP 239
Cdd:pfam03154  224 TAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPP--------SQVSPQPLPQ-PSLHGQMPPMPHSLQTGPSHMQHPVP 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   240 PQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvtsatsasssPPQGTSDTPASSSPPQVTSATSASSSPPQ 319
Cdd:pfam03154  295 PQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQ----------SQQPPREQPLPPAPLSMPHIKPPPTTPIP 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   320 GTSDTPASSSPPQVTSATSASSSppqgTSDTPASSSPPQGTLDT--PSSSSPP------QGTSDTPASSSPP---QGTSE 388
Cdd:pfam03154  365 QLPNPQSHKHPPHLSGPSPFQMN----SNLPPPPALKPLSSLSThhPPSAHPPplqlmpQSQQLPPPPAQPPvltQSQSL 440
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   389 TPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtsdtp 468
Cdd:pfam03154  441 PPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCP------- 513
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 755526783   469 gfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:pfam03154  514 --LPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVNTPSHAS 554
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
323-635 5.76e-10

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 63.55  E-value: 5.76e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   323 DTPASSSPPQVTSATSASSSppQGTSdTPASSSPPQGTLDTPSSSSPPQGTS--------DTPASSSPPQGTSETPASNS 394
Cdd:pfam03546   37 ETPAAKTPLQAKPSGKTPQV--RAAS-APAKESPRKGAPPVPPGKTGPAAAQaqagkpeeDSESSSEESDSDGETPAAAT 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   395 PPQGTSETPGFSSPPQVTTATLVSSSP------PQVTSETPASSSPTQVTSETPASSSPTQvTSDTPASNSPPQGTSDTP 468
Cdd:pfam03546  114 LTTSPAQVKPLGKNSQVRPASTVGKGPsgkganPAPPGKAGSAAPLVQVGKKEEDSESSSE-ESDSEGEAPPAATQAKPS 192
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   469 GFSSPTQVTT--ATLVSSSPPQVT------------------SDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQV 528
Cdd:pfam03546  193 GKILQVRPASgpAKGAAPAPPQKAgpvatqvkaerskedsesSEESSDSEEEAPAAATPAQAKPALKTPQTKASPRKGTP 272
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   529 TSDTSASISPPQVisDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTP---------A 599
Cdd:pfam03546  273 ITPTSAKVPPVRV--GTPAPWKAGTVTSPACASSPAVARGAQRPEEDSSSSEESESEEETAPAAAVGQAKsvgkglqgkA 350
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 755526783   600 SSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDS 635
Cdd:pfam03546  351 ASAPTKGPSGQGTAPVPPGKTGPAVAQVKAEAQEDS 386
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
203-603 6.50e-10

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 63.55  E-value: 6.50e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTlaSSSPPQGTSDTPA----SSSPPQV-----TSATSASSSPPQGTSDTP 273
Cdd:pfam03546   67 PRKGAPPVPPGKTGPAAAQAQAGKPEEDSES--SSEESDSDGETPAaatlTTSPAQVkplgkNSQVRPASTVGKGPSGKG 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   274 ASSSPPQVTSATSASSSPPQGTSDTPASS--------SPPQVTSATSASSSPPQGTSDTP---ASSSPPQVTSATSASSS 342
Cdd:pfam03546  145 ANPAPPGKAGSAAPLVQVGKKEEDSESSSeesdsegeAPPAATQAKPSGKILQVRPASGPakgAAPAPPQKAGPVATQVK 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   343 PPQGTSDT-------------PASSSPPQG--TLDTPSS-SSPPQGTSDTPASSSPPQGTSETPASNSppQGTSETPGFS 406
Cdd:pfam03546  225 AERSKEDSesseessdseeeaPAAATPAQAkpALKTPQTkASPRKGTPITPTSAKVPPVRVGTPAPWK--AGTVTSPACA 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   407 SPPQVTTAT----LVSSSPPQVTSETPASSSPTQVTSET-------PASSSPTQVTSDTPASNSPPQGTSdtpgfSSPTQ 475
Cdd:pfam03546  303 SSPAVARGAqrpeEDSSSSEESESEEETAPAAAVGQAKSvgkglqgKAASAPTKGPSGQGTAPVPPGKTG-----PAVAQ 377
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   476 VTTATLVSSSPPQVTSD------TPASSSPPQVTSDTPASSSPPQVTSETPASSSP-------PQVTSDTSASISPPQVI 542
Cdd:pfam03546  378 VKAEAQEDSESSEEESDseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAPgkvvaaaAQAKQGSPAKVKPPART 457
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783   543 SDTPASSSPPQ---------VTSETPASSSPTNMTSDTPASSSPTNMTSD--TPASSSPTNMTSDTPASSSP 603
Cdd:pfam03546  458 PQNSAISVRGQasvpavgkaVATAAQAQKGPVGGPQEEDSESSEEESDSEeeAPAQAKPSGKTPQVRAASAP 529
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
194-618 9.91e-10

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 63.53  E-value: 9.91e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  194 PELFETLCFPIDP-ASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTpassSPPQVTSATSASSSPPQGT--- 269
Cdd:COG5665   168 PVAVVVTTMIAVPsAPAAPPNAVDYSVLVPIAAQDPAASVSTPQAFNASATSGR----SQHIVQAAKRVGVEWWGDPsll 243
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  270 SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPpqvTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT-- 347
Cdd:COG5665   244 ATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQ---LTTSNTPTSTAKAQPQPPTKKQPAKEPPSDTASGNPSAPSvl 320
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  348 --SDTPASSSPPqgtldTPSSSSPPQGTSDTPASSSPpqgtsetpasnsppqgtsetpgfSSPPQVTTATLVSSSPPQVT 425
Cdd:COG5665   321 inSDSPTSEDPA-----TASVPTTEETTAFTTPSSVP-----------------------STPAEKDTPATDLATPVSPT 372
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  426 SetPASSSPTQVTSETPASSSPTQVTSDTPASNSPP---QGTSDTPGFSSPTQvtTATLVSSSPPQVTSDTPASSSPPQv 502
Cdd:COG5665   373 P--PETSVDKKVSPDSATSSTKSEKEGGTASSPMPPniaIGAKDDVDATDPSQ--EAKEYTKNAPMTPEADSAPESSVR- 447
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  503 TSDTPASSSPPQVTSET---------PASSSPPQVTSDTSAS-----ISPPQVISDTPASSSPPQVTSETPASSSPTNMT 568
Cdd:COG5665   448 TEASPSAGSDLEPENTTlrdpapnaiPPPEDPSTIGRLSSGDklaneTGPPVIRRDSTPSSTADQSIVGVLAFGLDQRTQ 527
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|
gi 755526783  569 SdtpASSSPTNMTSDTPASSSPTNMTSDTpaSSSPPWPvITEVTRPESTI 618
Cdd:COG5665   528 A---EISVEAASRSNPLLNSQVKSFPLGK--RSEGAKG-KTQTDRGISNA 571
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
207-591 1.87e-09

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 62.01  E-value: 1.87e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   207 ASSAPPKATHRMTITSLTGRPQVTSDtlasSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATS 286
Cdd:pfam03546  145 ANPAPPGKAGSAAPLVQVGKKEEDSE----SSSEESDSEGEAPPAATQAKPSGKILQVRPASGPAKGAAPAPPQKAGPVA 220
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   287 ASSSPPQGTSDTPASSSppqvtsatsasssPPQGTSDTPASSSPPQV---TSATSASSSPPQGTSDTPASSSPPQGTLDT 363
Cdd:pfam03546  221 TQVKAERSKEDSESSEE-------------SSDSEEEAPAAATPAQAkpaLKTPQTKASPRKGTPITPTSAKVPPVRVGT 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPP-QVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:pfam03546  288 PAPWKAGTVTSPACASSPAVARGAQRPEEDSSSSEESESEEETAPAaAVGQAKSVGKGLQGKAASAPTKGPSGQGTAPVP 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   443 ---ASSSPTQV-TSDTPASNSPPQGTSDTPGFSSPTQVTtatlvsssPPQVTSDTPASSSPPQVTSDTPASSSP------ 512
Cdd:pfam03546  368 pgkTGPAVAQVkAEAQEDSESSEEESDSEEAAATPAQVK--------ASGKTPQAKANPAPTKASSAKGAASAPgkvvaa 439
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   513 -PQVTSETPASSSPPQVTSDTSASISPPQ---------VISDTPASSSPPQVTSETPASSSPTNMTSD--TPASSSPTNM 580
Cdd:pfam03546  440 aAQAKQGSPAKVKPPARTPQNSAISVRGQasvpavgkaVATAAQAQKGPVGGPQEEDSESSEEESDSEeeAPAQAKPSGK 519
                          410
                   ....*....|.
gi 755526783   581 TSDTPASSSPT 591
Cdd:pfam03546  520 TPQVRAASAPA 530
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
417-595 2.47e-09

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 61.22  E-value: 2.47e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   417 VSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTpgfssptqvTTATLVSSSPPQVTSDTPAS 496
Cdd:pfam05539  157 LRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTA---------TANQRLSSTEPVGTQGTTTS 227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   497 SSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPASSSPTNMTSDTPASSS 576
Cdd:pfam05539  228 SNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRR---KTPPATSNRRSPHSTATPPPTTKRQETGRPTPR 304
                          170       180
                   ....*....|....*....|.
gi 755526783   577 PTNMT--SDTPASSSPTNMTS 595
Cdd:pfam05539  305 PTATTqsGSSPPHSSPPGVQA 325
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
364-621 3.10e-09

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 61.48  E-value: 3.10e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  364 PSSSSPPQ--GTSDTPASSSPPQGTSETPASNS---PPQGTSETPGFSSPpqvtTATLVSSSPPqvTSETPASssptqvT 438
Cdd:PLN03209  324 PSQRVPPKesDAADGPKPVPTKPVTPEAPSPPIeeePPQPKAVVPRPLSP----YTAYEDLKPP--TSPIPTP------P 391
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  439 SETPASSSPTQVTS--DTPASNSPPQGTSDTPGfSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvtSDTPASSSPPQVT 516
Cdd:PLN03209  392 SSSPASSKSVDAVAkpAEPDVVPSPGSASNVPE-VEPAQVEAKKTRPLSPYARYEDLKPPTSP----SPTAPTGVSPSVS 466
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  517 SETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA-SSSPTNMTS 595
Cdd:PLN03209  467 STSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGNSApPTALADEQH 546
                         250       260
                  ....*....|....*....|....*...
gi 755526783  596 DTPASSSP--PWPVITEVTRPESTIPAG 621
Cdd:PLN03209  547 HAQPKPRPlsPYTMYEDLKPPTSPTPSP 574
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
352-604 3.88e-09

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 60.57  E-value: 3.88e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQvTSETPAS 431
Cdd:pfam13254   58 PGLSPTKLSREGSPESTSRPSSSHSEATIVRHSKDDERPSTPDEGFVKPALPRHSRSSSALSNTGSEEDSPS-LPTSPPS 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   432 SSPTQVT---SETPAS---------SSPTQvtSDTPASNSPP------------QGTSDTPGFSSPTQVTTATLVSSSPP 487
Cdd:pfam13254  137 PSKTMDPkrwSPTKSSwlesalnrpESPKP--KAQPSQPAQPawmkelnkirqsRASVDLGRPNSFKEVTPVGLMRSPAP 214
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   488 QVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTS--ASISPPQVISDTPASS-----SPPQVTSETPA 560
Cdd:pfam13254  215 GGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPAPTSASEPppKTKELPKDSEEPAAPSksaeaSTEKKEPDTES 294
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....
gi 755526783   561 SSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPP 604
Cdd:pfam13254  295 SPETSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPP 338
Hamartin pfam04388
Hamartin protein; This family includes the hamartin protein which is thought to function as a ...
417-681 4.38e-09

Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.


Pssm-ID: 461287 [Multi-domain]  Cd Length: 730  Bit Score: 61.23  E-value: 4.38e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   417 VSSSPPqvTSETPASSSPTQVTSETP-ASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQV-TSDTP 494
Cdd:pfam04388  276 PTASPY--TDQQSSYGSSTSTPSSTPrLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGMTTPPTSPGMVpTTPSE 353
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   495 ASSSPPQVTSDtpaSSSPPQVTSE-----TPASSSPPQVTSDTSASISPPQVISDTPASSSPPQvTSETPASSSP----- 564
Cdd:pfam04388  354 LSPSSSHLSSR---GSSPPEAAGEatpetTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPR-KDGRSQSSFPplskq 429
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   565 --TNMTSDTPASSSPTNMTSDTpasSSPTNMTSDTpaSSSPPWPVITEVTRPESTipagRSLANITSKAQEdsplgviST 642
Cdd:pfam04388  430 apTNPNSRGLLEPPGDKSSVTL---SELPDFIKDL--ALSSEDSVEGAEEEAAIS----QELSEITTEKNE-------TD 493
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 755526783   643 HPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEFQKAC 681
Cdd:pfam04388  494 CSRGGLDMPFSRTMESLAGSQRSRNRIASYCSSTSQSDS 532
DUF2967 pfam11179
Protein of unknown function (DUF2967); This family of proteins with unknown function appears ...
236-732 4.99e-09

Protein of unknown function (DUF2967); This family of proteins with unknown function appears to be restricted to Drosophila.


Pssm-ID: 402654 [Multi-domain]  Cd Length: 954  Bit Score: 61.20  E-value: 4.99e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   236 SSSPPQGTSDTPASSsPPQVTSATSASSSPPQGTSDTpaSSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASS 315
Cdd:pfam11179   15 SSAPPHAALAGPITA-APTGAAAAAATSTAAASAASS--TITAPGAGPGGTPTSRSRGAQAMTASLAHAAQGNANANKST 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   316 SPPQGTSDTPASSSPPQ---VTSATSASSSPPQGTSDTPASSSPPQgtldTPSSSSPPQGTSDTPASSSPPQgTSETPAS 392
Cdd:pfam11179   92 RNNSNSSNNNGKPKPLAacyMSTRSAAMMALALGQQSGEKKDKKPA----AGKAASPAQSQSQSQSQNASPH-TNNRAVS 166
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   393 NSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTsetpASSSPTQVTSDTPASNSPPQGTSDTPGFS- 471
Cdd:pfam11179  167 MTRPAATRRLPNAAAMSNVNAANSTCTATATSLPSNRARSKPSTPT----ATRAAAQLNGMGIFSGGSNSSGSDNDGFSa 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   472 --SPTQVTTATLVSSSPPQVTSDTPASSsppqvTSDTPASSSPPQVTSET--PASSSPPQVTSDTSASISPPQVISDTpA 547
Cdd:pfam11179  243 sgSSAATALRRLYFKSGRSIKNKINAST-----SSSTPLNGLPLNAVSNAfhNSVGGATAMHAMGTAGGVPKLVVMGT-S 316
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   548 SSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNmtsdtPASSSPPWPVITEVT-----RPESTIPAGR 622
Cdd:pfam11179  317 SASIPDTTINTSTDSACTLITNVTHTDTSETCDSLDLGDNSGPSE-----PLFSSLEEPLLTAIHidsehEGFGGMAGGR 391
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   623 SLANITSKAQ-EDSPLGVISTHPQMSFQSSTSQQ--ALDETAGERVPTIPDFQAHSEFQKACAILQRLRDFLPTSPTSAQ 699
Cdd:pfam11179  392 GGANGRGATElELTSCSRYPPRPDMNLQDSTESQesCLSILTGEPSSTTPLLSSQRRHPTGGHSPGSQRQEERRERKERE 471
                          490       500       510
                   ....*....|....*....|....*....|...
gi 755526783   700 VSVANLLIDLSEQLLVLPFQKNNSWSSQTPAVS 732
Cdd:pfam11179  472 PSTAPPPTRGREHFTFDPPQSPKSARSSEKARS 504
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
319-632 6.74e-09

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 60.04  E-value: 6.74e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  319 QGTSDTPASSSPPQvtsATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGT-----SDTPASSSPPQGT-SETPAS 392
Cdd:COG5164    18 TTPAGSQGSTKPAQ---NQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQgatgpAQNQGGTTPAQNQgGTRPAG 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  393 NSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPA---SSSPTQVTsdTPASNSPPQGTSDTPG 469
Cdd:COG5164    95 NTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPgpgSTGPGGST--TPPGDGGSTTPPGPGG 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  470 FSSPTQVTTATlvsssppqvTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQvTSDTSASISPPQVISDTPASS 549
Cdd:COG5164   173 STTPPDDGGST---------TPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPP-DDRGGKTGPKDQRPKTNPIER 242
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  550 SPPQVTSETPASSSPTNMTSDTPASS-SPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANIT 628
Cdd:COG5164   243 RGPERPEAAALPAELTALEAENRAANpEPATKTIPETTTVKDLATVLGKKGSDLVTNLMKKGKGTNINAALDFETAATIA 322

                  ....
gi 755526783  629 SKAQ 632
Cdd:COG5164   323 LEGN 326
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
347-677 9.74e-09

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 60.39  E-value: 9.74e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   347 TSDTPASSSPPQGTLDTPSSSSPPQGTS------DTPassSPPQGTSE----TPASNSPPQGT------SETPgfSSPPQ 410
Cdd:TIGR00927   76 SSDPPKSSSEMEGEMLAPQATVGRDEATpsiameNTP---SPPRRTAKitptTPKNNYSPTAAgtervkEDTP--ATPSR 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   411 VTTATLVSSSPPQVTSETPA------SSSPTQVTSE----TPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTAT 480
Cdd:TIGR00927  151 ALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKvrkyTPSPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITAT 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   481 --LVSSSPPQ----VTSDTP----ASSSPPQVTSDTPASS-SPPQVTSETPASSSPPQVTSDTSA---------SISPPQ 540
Cdd:TIGR00927  231 ykMLETNPSKrtagKTTPTPlkgmTDNTPTFLTREVETDLlTSPRSVVEKNTLTTPRRVESNSSTnhwglvgknNLTTPQ 310
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   541 --VISDTPASSSPpQVTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTsdtpaSSSPPWPVITEVTRPESTI 618
Cdd:TIGR00927  311 gtVLEHTPATSEG-QVTISIMTGSSPA----ETKASTAAWKIRNPLSRTSAPAVRI-----ASATFRGLEKNPSTAPSTP 380
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783   619 PAGRSLANITSKAQE---DSPLGVISTHPQMSFQSSTSQQALDETAGERVPTIPDFQAHSEF 677
Cdd:TIGR00927  381 ATPRVRAVLTTQVHHcvvVKPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDLHPKAEY 442
PLAT_plant_stress cd01754
PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of ...
1152-1256 1.23e-08

PLAT/LH2 domain of plant-specific single domain protein family with unknown function. Many of its members are stress induced. In general, PLAT/LH2 consists of an eight-stranded beta-barrel and its proposed function is to mediate interaction with lipids or membrane bound proteins.


Pssm-ID: 238852  Cd Length: 129  Bit Score: 54.85  E-value: 1.23e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783 1152 YLIQVYTGYRRRAATTAKVVITLYGSEGH-------SEPHHLCDPEKTVFERGALDVF------LLSTGSWLgdlhglRL 1218
Cdd:cd01754     3 YTIYVQTGSIWKAGTDSRISLQIYDADGPglrianlEAWGGLMGAGHDYFERGNLDRFsgrgpcLPSPPCWM------NL 76
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 755526783 1219 WHDNSGDSPSWYVSQVIVsdmtTRKKWHFQCNC-------WLAVD 1256
Cdd:cd01754    77 TSDGTGNHPGWYVNYVEV----TQAGQHAPCMQhlfaveqWLATD 117
PRK08581 PRK08581
amidase domain-containing protein;
360-596 1.57e-08

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 59.03  E-value: 1.57e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  360 TLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNsppqgTSETPgfssppQVTTATLVSSSPPQVTSETPASSSP-TQVT 438
Cdd:PRK08581   14 TLVLPTLTSPTAYADDPQKDSTAKTTSHDSKKSN-----DDETS------KDTSSKDTDKADNNNTSNQDNNDKKfSTID 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  439 SETPASS---SPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQvTSDTPASSSPPQVTSDTPASSSPPqv 515
Cdd:PRK08581   83 SSTSDSNniiDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD-YEQPRNSEKSTNDSNKNSDSSIKN-- 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  516 tSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:PRK08581  160 -DTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSE 238

                  .
gi 755526783  596 D 596
Cdd:PRK08581  239 D 239
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
442-612 2.14e-08

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 58.43  E-value: 2.14e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   442 PASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPpqvTSDTPASSSPPQVTSETP- 520
Cdd:pfam17823   48 PRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREG---AADGAASRALAAAASSSPs 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   521 --ASSSPPQVTSDTSASISPPQvisdTPASSSPPQVTSETP--ASSSPTNMTSDT-PASSSPTNMTSDTPASSSPTNMTS 595
Cdd:pfam17823  125 saAQSLPAAIAALPSEAFSAPR----AAACRANASAAPRAAiaAASAPHAASPAPrTAASSTTAASSTTAASSAPTTAAS 200
                          170
                   ....*....|....*..
gi 755526783   596 DTPASSSPPWPVITEVT 612
Cdd:pfam17823  201 SAPATLTPARGISTAAT 217
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
434-634 3.59e-08

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 58.01  E-value: 3.59e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  434 PTQVTSETPASSSPTQVTSDTPASNS---PPQGTSDTPGFSSPTqvttATLVSSSPPQVTSDTPASSSPPQVTS------ 504
Cdd:PLN03209  331 KESDAADGPKPVPTKPVTPEAPSPPIeeePPQPKAVVPRPLSPY----TAYEDLKPPTSPIPTPPSSSPASSKSvdavak 406
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  505 -DTPASSSPPQVTSETPASSsPPQVTSDTSASISPPQVISD--TPASSSP-PQVTSETPASSSPT-NMTSDTPASSSPTN 579
Cdd:PLN03209  407 pAEPDVVPSPGSASNVPEVE-PAQVEAKKTRPLSPYARYEDlkPPTSPSPtAPTGVSPSVSSTSSvPAVPDTAPATAATD 485
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  580 MTSDTPASSSPTNMTS-----DTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQED 634
Cdd:PLN03209  486 AAAPPPANMRPLSPYAvyddlKPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQ 545
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
294-487 4.04e-08

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 57.84  E-value: 4.04e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  294 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGT 373
Cdd:COG3469    20 VTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVAT 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  374 SDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSD 453
Cdd:COG3469   100 STASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTP 179
                         170       180       190
                  ....*....|....*....|....*....|....
gi 755526783  454 TPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPP 487
Cdd:COG3469   180 SATTTATATTASGATTPSATTTATTTGPPTPGLP 213
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
345-514 4.64e-08

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 56.98  E-value: 4.64e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   345 QGTSDTPASSSPPQGTLDTPSSSSPPQG----TSDTPASSSPPQGTSETPASNSPPQGTseTPGFSSPPQVTTATLVSSS 420
Cdd:pfam05539  176 KTTSWPTEVSHPTYPSQVTPQSQPATQGhqtaTANQRLSSTEPVGTQGTTTSSNPEPQT--EPPPSQRGPSGSPQHPPST 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   421 PPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPP 500
Cdd:pfam05539  254 TSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQET-----GRPTPRPTATTQSGSSPPHSSPPGVQANPT 328
                          170
                   ....*....|....
gi 755526783   501 QVTSDTPASSSPPQ 514
Cdd:pfam05539  329 TQNLVDCKELDPPK 342
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
364-577 5.06e-08

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 56.98  E-value: 5.06e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfssppqvTTATLVSSSPPQVTSETPASSSPTQVTSETPA 443
Cdd:pfam05539  169 KTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTA---------TANQRLSSTEPVGTQGTTTSSNPEPQTEPPPS 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   444 SSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVT-------TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQvT 516
Cdd:pfam05539  240 QRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTppatsnrRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPH-S 318
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783   517 SETPASSSPPQVTSDTSASISPPQVIS-----DTPASSSPPQVTSETPASSSPTNMTSDTPASSSP 577
Cdd:pfam05539  319 SPPGVQANPTTQNLVDCKELDPPKPNSicygvGIYNEALPRGCDIVVPLCSTYTIMCMDTYYSKPF 384
PHA03255 PHA03255
BDLF3; Provisional
438-608 9.00e-08

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 54.52  E-value: 9.00e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  438 TSETPASSSPTQVTSDTPAsnsppqgTSDTPGFSSPTQVTTATLVSSSPPqVTSDTPASSSPPQVTSdTPASSSPPQVTS 517
Cdd:PHA03255   25 TSSGSSTASAGNVTGTTAV-------TTPSPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  518 ETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDT 597
Cdd:PHA03255   96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQ 175
                         170
                  ....*....|...
gi 755526783  598 PASSS--PPWPVI 608
Cdd:PHA03255  176 PSLSYglPLWTLV 188
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
382-577 1.89e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 56.01  E-value: 1.89e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  382 PPQGTSETPASNSPPQGTSETPGfsSPPQVTTATLVSSSPPQvtseTPASSSPTQVTSETPASSSPTQVTSDTPASNSPP 461
Cdd:PRK07003  360 PAVTGGGAPGGGVPARVAGAVPA--PGARAAAAVGASAVPAV----TAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPP 433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  462 Q-GTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSE-TPASSSPPQVTSDTSASISPP 539
Cdd:PRK07003  434 AtADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEpAPRAAAPSAATPAAVPDARAP 513
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 755526783  540 QVIS--DTPASSSPPqvtseTPASSSPTnmtsdtPASSSP 577
Cdd:PRK07003  514 AAASreDAPAAAAPP-----APEARPPT------PAAAAP 542
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
245-644 2.07e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 55.89  E-value: 2.07e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  245 DTPASSSPPQVTSATSA------SSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPP 318
Cdd:PRK14949  369 DDPAEISLPEGQTPSALaaavqaPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAEADASAEPADTVEQALDD 448
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  319 QgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPA---SSSPPQGTSETPASNSP 395
Cdd:PRK14949  449 E-SELLAALNAEQAVILSQAQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVddtSASNNSAADNTVDDNYS 527
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  396 PQGTSETPG--------------------------FSSPPQVTTATLVSSSPPQVTSET-PASSSPTQVTSETPASSSP- 447
Cdd:PRK14949  528 AEDTLESNGldegdyaqdsapldayqddyvafsseSYNALSDDEQHSANVQSAQSAAEAqPSSQSLSPISAVTTAAASLa 607
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  448 ----------------TQVTSDTPASNSPPQGTSDTPGFSSPtqvttatlvsSSPPQVTSDTPASSSPPQVTSdtpASSS 511
Cdd:PRK14949  608 dddildavlaardsllSDLDALSPKEGDGKKSSADRKPKTPP----------SRAPPASLSKPASSPDASQTS---ASFD 674
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  512 PPQVTSETPASSSPPQVTSDTSASISPPQV-ISDTPASSSPPQVtseTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 590
Cdd:PRK14949  675 LDPDFELATHQSVPEAALASGSAPAPPPVPdPYDRPPWEEAPEV---ASANDGPNNAAEGNLSESVEDASNSELQAVEQQ 751
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....
gi 755526783  591 TNMTSDTPASSSPPwpviTEVTRPESTIPAGRSLANITSKaqedSPLGVISTHP 644
Cdd:PRK14949  752 ATHQPQVQAEAQSP----ASTTALTQTSSEVQDTELNLVL----LSSGSITGHP 797
Treacle pfam03546
Treacher Collins syndrome protein Treacle;
209-603 2.18e-07

Treacher Collins syndrome protein Treacle;


Pssm-ID: 460967 [Multi-domain]  Cd Length: 531  Bit Score: 55.46  E-value: 2.18e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   209 SAPPKATHRMTITSLTGRPQVTSDtlaSSSPPQGTSD--TPASSSPPQVTSATSASSSPPQGTS--DTPASSSPPQVTSA 284
Cdd:pfam03546    2 PATPGKAGPAATQAKAGKPEEDSE---SSSEEESDSEeeTPAAKTPLQAKPSGKTPQVRAASAPakESPRKGAPPVPPGK 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   285 TSASSSPPQ-GTSDTPASSSPPQVTSATSASSSPPQGTSdtPASSSP----PQVTSATSASS-SPPQGTSDTPASSSPPQ 358
Cdd:pfam03546   79 TGPAAAQAQaGKPEEDSESSSEESDSDGETPAAATLTTS--PAQVKPlgknSQVRPASTVGKgPSGKGANPAPPGKAGSA 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   359 GTL--------DTPSSSSPPQGTSDTPASSS---PPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:pfam03546  157 APLvqvgkkeeDSESSSEESDSEGEAPPAATqakPSGKILQVRPASGPAKGAAPAPPQKAGPVATQVKAERSKEDSESSE 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   428 TPASSSPTQVTSETPASSSPTQVTSDTPAsnSPPQGTSDTPgfsSPTQVTTATLVSSSPPQV-TSDTPASSSPPQVTSDT 506
Cdd:pfam03546  237 ESSDSEEEAPAAATPAQAKPALKTPQTKA--SPRKGTPITP---TSAKVPPVRVGTPAPWKAgTVTSPACASSPAVARGA 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   507 --PASSSPPQVTSETPASSSPP----QVTS-----DTSASISPPQVISDTPASSSPP--------QVTSETPASSSPTNM 567
Cdd:pfam03546  312 qrPEEDSSSSEESESEEETAPAaavgQAKSvgkglQGKAASAPTKGPSGQGTAPVPPgktgpavaQVKAEAQEDSESSEE 391
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 755526783   568 TSD------TPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:pfam03546  392 ESDseeaaaTPAQVKASGKTPQAKANPAPTKASSAKGAASAP 433
PHA03378 PHA03378
EBNA-3B; Provisional
183-624 2.44e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 55.46  E-value: 2.44e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  183 PPLAMCHQPAPPELFETLCFPID-PASSAPPKAThrmtITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSA 261
Cdd:PHA03378  529 PPQPRAGRRAPCVYTEDLDIESDePASTEPVHDQ----LLPAPGLGPLQIQPLTSPTTSQLASSAPSYAQTPWPVPHPSQ 604
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  262 SSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDT-------PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVT 334
Cdd:PHA03378  605 TPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPItfnvlvfPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTM 684
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  335 SATSASSSPPQGTSDTPASSSPPQGtldTPSSSSPPQGtsdTPASSSPPQGtseTPASNSPPQGtseTPGFSSPPQvttA 414
Cdd:PHA03378  685 LPIQWAPGTMQPPPRAPTPMRPPAA---PPGRAQRPAA---ATGRARPPAA---APGRARPPAA---APGRARPPA---A 749
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  415 TLVSSSPPQvtsetpASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGtsdTPGFSSPTQVTTATLVSSSPPQVTSDTP 494
Cdd:PHA03378  750 APGRARPPA------AAPGRARPPAAAPGAPTPQPPPQAPPAPQQRPRG---APTPQPPPQAGPTSMQLMPRAAPGQQGP 820
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  495 ASSSPPQ-----VTSDTPASSSPPQVTSETPASSSP-PQvtSDTSASISPPQVIsdTPASSSPPQVtsetpasssPTNMT 568
Cdd:PHA03378  821 TKQILRQlltggVKRGRPSLKKPAALERQAAAGPTPsPG--SGTSDKIVQAPVF--YPPVLQPIQV---------MRQLG 887
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783  569 SDTPASSSptnmtsdtPASSSPTNMTSDTPASSSPPWPVITEVTR------PESTIPAGRSL 624
Cdd:PHA03378  888 SVRAAAAS--------TVTQAPTEYTGERRGVGPMHPTDIPPSKRaktdayVESQPPHGGQS 941
PRK08581 PRK08581
amidase domain-containing protein;
402-705 2.49e-07

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 55.18  E-value: 2.49e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  402 TPGFSSPpQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSsptqvTSDTPASNSPPQGTSDTPgFSSPTQVTTAT- 480
Cdd:PRK08581   17 LPTLTSP-TAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKD-----TDKADNNNTSNQDNNDKK-FSTIDSSTSDSn 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  481 ---------LVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTS-ETPASSS-PPQVTSDTSAS-ISPPQvisdTPAS 548
Cdd:PRK08581   90 niidfiyknLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDyEQPRNSEkSTNDSNKNSDSsIKNDT----DTQS 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  549 SSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSppwpvitevtrpESTIPAGRSLANIT 628
Cdd:PRK08581  166 SKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDTANQKSSSK------------DNQSMSDSALDSIL 233
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755526783  629 SKAQEDSPLgvisTHPQMSFQSSTSQQaldETAGERVPTIPdfqAHSEFQKACAILQRLRDFLPTSPTSAQVSVANL 705
Cdd:PRK08581  234 DQYSEDAKK----TQKDYASQSKKDKT---ETSNTKNPQLP---TQDELKHKSKPAQSFENDVNQSNTRSTSLFETG 300
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
203-512 2.75e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 55.31  E-value: 2.75e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   203 PIDPASSAPPKATHRmTITSLTGRPQVTSDTLASSSP--------PQGTSDTPASSSP-PQVTSATSASSSPPQG-TSDT 272
Cdd:pfam05109  509 PTSAVTTPTPNATSP-TPAVTTPTPNATSPTLGKTSPtsavttptPNATSPTPAVTTPtPNATIPTLGKTSPTSAvTTPT 587
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   273 PASSSPP--QVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtSATSASSSPPqgTSDT 350
Cdd:pfam05109  588 PNATSPTvgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRP---SSISETLSPS--TSDN 662
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   351 PASSSPpqgTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPA 430
Cdd:pfam05109  663 STSHMP---LLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATSPQ 739
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   431 SSSpTQVTSETPASSSPTQVTSDTPASNSPPQG--TSDTP----GFSSPTQVTTATLVSSSPPQVTSDTPAS---SSPP- 500
Cdd:pfam05109  740 APS-GQKTAVPTVTSTGGKANSTTGGKHTTGHGarTSTEPttdyGGDSTTPRTRYNATTYLPPSTSSKLRPRwtfTSPPv 818
                          330
                   ....*....|....*
gi 755526783   501 ---QVTSDTPASSSP 512
Cdd:pfam05109  819 ttaQATVPVPPTSQP 833
rne PRK10811
ribonuclease E; Reviewed
409-615 2.81e-07

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 55.43  E-value: 2.81e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  409 PQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT----PASNSPPQGTSDTPGFSSPTQVTTATLVSs 484
Cdd:PRK10811  848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEvveePVVVAEPQPEEVVVVETTHPEVIAAPVTE- 926
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  485 sPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP-QVISDTPASSSPPQVTSETPASSS 563
Cdd:PRK10811  927 -QPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAaPVVAEVAAEVETVTAVEPEVAPAQ 1005
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 755526783  564 PTNMTSDTPASSSPtnMTSdTPAsssptnmtsdtPASSSPPwPVITEVTRPE 615
Cdd:PRK10811 1006 VPEATVEHNHATAP--MTR-APA-----------PEYVPEA-PRHSDWQRPT 1042
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
414-666 2.91e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 55.27  E-value: 2.91e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  414 ATLVSSSPPQVTSETPASSSPTQVTsetPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDT 493
Cdd:PRK12323  372 AGPATAAAAPVAQPAPAAAAPAAAA---PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAP 448
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  494 PASSSPPQVtsdtPASSSPPQVTS-ETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PRK12323  449 APAPAPAAA----PAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  573 ASSSPTNMTSDTPASSSptnmtsdTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHP------QM 646
Cdd:PRK12323  525 SIPDPATADPDDAFETL-------APAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPALAARLPvrglaqQL 597
                         250       260
                  ....*....|....*....|
gi 755526783  647 SFQSSTsQQALDETAGERVP 666
Cdd:PRK12323  598 ARQSEL-AGVEGDTVRLRVP 616
CLECT cd00037
C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: C-type lectin (CTL)/C-type ...
37-142 3.21e-07

C-type lectin (CTL)/C-type lectin-like (CTLD) domain; CLECT: protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group consists chiefly of eukaryotic CTLDs, but contains some bacterial CTLDs that are as yet functionally uncharacterized. Many CTLDs are calcium-dependent carbohydrate-binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example, mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, and necrotic and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platelet)-, E (endothelial)-, and L (leukocyte)-selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls, enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands, including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, including the CTLDs of coagulation factors IX/X (IX/X) and von Willebrand factor (VWF) binding proteins and of natural killer cell receptors; only some of these binding interactions are Ca2+-dependent. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations.
Various vertebrate type 1 transmembrane proteins, including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205), have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the putative non-sugar-bound form of macrophage mannose receptor CRD4 in the acidic environment of the endosome. Lineage-specific expansions of CTLDs have occurred in several animal lineages, including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.


Pssm-ID: 153057 [Multi-domain]  Cd Length: 116  Bit Score: 50.31  E-value: 3.21e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   37 SCYQLNRLFCDFQEADNYCHAQRGRLAHTWNPKLRGFLKSFL---NEETVW-------------WVRGNLTLPGSHPGIN 100
Cdd:cd00037     1 SCYKFSTEKLTWEEAQEYCRSLGGHLASIHSEEENDFLASLLkksSSSDVWiglndlssegtwkWSDGSPLVDYTNWAPG 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 755526783  101 QTGGDDvlrnqkPGECpsvVTHSNAVFSRWN--LCIEKHHFICQ 142
Cdd:cd00037    81 EPNPGG------SEDC---VVLSSSSDGKWNdvSCSSKLPFICE 115
PRK10905 PRK10905
cell division protein DamX; Validated
377-593 3.64e-07

cell division protein DamX; Validated


Pssm-ID: 236792 [Multi-domain]  Cd Length: 328  Bit Score: 53.79  E-value: 3.64e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  377 PASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQV-TSETPASSSPTQVTSDTP 455
Cdd:PRK10905   23 PSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPTQGqTPVATDGQQRVEVQGDLN 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  456 ASNSPPQGTSDTPGFSS----PTQVTT-----------ATLVSSSPPQVTSDTPA------SSSPPQVTSDTPASSSPPQ 514
Cdd:PRK10905  103 NALTQPQNQQQLNNVAVnstlPTEPATvapvrngnasrQTAKTQTAERPATTRPArkqaviEPKKPQATAKTEPKPVAQT 182
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  515 VTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT--NMTSDTPASSSptNMTSDTPASSSPTN 592
Cdd:PRK10905  183 PKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTagNVGSLKSAPSS--HYTLQLSSSSNYDN 260

                  .
gi 755526783  593 M 593
Cdd:PRK10905  261 L 261
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
384-608 6.21e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 54.11  E-value: 6.21e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  384 QGTSETPASNSPPQGTSETPGFSSPPQVTTAtlvSSSPPQVTSETPASSSPTQVTSETPASSSPTQVtSDTPASNSPPQG 463
Cdd:PRK12323  368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAPA---PAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE-ALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  464 tsdTPGFSSPTQVTTATLVSSSPPQVTS-DTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVI 542
Cdd:PRK12323  444 ---PGGAPAPAPAPAAAPAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAG 520
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783  543 SDTpASSSPPQVTSETPASSSPTNMTSDTPAsSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVI 608
Cdd:PRK12323  521 WVA-ESIPDPATADPDDAFETLAPAPAAAPA-PRAAAATEPVVAPRPPRASASGLPDMFDGDWPAL 584
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
352-578 6.59e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 53.94  E-value: 6.59e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  352 ASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PRK08691  363 AASCDANAVIENTELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAA---AMPSEGKTAGPVSNQENNDVPPWEDA 439
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  432 SSPTQvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDtpasSSPPQVTSDTPASSS 511
Cdd:PRK08691  440 PDEAQ-TAAGTAQTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPN----DEAVETETFAHEAPA 514
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755526783  512 PPQVTSETPASSSPPQvtsdTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PRK08691  515 EPFYGYGFPDNDCPPE----DGAEIPPPDWEHAAPADTAGGGADEEAEAGGIGGNNTPSAPPPEFST 577
GPS pfam01825
GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs, and is the site for ...
1041-1079 7.48e-07

GPCR proteolysis site, GPS, motif; The GPS motif is found in GPCRs and is the site of autoproteolysis, hence the name GPS. The GPS motif is a conserved sequence of ~40 amino acids containing canonical cysteine and tryptophan residues, and is the most highly conserved part of the domain. Most, if not all, cell-adhesion GPCRs undergo autoproteolysis in the GPS between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. In higher eukaryotes this motif is found embedded in the C-terminal beta-stranded part of a GPCR-Autoproteolysis INducing (GAIN) domain. The GAIN-GPS domain adopts a fold in which the GPS motif, at the C-terminus, forms five beta-strands that are tightly integrated into the overall GAIN domain. The GPS motif, evolutionarily conserved from Tetrahymena to mammals, is the only extracellular domain shared by all human cell-adhesion GPCRs and PKD proteins, and is the locus of multiple human disease mutations. The GAIN-GPS domain is both necessary and sufficient functionally for autoproteolysis, suggesting an autoproteolytic mechanism whereby the overall GAIN domain fine-tunes the chemical environment in the GPS to catalyze peptide bond hydrolysis. In the cell-adhesion GPCRs and PKD proteins, the GPS motif is always located at the end of their long N-terminal extracellular regions, immediately before the first transmembrane helix of the respective protein.


Pssm-ID: 460350  Cd Length: 44  Bit Score: 46.92  E-value: 7.48e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 755526783  1041 QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS 1079
Cdd:pfam01825    2 QCVFWDFTNSTtgrWSTEGCTTVSLND-THTVCSCNHLTSFA 42
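
The GPS description above places autoproteolysis between a conserved aliphatic residue (usually a leucine) and a threonine, serine, or cysteine residue. As a rough illustration only, the Python sketch below scans the aligned query segment (residues 1041-1079) for such residue pairs. The function name `candidate_gps_sites` and the choice of A, V, L and I as the "aliphatic" set are assumptions made for this example, not part of the CDD record, and a sequence match alone does not show that cleavage occurs at that position.

```python
import re

# Assumption for this sketch: "aliphatic" means A, V, L or I.
ALIPHATIC = "AVLI"
# Residues the description lists on the C-terminal side of the cleaved bond.
FOLLOWING = "TSC"


def candidate_gps_sites(aligned_query, start):
    """Return (pos, residue, pos + 1, residue) for every aliphatic residue
    that is directly followed by T, S or C.

    aligned_query: query row of the alignment (gaps as '-', any letter case).
    start: residue number of the first residue in the row.
    """
    # Drop alignment gaps and normalise case so positions map to residues.
    seq = aligned_query.replace("-", "").upper()
    # Lookahead keeps the scan position-by-position (overlaps allowed).
    pattern = re.compile(rf"(?=([{ALIPHATIC}])([{FOLLOWING}]))")
    sites = []
    for m in pattern.finditer(seq):
        pos = start + m.start()
        sites.append((pos, m.group(1), pos + 1, m.group(2)))
    return sites


# Query row of the pfam01825 alignment shown above, starting at residue 1041.
query = "QCYFWDRYNRT---WKSDGCQVGPKSTiLKTQCLCDHLTFFS"
for site in candidate_gps_sites(query, 1041):
    print(site)
```

For this segment the sketch reports two candidate pairs, Leu1071/Cys1072 and Leu1075/Thr1076. Which of them, if either, corresponds to a real cleavage site cannot be decided from this pattern match.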
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
349-669 1.36e-06

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 53.15  E-value: 1.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  349 DTPASSSPPQGtldTPSSSSPPQGTSDTPASSSPPQGTSEtpaSNSPPQGTSetPGFSSPPQVTtatlvssSPPQVTSET 428
Cdd:PTZ00449  503 DSDKHDEPPEG---PEASGLPPKAPGDKEGEEGEHEDSKE---SDEPKEGGK--PGETKEGEVG-------KKPGPAKEH 567
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQ--VTSDT 506
Cdd:PTZ00449  568 KPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQrpSSPER 647
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  507 PASSSPPQvTSETPASSSPP-------QVTSDTS-ASISPPQVISDTPASSSPPQVTSETPASSSPTNMTsdTPASSSPT 578
Cdd:PTZ00449  648 PEGPKIIK-SPKPPKSPKPPfdpkfkeKFYDDYLdAAAKSKETKTTVVLDESFESILKETLPETPGTPFT--TPRPLPPK 724
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  579 NMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTiPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALD 658
Cdd:PTZ00449  725 LPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHET-PADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHE 803
                         330
                  ....*....|.
gi 755526783  659 ETAGERVPTIP 669
Cdd:PTZ00449  804 DKPPGDHPSLP 814
PHA03255 PHA03255
BDLF3; Provisional
464-622 1.40e-06

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 51.06  E-value: 1.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  464 TSDTPGFSSPTQVTTATlvsssppQVTSDTPASSSPPQVTSDTPASSSPPqVTSETPASSSPPQVTSdTSASISPPQVIS 543
Cdd:PHA03255   25 TSSGSSTASAGNVTGTT-------AVTTPSPSASGPSTNQSTTLTTTSAP-ITTTAILSTNTTTVTS-TGTTVTPVPTTS 95
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  544 DTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGR 622
Cdd:PHA03255   96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDER 174
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
240-589 1.52e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 52.99  E-value: 1.52e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  240 PQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQ 319
Cdd:NF033609  558 PEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSA 637
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  320 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGT 399
Cdd:NF033609  638 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 717
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  400 SETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPGFSSPTQVTT 478
Cdd:NF033609  718 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDSDSDSDSDSD 797
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:NF033609  798 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNS 877
                         330       340       350
                  ....*....|....*....|....*....|..
gi 755526783  559 PASSSPTNMTSDTPASSSPTNMT-SDTPASSS 589
Cdd:NF033609  878 PKNGTNASNKNEAKDSKEPLPDTgSEDEANTS 909
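
The ClfA description above mentions a long run of Ser-Asp dipeptide repeats, which is what the `Cdd:NF033609` rows of this alignment consist of. The Python sketch below shows one way to measure the longest uninterrupted run of `SD` dipeptides in an alignment row. The function name `longest_sd_run` and the short input string are hypothetical, chosen only for illustration.

```python
import re


def longest_sd_run(aligned_seq):
    """Return the number of consecutive 'SD' dipeptides in the longest run.

    aligned_seq: one row of an alignment (gaps as '-', any letter case).
    """
    # Remove alignment gaps and normalise case before searching.
    seq = aligned_seq.replace("-", "").upper()
    # Each match is one maximal stretch of back-to-back SD dipeptides.
    runs = re.findall(r"(?:SD)+", seq)
    return max((len(run) // 2 for run in runs), default=0)


# Illustrative fragment: one SD, an interruption (SA), then six SD in a row.
print(longest_sd_run("SDSASDSDSDSDSDSD"))  # 6
```

Applied to the query rows rather than the `Cdd` rows, the same count would show how little of the query's Ser/Pro/Thr-rich repeat region actually matches the Ser-Asp pattern.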
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
423-582 1.88e-06

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 52.40  E-value: 1.88e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN------SPPQGTSDTPG-FSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:PLN02217  508 EVQNTGPGAAITKRVTWPGIKKLSDEEILKFTPAQYiqgdawIPGKGVPYIPGlFAGNPGSTNSTPTGSAASSNTTFSSD 587
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  496 SSSppqvTSDTPASSSPPQVTSETPASSSpPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNM-TSDTPAS 574
Cdd:PLN02217  588 SPS----TVVAPSTSPPAGHLGSPPATPS-KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIkVASTESS 662

                  ....*...
gi 755526783  575 SSPTNMTS 582
Cdd:PLN02217  663 VSMVSMST 670
PHA03255 PHA03255
BDLF3; Provisional
425-588 2.48e-06

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 50.29  E-value: 2.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  425 TSETPASSSPTQVTSETpASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS-DTPASSSPPQVT 503
Cdd:PHA03255   25 TSSGSSTASAGNVTGTT-AVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPvPTTSNASTINVT 103
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  504 SDTPASSSppqVTSETPASSSPPQVTSDTSasisppQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSD 583
Cdd:PHA03255  104 TKVTAQNI---TATEAGTGTSTGVTSNVTT------RSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDER 174

                  ....*
gi 755526783  584 TPASS 588
Cdd:PHA03255  175 QPSLS 179
motB PRK12799
flagellar motor protein MotB; Reviewed
347-465 2.72e-06

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 51.64  E-value: 2.72e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  347 TSDTPASSSPPQGTldTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVS--SSPPQV 424
Cdd:PRK12799  296 HGTVPVAAVTPSSA--VTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVAlpAAEPVN 373
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 755526783  425 TSETPASSSPTQVTSE---TPASSSPTQVTSDTPASNSPPQGTS 465
Cdd:PRK12799  374 MQPQPMSTTETQQSSTgniTSTANGPTTSLPAAPASNIPVSPTS 417
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
377-670 3.12e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 51.91  E-value: 3.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  377 PASSSPPQGTS---ETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPtQVTSETPASSSPTQVTSD 453
Cdd:PRK07764  365 PSASDDERGLLarlERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAP-AAAPQPAPAPAPAPAPPS 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  454 TPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQ------------------- 514
Cdd:PRK07764  444 PAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAgaddaatlrerwpeilaav 523
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  515 ---VTSETPASSSPPQVTS----------DTSAS---ISPPQ---------------------VISDTPASSSPPQVTSE 557
Cdd:PRK07764  524 pkrSRKTWAILLPEATVLGvrgdtlvlgfSTGGLarrFASPGnaevlvtalaeelggdwqveaVVGPAPGAAGGEGPPAP 603
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  558 TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPL 637
Cdd:PRK07764  604 ASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP 683
                         330       340       350
                  ....*....|....*....|....*....|...
gi 755526783  638 GVISTHPQMSFQSSTSQQALDETAGERVPTIPD 670
Cdd:PRK07764  684 APAPAAPAAPAGAAPAQPAPAPAATPPAGQADD 716
PHA03255 PHA03255
BDLF3; Provisional
413-575 3.68e-06

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 49.90  E-value: 3.68e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  413 TATLVSSSPPQVTSETPASSSpTQVTSETPASSSPTQVTSDTPASNSPPQGTsdTPGFSSPTQVTTATLVSSSPPQVTSD 492
Cdd:PHA03255   20 TSLIWTSSGSSTASAGNVTGT-TAVTTPSPSASGPSTNQSTTLTTTSAPITT--TAILSTNTTTVTSTGTTVTPVPTTSN 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  493 TPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PHA03255   97 ASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQP 176

                  ...
gi 755526783  573 ASS 575
Cdd:PHA03255  177 SLS 179
motB PRK12799
flagellar motor protein MotB; Reviewed
477-607 3.81e-06

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 51.25  E-value: 3.81e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  477 TTATLVSSSPPqvTSDTPASSSPPQVTSDTPASSSPPqvtseTPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799  296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPSPAVIP-----SSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPA 368
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 755526783  557 ETPASSSPTNMTSDTPASSSPTNMtsdTPASSSPTNMTSDTPASSSPPWPV 607
Cdd:PRK12799  369 AEPVNMQPQPMSTTETQQSSTGNI---TSTANGPTTSLPAAPASNIPVSPT 416
PHA03247 PHA03247
large tegument protein UL36; Provisional
352-606 4.53e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.48  E-value: 4.53e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  352 ASSSPPQGTLDTPSSssPPQGTSDTPASSSPPQGTSETPAsnsPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS 431
Cdd:PHA03247  243 VISHPLRGDIAAPAP--PPVVGEGADRAPETARGATGPPP---PPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPP 317
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  432 SSPTQVTSETPASSSPTQVTSDTPASNSP-PQGTSDT--PGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSD--- 505
Cdd:PHA03247  318 PAPAGDAEEEDDEDGAMEVVSPLPRPRQHyPLGFPKRrrPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHAATPfar 397
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  506 TPASSSPPQVTSETPASSSPPqvtsdtsasiSPPQVISDTPASSSPPQVTSETPASSSPTNmtsdTPASSSPTNMTSDT- 584
Cdd:PHA03247  398 GPGGDDQTRPAAPVPASVPTP----------APTPVPASAPPPPATPLPSAEPGSDDGPAP----PPERQPPAPATEPAp 463
                         250       260
                  ....*....|....*....|..
gi 755526783  585 PASSSPTNMTSDTPASSSPPWP 606
Cdd:PHA03247  464 DDPDDATRKALDALRERRPPEP 485
PRK10856 PRK10856
cytoskeleton protein RodZ;
496-577 5.74e-06

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 50.02  E-value: 5.74e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  496 SSSPPQVTSDTPASSSPPQVTSETPASSSP--PQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PRK10856  168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                  ....
gi 755526783  574 SSSP 577
Cdd:PRK10856  248 AADP 251
PAP1 pfam08601
Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene ...
346-536 8.02e-06

Transcription factor PAP1; The transcription factor Pap1 regulates antioxidant-gene transcription in response to H2O2. This region is cysteine-rich. Alkylation of cysteine residues following treatment with a cysteine-alkylating agent can mask the accessibility of the nuclear exporter Crm1, triggering nuclear accumulation and Pap1-dependent transcriptional expression.


Pssm-ID: 369990  Cd Length: 363  Bit Score: 49.86  E-value: 8.02e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   346 GTSDTPASSSPPQGTLDTPSSSSPPqgtSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:pfam08601   31 QLSKAKQNTAKPGVRSDSRSPSPNA---STSTPDSQPPPSASSSTTPNQGSNGLNAFTGEDNNNYSNSAANPGATRGSTA 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQ--VTTATLVSSSPPQVTSDTPASSSPPQVT 503
Cdd:pfam08601  108 SSARSQSSPYSFGSGTSTSSDSPSSSSSSHQGQLSSCGTSPEPSTQSPGGqkSVETMIGEEQCAHGTIDGEKSFCAKLGM 187
                          170       180       190
                   ....*....|....*....|....*....|...
gi 755526783   504 SDTPASSSPPQVTSETPASSSPPQVTSDTSASI 536
Cdd:pfam08601  188 ACGNINNPIPAAMSKSNSLSNTPGHASNDSNGL 220
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
270-603 8.18e-06

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 50.68  E-value: 8.18e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  270 SDT-PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTS 348
Cdd:NF033609  561 SDSdPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 640
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  349 DTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:NF033609  641 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 720
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PA-SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:NF033609  721 DSdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 800
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:NF033609  801 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKN 880
                         330
                  ....*....|....*.
gi 755526783  588 SSPTNMTSDTPASSSP 603
Cdd:NF033609  881 GTNASNKNEAKDSKEP 896
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
473-638 8.91e-06

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 50.05  E-value: 8.91e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   473 PTQVTTAtlvsssppQVTSDTPASSSPPQVTSDTPASSSPPQ----VTSETPASSSPPQVTSDTSASISPPQVISDTPAS 548
Cdd:pfam05539  169 KTAVTTS--------KTTSWPTEVSHPTYPSQVTPQSQPATQghqtATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQ 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   549 SSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSdTPASSSPPWPVITEVTRPESTIPAGRSLANIT 628
Cdd:pfam05539  241 RGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTA-TPPPTTKRQETGRPTPRPTATTQSGSSPPHSS 319
                          170
                   ....*....|
gi 755526783   629 SKAQEDSPLG 638
Cdd:pfam05539  320 PPGVQANPTT 329
PHA03377 PHA03377
EBNA-3C; Provisional
203-657 9.60e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 50.44  E-value: 9.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQ---VTSATSASSSPPQGTSDTPASSSPP 279
Cdd:PHA03377  450 PERPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPPSRRRRgacVVYDDDIIEVIDVETTEEEESVTQP 529
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  280 QVTSATSASSSPPQGTSD--------TPASSSPPQVTSATSASSSPPQGTSDTPASSspPQVTSATSASSSPPQGTSDTP 351
Cdd:PHA03377  530 AKPHRKVQDGFQRSGRRQkratppkvSPSDRGPPKASPPVMAPPSTGPRVMATPSTG--PRDMAPPSTGPRQQAKCKDGP 607
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  352 ASSSPPQgtlDTPSSSSP-----------------PQGTSDTPAS------SSPPQGTSETPASNSPPQGTSETPGFSSP 408
Cdd:PHA03377  608 PASGPHE---KQPPSSAPrdmapsvvrmflrerllEQSTGPKPKSfwemraGRDGSGIQQEPSSRRQPATQSTPPRPSWL 684
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  409 PQVTTATLVSSSPPQVTSETPASS-SPTQVTS--ETPASSSPTQVT--SDTPASNSPPQGTSDTPGFSSP--TQVTTATL 481
Cdd:PHA03377  685 PSVFVLPSVDAGRAQPSEESHLSSmSPTQPISheEQPRYEDPDDPLdlSLHPDQAPPPSHQAPYSGHEEPqaQQAPYPGY 764
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  482 VSSSPPQV--------------TSDTPASSSPPQVTSDTPASSSPPQVTSETPAsSSPPQVTSDTSASISPPQviSDTPA 547
Cdd:PHA03377  765 WEPRPPQApylgyqepqaqgvqVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPG-HGHPQGPWAPRPPHLPPQ--WDGSA 841
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  548 SSSPPQVTSETPASSSPTNMTSDTpaSSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPvitevTR-PESTIPAGRSLAn 626
Cdd:PHA03377  842 GHGQDQVSQFPHLQSETGPPRLQL--SQVPQLPYSQTLVSSSAPSWSSPQPRAPIRPIP-----TRfPPPPMPLQDSMA- 913
                         490       500       510
                  ....*....|....*....|....*....|.
gi 755526783  627 itskAQEDSPLgviSTHPQMSFQSSTSQQAL 657
Cdd:PHA03377  914 ----VGCDSSG---TACPSMPFASDYSQGAF 937
GPS smart00303
G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin ...
1040-1079 9.86e-06

G-protein-coupled receptor proteolytic site domain; Present in latrophilin/CL-1, sea urchin REJ and polycystin.


Pssm-ID: 197639  Cd Length: 49  Bit Score: 43.92  E-value: 9.86e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 755526783   1040 TQCYFWDRYNRTWKSDGCQVGPKS-TIlkTQCLCDHLTFFS 1079
Cdd:smart00303    3 PICVFWDESSGEWSTRGCELLETNgTH--TTCSCNHLTTFA 41
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
203-598 9.94e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 50.03  E-value: 9.94e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSppqvtsatsASSSPPQ-----GTSDTPASSS 277
Cdd:COG5164    21 AGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQ---------GATGPAQnqggtTPAQNQGGTR 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  278 PPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSP--PQVTSATSASSSPPQGTSDTPASSS 355
Cdd:COG5164    92 PAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPpgPGSTGPGGSTTPPGDGGSTTPPGPG 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  356 PPQGTLDTPSSSSPPQGTSDTPassSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATlvsSSPPQVTSETPASSSPT 435
Cdd:COG5164   172 GSTTPPDDGGSTTPPNKGETGT---DIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKT---GPKDQRPKTNPIERRGP 245
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  436 QVTSETPASSSPTQvTSDTPASNSPPQGTSDTPgfSSPTQVTTATL---VSSSPPQVTSDTPASSSPPQVTSDTPASSSP 512
Cdd:COG5164   246 ERPEAAALPAELTA-LEAENRAANPEPATKTIP--ETTTVKDLATVlgkKGSDLVTNLMKKGKGTNINAALDFETAATIA 322
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  513 P--QVTSETPASSSP-PQVTSDTSASISPPQVISDTPASSSPPQVTSetpASSSPTNMTSDTPASSSPTNMTSDTPAS-- 587
Cdd:COG5164   323 LegNVITEKEIEADImETVTTEEQETDSLLEETPPVPVVMGHVDHGK---TSLLDAIRHSDVTDGEVGTISQHIGAYTvq 399
                         410
                  ....*....|..
gi 755526783  588 -SSPTNMTSDTP 598
Cdd:COG5164   400 iAGTPITFLDTP 411
PRK13914 PRK13914
invasion associated endopeptidase;
417-602 1.00e-05

invasion associated endopeptidase;


Pssm-ID: 237555 [Multi-domain]  Cd Length: 481  Bit Score: 49.80  E-value: 1.00e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  417 VSSSPPQVTSETPASSSPTQVTsetPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATL--------------- 481
Cdd:PRK13914  143 VTSTPVAPTQEVKKETTTQQAA---PAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVksgdtiwalsvkygv 219
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  482 ----------VSSSPPQVTSD----TPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPA 547
Cdd:PRK13914  220 svqdimswnnLSSSSIYVGQKlaikQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPT 299
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  548 SSS----PPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSS 602
Cdd:PRK13914  300 EAAkpapAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSS 358
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
224-422 1.08e-05

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 49.75  E-value: 1.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  224 TGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSS 303
Cdd:COG3469    12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  304 PPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS--SSPPQGTSDTPASSS 381
Cdd:COG3469    92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSgtETATGGTTTTSTTTT 171
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 755526783  382 PPQGTSETPASNSPPQGTSETPGFSSPP-QVTTATLVSSSPP 422
Cdd:COG3469   172 TTSASTTPSATTTATATTASGATTPSATtTATTTGPPTPGLP 213
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
349-601 1.39e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.91  E-value: 1.39e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  349 DTPASSSPPqgtlDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:NF033609  540 DKPVVPEQP----DEPGEIEPiPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASD 615
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  428 TPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:NF033609  616 SDSASDSDSASDSDSASDSDSASDSD---SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 692
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASS-SPTNMTSDTPA 586
Cdd:NF033609  693 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDS 772
                         250
                  ....*....|....*.
gi 755526783  587 -SSSPTNMTSDTPASS 601
Cdd:NF033609  773 dSDSDSDSDSDSDSDS 788
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
319-547 1.44e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 49.49  E-value: 1.44e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  319 QGTSDTPASSSPPQVTSATSASSSPPQGTSdTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG 398
Cdd:PRK12323  368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAP-APAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGG 446
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  399 TSETPgfSSPPQVTTATlvsSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPtqvTT 478
Cdd:PRK12323  447 APAPA--PAPAAAPAAA---ARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDA---AP 518
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  479 ATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPqvtsDTSASISPPQVISDTPA 547
Cdd:PRK12323  519 AGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPRPP----RASASGLPDMFDGDWPA 583
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
356-586 1.46e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 49.49  E-value: 1.46e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  356 PPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT 435
Cdd:PRK12323  365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  436 QVTSETPASSSPTQVTSDTPASnSPPQGTSDTPGFSSPTQVTTATlvssSPPQVTSDTPASSSPPQVTSDTPASSSPPQV 515
Cdd:PRK12323  445 GGAPAPAPAPAAAPAAAARPAA-AGPRPVAAAAAAAPARAAPAAA----PAPADDDPPPWEELPPEFASPAPAQPDAAPA 519
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 755526783  516 TSETPASSSPPQVTSDTSASISPPQvisdtPASSSPPQVTSETPASSSPTnmTSDTPASSSPTNMTSDTPA 586
Cdd:PRK12323  520 GWVAESIPDPATADPDDAFETLAPA-----PAAAPAPRAAAATEPVVAPR--PPRASASGLPDMFDGDWPA 583
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
485-636 1.73e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 49.32  E-value: 1.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  485 SPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSE---TPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPAS 561
Cdd:PRK08691  379 SPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAaamPSEGKTAGPVSNQENNDVPPWE---DAPDEAQTAAGTAQTSAK 455
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 755526783  562 SSPTNMTSDTPASSSPT-NMTSDTPASSSPTNMTSDTPASSSP-PWPVITEVTRPESTIPAgrslANITSKAQEDSP 636
Cdd:PRK08691  456 SIQTASEAETPPENQVSkNKAADNETDAPLSEVPSENPIQATPnDEAVETETFAHEAPAEP----FYGYGFPDNDCP 528
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
439-631 1.92e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, shared with other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell-wall-anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.52  E-value: 1.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  439 SETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQvtSDTPASSSPPQVTSE 518
Cdd:NF033609   33 SSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSSNTNNGETSVAQNPAQ--QETTQSASTNATTEE 110
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  519 TPASSSPPQVTSDTSASISPPQViSDTPASSSPPQVTSETpaSSSPTNMTSDTpasSSPTN--------MTSDTPASSSP 590
Cdd:NF033609  111 TPVTGEATTTATNQANTPATTQS-SNTNAEELVNQTSNET--TSNDTNTVSSV---NSPQNstnaenvsTTQDTSTEATP 184
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 755526783  591 TNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKA 631
Cdd:NF033609  185 SNNESAPQSTDASNKDVVNQAVNTSAPRMRAFSLAAVAADA 225
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
371-484 1.98e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 49.32  E-value: 1.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  371 QGTSDTPASSSP--PQGTSETPAS-NSPPQGTSeTPGFSSPPQVTTATLV--SSSPPQVTSETPASSSPTQVTSETPASS 445
Cdd:PLN02217  545 QGDAWIPGKGVPyiPGLFAGNPGStNSTPTGSA-ASSNTTFSSDSPSTVVapSTSPPAGHLGSPPATPSKIVSPSTSPPA 623
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 755526783  446 SPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSS 484
Cdd:PLN02217  624 SHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESS 662
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
203-665 2.08e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 48.87  E-value: 2.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPA----SSSPPQvtsatsasssppQGTSDTPASSSp 278
Cdd:COG5164     3 LYGPGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAqnqgSTTPAG------------NTGGTRPAGNQ- 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  279 pqvtsatsASSSPPQ-----GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPAS 353
Cdd:COG5164    70 --------GATGPAQnqggtTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGG 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  354 SSPPQGTLDTPSSSSPPQGTSdtpASSSPPQGTSETPASNSPPQGTSETPGFSSPPqvttatlvsssPPQVTSETPASSS 433
Cdd:COG5164   142 STPPGPGSTGPGGSTTPPGDG---GSTTPPGPGGSTTPPDDGGSTTPPNKGETGTD-----------IPTGGTPRQGPDG 207
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  434 PTQVTSETPASSSPTQVTSDTPAsnSPPQGTSDTPGFSSPTQVTTATLVSSSPPQvtSDTPASSSPPQVTSDTPASSSP- 512
Cdd:COG5164   208 PVKKDDKNGKGNPPDDRGGKTGP--KDQRPKTNPIERRGPERPEAAALPAELTAL--EAENRAANPEPATKTIPETTTVk 283
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  513 PQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPP--QVTSETPASSSPT-NMTSDTPASSSPTNMTSDTPasss 589
Cdd:COG5164   284 DLATVLGKKGSDLVTNLMKKGKGTNINAALDFETAATIALegNVITEKEIEADIMeTVTTEEQETDSLLEETPPVP---- 359
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  590 PTNMTSDTPASSSPPWPVITEVTRPESTIP-----AGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQALDETAGER 664
Cdd:COG5164   360 VVMGHVDHGKTSLLDAIRHSDVTDGEVGTIsqhigAYTVQIAGTPITFLDTPGFESFTAMAMRVAQITDIAILVVAADDG 439

                  .
gi 755526783  665 V 665
Cdd:COG5164   440 D 440
rne PRK10811
ribonuclease E; Reviewed
474-620 2.12e-05

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 49.27  E-value: 2.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  474 TQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSP----------PQVTSETPASSSPPQVTSDTSASIS-----P 538
Cdd:PRK10811  848 VRPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPvveavaevveEPVVVAEPQPEEVVVVETTHPEVIAapvteQ 927
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  539 PQVISDTPASssppqVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTI 618
Cdd:PRK10811  928 PQVITESDVA-----VAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVA 1002

                  ..
gi 755526783  619 PA 620
Cdd:PRK10811 1003 PA 1004
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
239-551 2.69e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 48.69  E-value: 2.69e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  239 PPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsatsassspp 318
Cdd:PRK07003  360 PAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAP------------- 426
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  319 qgtsdtPASSSPPQvtsatsasssppQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG 398
Cdd:PRK07003  427 ------PAAPAPPA------------TADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDA 488
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  399 TSEtpgfSSPPqvttATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtSDTPGFSSPTQV-T 477
Cdd:PRK07003  489 AFE----PAPR----AAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPA---ARAGGAAAALDVlR 557
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 755526783  478 TATLVSSSPPQVTSDTPASSSPPQVTSDTPAsssPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSP 551
Cdd:PRK07003  558 NAGMRVSSDRGARAAAAAKPAAAPAAAPKPA---APRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
PPE COG5651
PPE-repeat protein [Function unknown];
363-587 2.85e-05

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 48.35  E-value: 2.85e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  363 TPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGtsetpGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:COG5651   165 TPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANL-----GLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAG 239
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  443 ASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPAS 522
Cdd:COG5651   240 AAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAA 319
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783  523 SSPP-QVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPAS 587
Cdd:COG5651   320 GATGaGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
203-502 2.88e-05

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 48.42  E-value: 2.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   203 PIDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVT 282
Cdd:pfam17823  153 NASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVG 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   283 SATSASssppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPpQVTSATSASSSPPQGTSDTPASSSPPQgtld 362
Cdd:pfam17823  233 NSSPAA-----GTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDP-HARRLSPAKHMPSDTMARNPAAPMGAQ---- 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   363 tpSSSSPPQGTSDTPASSSPPQGTSEtpasnspPQGTSETPGFSSPPQVTTATLVSSSPPQvTSETPASSSPTQVTSETP 442
Cdd:pfam17823  303 --AQGPIIQVSTDQPVHNTAGEPTPS-------PSNTTLEPNTPKSVASTNLAVVTTTKAQ-AKEPSASPVPVLHTSMIP 372
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783   443 --ASSSPTQVTSDTPasnsPPQGTSDTPGFSSPTQVTT-ATLVSSSppqvTSDTPASSSPPQV 502
Cdd:pfam17823  373 evEATSPTTQPSPLL----PTQGAAGPGILLAPEQVATeATAGTAS----AGPTPRSSGDPKT 427
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
320-499 3.15e-05

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 48.24  E-value: 3.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   320 GTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPasnspPQGT 399
Cdd:pfam13254  190 ASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPAPTSASEP-----PPKT 264
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   400 SETPGFSSPPQVTTaTLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTA 479
Cdd:pfam13254  265 KELPKDSEEPAAPS-KSAEASTEKKEPDTESSPETSSEKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQSPPKDFRA 343
                          170       180
                   ....*....|....*....|
gi 755526783   480 TLVSSsppQVTSDTPASSSP 499
Cdd:pfam13254  344 NLRSR---EVPKDKSKKDEP 360
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
479-668 3.17e-05

Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 48.42  E-value: 3.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   479 ATLVSSSPPQVTSDTPASS-SPPQVTSDTpassSPPQVTSETPASSSPpqvTSDTSASISPPqvisdTPASSSPPQVTSE 557
Cdd:pfam17823   62 AATAAPAPVTLTKGTSAAHlNSTEVTAEH----TPHGTDLSEPATREG---AADGAASRALA-----AAASSSPSSAAQS 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   558 TPASSS-PTNMTSDTPASSSPTnmTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAqedsP 636
Cdd:pfam17823  130 LPAAIAaLPSEAFSAPRAAACR--ANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSA----P 203
                          170       180       190
                   ....*....|....*....|....*....|..
gi 755526783   637 LGVISTHPQMSFQSSTSQQALDeTAGERVPTI 668
Cdd:pfam17823  204 ATLTPARGISTAATATGHPAAG-TALAAVGNS 234
PHA03377 PHA03377
EBNA-3C; Provisional
167-574 3.39e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 48.51  E-value: 3.39e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  167 KRRGAETERHMIP-----GNGPPLAMCHQPAPPElfetlcfpIDPASSAPPKATHRMTITSLTGRPQVTSdtLASSSPPQ 241
Cdd:PHA03377  541 QRSGRRQKRATPPkvspsDRGPPKASPPVMAPPS--------TGPRVMATPSTGPRDMAPPSTGPRQQAK--CKDGPPAS 610
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  242 GTSD-TPASSSP----PQVTSATSASSSPPQGTSDTP-------ASSSPPQVTSATSASSSPPQgTSDTPASSSPPQVTS 309
Cdd:PHA03377  611 GPHEkQPPSSAPrdmaPSVVRMFLRERLLEQSTGPKPksfwemrAGRDGSGIQQEPSSRRQPAT-QSTPPRPSWLPSVFV 689
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  310 ATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLdtPSSSSPPQGTSDTPASSSPPQGTSET 389
Cdd:PHA03377  690 LPSVDAGRAQPSEESHLSSMSPTQPISHEEQPRYEDPDDPLDLSLHPDQAPP--PSHQAPYSGHEEPQAQQAPYPGYWEP 767
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  390 PASNSPPQGTSETPGfssppqvtTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:PHA03377  768 RPPQAPYLGYQEPQA--------QGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGHGHPQGPWAPRPPHLPPQWDG 839
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  470 FSSPTQvttaTLVSSSP---PQVTSDTPASSSPPQVT-SDTPASSSPPqvtsetPASSSPPQvtsdtsASISPpqVISDT 545
Cdd:PHA03377  840 SAGHGQ----DQVSQFPhlqSETGPPRLQLSQVPQLPySQTLVSSSAP------SWSSPQPR------APIRP--IPTRF 901
                         410       420
                  ....*....|....*....|....*....
gi 755526783  546 PASSSPPQVTSETPASSSPTNMTSDTPAS 574
Cdd:PHA03377  902 PPPPMPLQDSMAVGCDSSGTACPSMPFAS 930
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
508-595 3.62e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 48.31  E-value: 3.62e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  508 ASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDT-PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPA 586
Cdd:PRK11907   18 LTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEART 97

                  ....*....
gi 755526783  587 SSSPTNMTS 595
Cdd:PRK11907   98 VTPAATETS 106
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
403-558 4.32e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 48.16  E-value: 4.32e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  403 PGFSSPPQVTTATLVSSSPPQVTSETPASSspTQVTSETPASSSPTQ---VTSDTPASNSPPQGTSdTPGFSSPTQVTTA 479
Cdd:PLN02217  514 PGAAITKRVTWPGIKKLSDEEILKFTPAQY--IQGDAWIPGKGVPYIpglFAGNPGSTNSTPTGSA-ASSNTTFSSDSPS 590
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  480 TLV--SSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPqVTSDTSASISPPQVISDTPASSSPPQVTSE 557
Cdd:PLN02217  591 TVVapSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTPSSPESS-IKVASTETASPESSIKVASTESSVSMVSMS 669

                  .
gi 755526783  558 T 558
Cdd:PLN02217  670 T 670
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
534-633 4.36e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 48.31  E-value: 4.36e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  534 ASISPPQVISDTPASSSPPQvTSETPASSSPTnmtsDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTR 613
Cdd:PRK11907   18 LTASNPKLAQAEEIVTTTPA-TSTEAEQTTPV----ESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTT 92
                          90       100
                  ....*....|....*....|
gi 755526783  614 PEStiPAGRSLANITSKAQE 633
Cdd:PRK11907   93 SEA--RTVTPAATETSKPVE 110
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
413-520 4.66e-05

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 47.43  E-value: 4.66e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  413 TATLVSSSPPQVTSETPASSSPTQV--TSETPASSSPTQVTSDTPASNsppQGTSDTPGFSSPTQVTTATLVSSSPPQVT 490
Cdd:PRK13335   55 TAGANSATTQAANTRQERTPKLEKApnTNEEKTSASKIEKISQPKQEE---QKSLNISATPAPKQEQSQTTTESTTPKTK 131
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 755526783  491 SDTPASSSPPQ----VTSDTPASSSPPQVTSE-TP 520
Cdd:PRK13335  132 VTTPPSTNTPQpmqsTKSDTPQSPTIKQAQTDmTP 166
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
487-666 5.26e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.92  E-value: 5.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  487 PQVTSDT-PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PRK07003  360 PAVTGGGaPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRG 439
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  566 N--MTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTH 643
Cdd:PRK07003  440 DdaADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASRE 519
                         170       180
                  ....*....|....*....|...
gi 755526783  644 PQMSFQSSTSQQALDETAGERVP 666
Cdd:PRK07003  520 DAPAAAAPPAPEARPPTPAAAAP 542
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
395-532 5.26e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 47.78  E-value: 5.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  395 PPQGTSETPG-FSSPPQVTTATLVSSSPPQVTSETpaSSSPTqvTSETPASSsptqvtsdtpasnsPPQGTSDTPGFSSP 473
Cdd:PLN02217  551 PGKGVPYIPGlFAGNPGSTNSTPTGSAASSNTTFS--SDSPS--TVVAPSTS--------------PPAGHLGSPPATPS 612
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  474 TQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPaSSSPPQVTSETPASSSPPQVTSDT 532
Cdd:PLN02217  613 KIVSPSTSPPASHLGSPSTTPSSPESSIKVASTE-TASPESSIKVASTESSVSMVSMST 670
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
476-616 5.35e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 5.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  476 VTTATLVSSSPPQVTSDTPASSSPPQvtsDTPASSSPPQVTSETPASSSPPQvtsdtsASISPPQVISDTPASSSPPQVT 555
Cdd:PRK14950  355 VIEALLVPVPAPQPAKPTAAAPSPVR---PTPAPSTRPKAAAAANIPPKEPV------RETATPPPVPPRPVAPPVPHTP 425
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 755526783  556 SETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASssppWPVITEVTRPES 616
Cdd:PRK14950  426 ESAPKLTRAAIPVDEKPKYTPPAPPKEEEKALIADGDVLEQLEAI----WKQILRDVPPRS 482
motB PRK12799
flagellar motor protein MotB; Reviewed
477-595 5.42e-05

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 47.40  E-value: 5.42e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  477 TTATLVSSSPPQVTSDTPASSSPPqvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799  307 SSAVTQSSAITPSSAAIPSPAVIP-----SSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMST 381
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 755526783  557 ETPASSSPTNMtsdTPASSSPTNMTSDTPASSSPTNMTS 595
Cdd:PRK12799  382 TETQQSSTGNI---TSTANGPTTSLPAAPASNIPVSPTS 417
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
448-585 5.89e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 47.46  E-value: 5.89e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  448 TQVTSDTPASNSPPQGTSdtPGFS---SPTQVTTATLVSSSPPQvTSDTPASSSPPQVTsdtPASSSPPQVtSETPASSS 524
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIK--PVFTqpaAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSAT---QPAGTPPTV-SVDPPAAV 435
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 755526783  525 PPQVTSDTSASISPPQVISDTPASSSppQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:PRK14971  436 PVNPPSTAPQAVRPAQFKEEKKIPVS--KVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQK 494
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
472-564 5.98e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 5.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  472 SPTQVTTATLVSSSPPQVTsDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAsisPPQVISDTPASSSP 551
Cdd:PRK14950  362 PVPAPQPAKPTAAAPSPVR-PTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPH---TPESAPKLTRAAIP 437
                          90
                  ....*....|...
gi 755526783  552 PQVTSETPASSSP 564
Cdd:PRK14950  438 VDEKPKYTPPAPP 450
PHA02682 PHA02682
ORF080 virion core protein; Provisional
437-634 6.18e-05

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 46.39  E-value: 6.18e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  437 VTSETPAS--SSPTQVTSDTPASNSPPQGTSDTPG----------FSSPTQVTTATLVSSSPPQVTSDTPASSSP-PQVT 503
Cdd:PHA02682   17 VLADTSSSlfTKCPQATIPAPAAPCPPDADVDPLDkysvkeagryYQSRLKANSACMQRPSGQSPLAPSPACAAPaPACP 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  504 SDTPASSSPPqVTSETPASSSPPQvtsdTSASISPPQVISDT--PASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMT 581
Cdd:PHA02682   97 ACAPAAPAPA-VTCPAPAPACPPA----TAPTCPPPAVCPAParPAPACPPSTRQCPPAPPLPTPKPAPAAKPIFLHNQL 171
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 755526783  582 S--DTPASSSPTNMTSdtPASSsppwPVItEVTRPESTIPAGRSLANITSKAQED 634
Cdd:PHA02682  172 PppDYPAASCPTIETA--PAAS----PVL-EPRIPDKIIDADNDDKDLIKKELAD 219
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
405-521 6.50e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 47.54  E-value: 6.50e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  405 FSSPPQVTTATLVSSSPPQVTSETPASssptqvtseTPASSSPTQVTSDTPASNSppqGTSDTPGFSSPTQVTTATLVSS 484
Cdd:PRK11907    6 FSKSAVALTLALLTASNPKLAQAEEIV---------TTTPATSTEAEQTTPVESD---ATEEADNTETPVAATTAAEAPS 73
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 755526783  485 SPPQVTSDTPASSSPPQVTSDTPasSSPPQVTSETPA 521
Cdd:PRK11907   74 SSETAETSDPTSEATDTTTSEAR--TVTPAATETSKP 108
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
367-517 6.92e-05

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting that it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 45.33  E-value: 6.92e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   367 SSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSS 446
Cdd:pfam09595   32 SLILIGESNKEAALIITDIIDININKQHPEQEHHENPPLNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKTK 111
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755526783   447 PTQV----TSDTPASNSPPQGTSdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTS 517
Cdd:pfam09595  112 PSEHepanPPDASNRLSPPDAST-----AAIREARTFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSS 181
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
414-621 7.03e-05

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, for correct positioning of the mitotic spindle, and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 47.52  E-value: 7.03e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQV-----TSDTPASNSP-----PQGTSDTPGFSSPTQVTTATLVS 483
Cdd:pfam08580  422 ATLVANKTPGSSPPSSVIMTPVNKGSKTPSSRRGSSFdfgssSERVINSKLRresklPQIASTLKQTKRPSKIPRASPNH 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   484 SSPPQVTSDTPASSSPpqvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSS 563
Cdd:pfam08580  502 SGFLSTPSNTATSETP------TPALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHNFKPLTLTTPSPTPSRSSRSSSTL 575
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783   564 PTNMTSDTPASSSPT-NMTSDTPASSSPTNMTSDTPASSSPPWPVITEVTRPESTIPAG 621
Cdd:pfam08580  576 PPVSPLSRDKSRSPApTCRSVSRASRRRASRKPTRIGSPNSRTSLLDEPPYPKLTLSKG 634
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
470-569 7.20e-05

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 47.54  E-value: 7.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  470 FSSPTQVTTATLVSSSPPQVTSDTPASSSPPqvTSDTPASSSPPQVTSET---PASSSPPQVTSDTSASISPPQVISDTP 546
Cdd:PRK11907    6 FSKSAVALTLALLTASNPKLAQAEEIVTTTP--ATSTEAEQTTPVESDATeeaDNTETPVAATTAAEAPSSSETAETSDP 83
                          90       100
                  ....*....|....*....|...
gi 755526783  547 ASSSPPQVTSETPASSSPTNMTS 569
Cdd:PRK11907   84 TSEATDTTTSEARTVTPAATETS 106
PHA03255 PHA03255
BDLF3; Provisional
363-532 7.31e-05

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 46.05  E-value: 7.31e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  363 TPSSSSPPQGTSDTPASSSppqgTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:PHA03255   25 TSSGSSTASAGNVTGTTAV----TTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTI 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  443 ASSspTQVTSDTPASNSPPQGTsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTpaSSSPPQVTSETPAS 522
Cdd:PHA03255  101 NVT--TKVTAQNITATEAGTGT------STGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKG--TSNATKTTAELPTV 170
                         170
                  ....*....|
gi 755526783  523 SSPPQVTSDT 532
Cdd:PHA03255  171 PDERQPSLSY 180
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
468-628 7.52e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 47.39  E-value: 7.52e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  468 PGFSSPTQVTTATLVSSSPPQVTSDTPASssppQVTSDT--PASSSP--PQVTSETPASSsppqvTSDTSASISPpqviS 543
Cdd:PLN02217  514 PGAAITKRVTWPGIKKLSDEEILKFTPAQ----YIQGDAwiPGKGVPyiPGLFAGNPGST-----NSTPTGSAAS----S 580
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  544 DTPASSSPPQvTSETPASSSPTNMTSDTPASSSpTNMTSDTPASSSPTNMTSDTPAS-SSPPWPVITEVTRPESTIPAGR 622
Cdd:PLN02217  581 NTTFSSDSPS-TVVAPSTSPPAGHLGSPPATPS-KIVSPSTSPPASHLGSPSTTPSSpESSIKVASTETASPESSIKVAS 658

                  ....*.
gi 755526783  623 SLANIT 628
Cdd:PLN02217  659 TESSVS 664
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
444-651 8.79e-05

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteristic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 46.32  E-value: 8.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   444 SSSPTQVTSDTPASNSPPqgTSDTPGFSSPTQVTTATLVSSSPPqVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASS 523
Cdd:pfam12287   32 SAQPPSQSPDLSQMVCPP--ASPEQRLSQQSDVLQQPEQTQVSP-VSPSSNACASSGSEYQFHTSEPPQPEAIDPIQSSM 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   524 SPPQVTSDTSASISPpqvisdtpasSSPPQVTSETPASSSPTNMTSdTPASSSPT--NMTSDTP-ASSSPTNMTSDTPAS 600
Cdd:pfam12287  109 SLPSELAPPSPPLSP----------ASQPQVFQSKPASSSGINVNA-APFQSMQTvfNVNAPVPpRNEQELKESSQYSSG 177
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 755526783   601 SSPPWPVITEVTRPESTIPAGRSLANITSKAQEDSPLGVISTHpqMSFQSS 651
Cdd:pfam12287  178 YNQSFSSQSTQTVPQCQLPSEQLEQTVVGAYHPDGTIQVSNGH--LAFYPA 226
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
225-461 9.20e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 47.15  E-value: 9.20e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  225 GRPQVTSDTLASSSPPQGTSDTPASSSP-PQVTSATSASSSPPQGTSDTPASSSppQVTSATSASSSPPQGTSDTPASSS 303
Cdd:PRK07003  383 PGARAAAAVGASAVPAVTAVTGAAGAALaPKAAAAAAATRAEAPPAAPAPPATA--DRGDDAADGDAPVPAKANARASAD 460
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  304 PPqvTSATSASSSPPQGTSDTPASSSPPQVTSATSasssppqgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPP 383
Cdd:PRK07003  461 SR--CDERDAQPPADSGSASAPASDAPPDAAFEPA--------PRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAP 530
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  384 QGTSETPASNSPP---------------------QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETP 442
Cdd:PRK07003  531 EARPPTPAAAAPAaraggaaaaldvlrnagmrvsSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPP 610
                         250
                  ....*....|....*....
gi 755526783  443 ASSSPTQVTSDTPASnSPP 461
Cdd:PRK07003  611 NGAARAEQAAESRGA-PPP 628
PLAT_LOX cd01753
PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they ...
1152-1253 9.35e-05

PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they catalyze enzymatic lipid peroxidation in complex biological structures via direct dioxygenation of phospholipids and cholesterol esters of biomembranes and plasma lipoproteins. Both types of enzymes are cytosolic but need this domain to access their sequestered membrane or micelle bound substrates.


Pssm-ID: 238851  Cd Length: 113  Bit Score: 43.07  E-value: 9.35e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783 1152 YLIQVYTGYRRRAATTAKVVITLYGSEGHSEPhHLCDPEKTVFERGALDVFLLSTGSWLGDLHGLRLWHDNSGDSPSWYV 1231
Cdd:cd01753     3 YKVTVATGSSLFAGTDDYIYLTLVGTAGESEK-QLLDRPGYDFERGAVDEYKVKVPEDLGELLLVRLRKRKYLLFDAWFC 81
                          90       100
                  ....*....|....*....|..
gi 755526783 1232 SQVIVSDmTTRKKWHFQCNCWL 1253
Cdd:cd01753    82 NYITVTG-PGGDEYHFPCYRWI 102
PLAT_RAB6IP1 cd01757
PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains ...
1210-1262 9.79e-05

PLAT/LH2 domain present in RAB6 interacting protein 1 (Rab6IP1)_like family. PLAT/LH2 domains consist of an eight stranded beta-barrel. In Rab6IP1 this domain may participate in lipid-mediated modulation of Rab6IP1's function via its generally proposed function of mediating interaction with lipids or membrane bound proteins.


Pssm-ID: 238855  Cd Length: 114  Bit Score: 43.30  E-value: 9.79e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783 1210 LGDLHGLRLWHDNSGDSPSWYVSQVIVSDMTTRKKWHFQCNCWL-------AVDLGNCER 1262
Cdd:cd01757    52 LGKLTTVQIGHDNSGLLAKWLVEYVMVRNEITGHTYKFPCGRWLgegvddgNGEDGSLER 111
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
482-620 1.00e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 46.78  E-value: 1.00e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  482 VSSSPPQVTSDTPASssppQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PRK07994  367 EPEVPPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSE 442
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783  562 SSPTNMTsdTPASSSPTNMTSDTPASSSPTNMTSDTPA----SSSPPWPVITEVTRPESTIPA 620
Cdd:PRK07994  443 PAAASRA--RPVNSALERLASVRPAPSALEKAPAKKEAyrwkATNPVEVKKEPVATPKALKKA 503
PHA03255 PHA03255
BDLF3; Provisional
347-511 1.07e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 45.28  E-value: 1.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  347 TSDTPASSSppqgtldTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTseTPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:PHA03255   25 TSSGSSTAS-------AGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITT--TAILSTNTTTVTSTGTTVTPVPTTS 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  427 ETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTpgfSSPTQVTTATLVSSSPPQVTSD--TPASSSPPQVTS 504
Cdd:PHA03255   96 NASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTT---SATTRITNATTLAPTLSSKGTSnaTKTTAELPTVPD 172

                  ....*..
gi 755526783  505 DTPASSS 511
Cdd:PHA03255  173 ERQPSLS 179
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
411-549 1.10e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 46.73  E-value: 1.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  411 VTTATLVSSSPPQvtSETPASSSPTQVTsETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATlvsSSPPQVT 490
Cdd:PRK14950  355 VIEALLVPVPAPQ--PAKPTAAAPSPVR-PTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPV---PHTPESA 428
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  491 SDTPASSSPPQVtsdTPASSSPPQVTSETPASSSPPQVTSDTSASIspPQVISDTPASS 549
Cdd:PRK14950  429 PKLTRAAIPVDE---KPKYTPPAPPKEEEKALIADGDVLEQLEAIW--KQILRDVPPRS 482
KAR9 pfam08580
Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal ...
322-589 1.10e-04

Yeast cortical protein KAR9; The KAR9 protein in Saccharomyces cerevisiae is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules. KAR9 localizes at the shmoo tip in mating cells and at the tip of the growing bud in anaphase.


Pssm-ID: 430088 [Multi-domain]  Cd Length: 684  Bit Score: 46.75  E-value: 1.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   322 SDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQG-TS 400
Cdd:pfam08580  427 NKTPGSSPPSS--------------VIMTPVNKGSKTPSSRRGSSFDFGSSSERVINSKLRRESKLPQIASTLKQTKrPS 492
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   401 ETPGFSSPPQVTtatlVSSSPPQVTSETPA-SSSPTQVTSETPASSSPTQVTSDTPASNSPPQ-GTSDTPGFSSPTQVTT 478
Cdd:pfam08580  493 KIPRASPNHSGF----LSTPSNTATSETPTpALRPPSRPQPPPPGNRPRWNASTNTNDLDVGHnFKPLTLTTPSPTPSRS 568
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   479 ATLVSSSPPQVTSDTPASSSPpqVTSDTPASSSPPQVTSETPASSSPPQvtsdtsasiSPPQVISDTPASSSP-PQVTSE 557
Cdd:pfam08580  569 SRSSSTLPPVSPLSRDKSRSP--APTCRSVSRASRRRASRKPTRIGSPN---------SRTSLLDEPPYPKLTlSKGLPR 637
                          250       260       270
                   ....*....|....*....|....*....|..
gi 755526783   558 TPASSSPTNMTSDTPASSSPTNMTSDTPASSS 589
Cdd:pfam08580  638 TPRNRQSYAGTSPSRSVSVSSGLGPQTRPGTS 669
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
358-566 1.27e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 46.52  E-value: 1.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  358 QGTLDTPSSSSP--PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT 435
Cdd:PRK12727   52 QRALETARSDTPatAAAPAPAPQAPTKPAAPVHAPLKLSANANMSQRQRVASAAEDMIAAMALRQPVSVPRQAPAAAPVR 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  436 QVTSETPAssspTQVTSDTPASNSPPQGTSDTPgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQV 515
Cdd:PRK12727  132 AASIPSPA----AQALAHAAAVRTAPRQEHALS--AVPEQLFADFLTTAPVPRAPVQAPVVAAPAPVPAIAAALAAHAAY 205
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 755526783  516 TSETPASSSPPQVTSDTS--ASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK12727  206 AQDDDEQLDDDGFDLDDAlpQILPPAALPPIVVAPAAPAALAAVAAAAPAPQN 258
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
366-498 1.33e-04

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteristic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 45.94  E-value: 1.33e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   366 SSSPPQGTSDTPASSSPPQgtsetpasnSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASS 445
Cdd:pfam12287   32 SAQPPSQSPDLSQMVCPPA---------SPEQRLSQQSDVLQQPEQTQVSPVSPSSNACASSGSEYQFHTSEPPQPEAID 102
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 755526783   446 SPtqvtsdtPASNSPPqgtsdtpgfSSPTQvTTATLVSSSPPQVTSDTPASSS 498
Cdd:pfam12287  103 PI-------QSSMSLP---------SELAP-PSPPLSPASQPQVFQSKPASSS 138
PHA03291 PHA03291
envelope glycoprotein I; Provisional
351-448 1.42e-04

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 46.10  E-value: 1.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  351 PASSSPPQGTLDTPSSSSPPQ-GTSD--TPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSE 427
Cdd:PHA03291  176 PLGEGSADGSCDPALPLSAPRlGPADvfVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEG 255
                          90       100
                  ....*....|....*....|.
gi 755526783  428 TPASSSPTqVTSETPASSSPT 448
Cdd:PHA03291  256 TPAPPTPG-GGEAPPANATPA 275
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
293-521 1.43e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.41  E-value: 1.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  293 QGTSDTPASSSPPQVTSATSASSSPpqgTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQG 372
Cdd:PRK12323  368 SGGGAGPATAAAAPVAQPAPAAAAP---AAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  373 TSDTPASSSPPQgtseTPASNSPPQgtsetpgfSSPPQVTTATLVSSSPPQvtseTPASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK12323  445 GGAPAPAPAPAA----APAAAARPA--------AAGPRPVAAAAAAAPARA----APAAAPAPADDDPPPWEELPPEFAS 508
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755526783  453 DTPASNSPPQGTSDTPGFSSP-TQVTTATLVSSSPPQVTSDTPASSSP-----PQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK12323  509 PAPAQPDAAPAGWVAESIPDPaTADPDDAFETLAPAPAAAPAPRAAAAtepvvAPRPPRASASGLPDMFDGDWPA 583
PHA03378 PHA03378
EBNA-3B; Provisional
410-632 1.50e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.60  E-value: 1.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  410 QVTTATLVSSSPPQVTSetpASSSPTQVTSETpasssptQVTSDTPASNSPPQGTS-DTPGfssPTQVTTATLVSSSPPQ 488
Cdd:PHA03378  518 QRVMATLLPPSPPQPRA---GRRAPCVYTEDL-------DIESDEPASTEPVHDQLlPAPG---LGPLQIQPLTSPTTSQ 584
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  489 VTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP---QVIS-DTPASSSPPQVTSETPASSSP 564
Cdd:PHA03378  585 LASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPlrmQPITfNVLVFPTPHQPPQVEITPYKP 664
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 755526783  565 T-NMTSDTPASSSPTNMTSDTPASSSPTNMTSD--TPASSSPPWPVITEVTRPESTIPAGRSLANITSKAQ 632
Cdd:PHA03378  665 TwTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPprAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRAR 735
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
443-595 1.53e-04

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting that it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 44.17  E-value: 1.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   443 ASSSPTQVTSDTPASNSppqgtSDTPGFSSPTQVTTATLVSSSPPqvtsdTPASSSPPQVTSDTPASSSPPQVTSETPAS 522
Cdd:pfam09595   31 ASLILIGESNKEAALII-----TDIIDININKQHPEQEHHENPPL-----NEAAKEAPSESEDAPDIDPNNQHPSQDRSE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   523 SSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSS----PTNMTSDTPAS----SSPTNMTSDTPASSSPTNMT 594
Cdd:pfam09595  101 APPLEPAAKTKPSEHEPANPPDASNRLSPPDASTAAIREARtfrkPSTGKRNNPSSaqsdQSPPRANHEAIGRANPFAMS 180

                   .
gi 755526783   595 S 595
Cdd:pfam09595  181 S 181
PRK10856 PRK10856
cytoskeleton protein RodZ;
451-538 1.59e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 45.40  E-value: 1.59e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  451 TSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PRK10856  164 LDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAG 243

                  ....*...
gi 755526783  531 DTSASISP 538
Cdd:PRK10856  244 VSTPAADP 251
PRK10856 PRK10856
cytoskeleton protein RodZ;
457-551 1.61e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 45.40  E-value: 1.61e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  457 SNSPPQGTSDTPgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASI 536
Cdd:PRK10856  159 GQSVPLDTSTTT--DPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAP 236
                          90
                  ....*....|....*
gi 755526783  537 SPPQVISDTPASSSP 551
Cdd:PRK10856  237 LPTDQAGVSTPAADP 251
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
427-588 1.62e-04

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.13  E-value: 1.62e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  427 ETPASSSPTQVTSETPASSSPTQVTS---DTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSppqvtsDTPASSSPPQVT 503
Cdd:PRK13108  280 EAPGALRGSEYVVDEALEREPAELAAaavASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVT------DEVAAESVVQVA 353
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  504 SDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSS--PTNMT 581
Cdd:PRK13108  354 DRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIPDPakPDELA 433

                  ....*..
gi 755526783  582 SDTPASS 588
Cdd:PRK13108  434 VAGPGDD 440
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
481-577 1.72e-04

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 45.39  E-value: 1.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  481 LVSSSPPQVTSDTPASSSPPQVTSDTPaSSSPPQVTSETPASSspPQVTSDTSASISPPQvisDTPASSSPPQVTSETPa 560
Cdd:PRK13042   15 LLTTGVITTTTQAANATTPSSTKVEAP-QSTPPSTKVEAPQSK--PNATTPPSTKVEAPQ---QTPNATTPSSTKVETP- 87
                          90
                  ....*....|....*..
gi 755526783  561 sSSPTnmTSDTPASSSP 577
Cdd:PRK13042   88 -QSPT--TKQVPTEINP 101
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
231-576 1.78e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 46.06  E-value: 1.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  231 SDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSA 310
Cdd:NF033609  571 SDSSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDS 650
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  311 TSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETP 390
Cdd:NF033609  651 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 730
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  391 ASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPA-SNSPPQGTSDTPG 469
Cdd:NF033609  731 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdSDSDSDSDSDSDS 810
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  470 FSSPTQVTTATLVSSSPPQVTSDTPASSSppqvtSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPqvisDTPASS 549
Cdd:NF033609  811 DSDSDSDSDSDSDSDSDSDSDSDSDSDSD-----SDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPP----NSPKNG 881
                         330       340
                  ....*....|....*....|....*...
gi 755526783  550 SPPQVTSETPASSSPTNMT-SDTPASSS 576
Cdd:NF033609  882 TNASNKNEAKDSKEPLPDTgSEDEANTS 909
Metaviral_G pfam09595
Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. ...
418-569 1.80e-04

Metaviral_G glycoprotein; This is a viral attachment glycoprotein from region G of metaviruses. It is high in serine and threonine, suggesting that it is highly glycosylated.


Pssm-ID: 462833 [Multi-domain]  Cd Length: 183  Bit Score: 44.17  E-value: 1.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   418 SSSPPQVTSETpASSSPTQVTSETPASSSPTQVTSD----TPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDT 493
Cdd:pfam09595   32 SLILIGESNKE-AALIITDIIDININKQHPEQEHHEnpplNEAAKEAPSESEDAPDIDPNNQHPSQDRSEAPPLEPAAKT 110
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783   494 PASSSPPQVTSDTPASSSPPQVTSETPASSsppqvTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTS 569
Cdd:pfam09595  111 KPSEHEPANPPDASNRLSPPDASTAAIREA-----RTFRKPSTGKRNNPSSAQSDQSPPRANHEAIGRANPFAMSS 181
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
426-565 1.81e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 45.71  E-value: 1.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG--FSSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQV 502
Cdd:PTZ00436  208 AAAPSGKKSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAkaAAPPAKAAAPPAKAAAPPAKAAAPPAkAAAPPAK 287
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783  503 TSDTPASSSppqvTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PTZ00436  288 AAAPPAKAA----AAPAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
494-656 1.83e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.32  E-value: 1.83e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  494 PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQvisDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PHA03307   22 PRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPP---TGPPPGPGTEAPANESRSTPTWSLSTLAPA 98
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  574 SSSPTNmtSDTPASSSPTNMTSDTPASSSP---PWPVITEVTRPE-STIPAGRSLANITSKAQEDSPLGVISTHPQMSFQ 649
Cdd:PHA03307   99 SPAREG--SPTPPGPSSPDPPPPTPPPASPppsPAPDLSEMLRPVgSPGPPPAASPPAAGASPAAVASDAASSRQAALPL 176

                  ....*..
gi 755526783  650 SSTSQQA 656
Cdd:PHA03307  177 SSPEETA 183
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
513-714 1.83e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.92  E-value: 1.83e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  513 PQVTSETPASSSPPQVTSD-TSASISPPQVISDTPASSSPPQvTSETPASSSPTNMTsdTPASSSPTNMTSdtPASSSPT 591
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKPvFTQPAAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSAT--QPAGTPPTVSVD--PPAAVPV 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  592 NMTSDTPASSSPPwpviteVTRPESTIPagrslaniTSKAqedSPLGVISTHPQmsfQSSTSQQALDETAgerVPTIPDF 671
Cdd:PRK14971  438 NPPSTAPQAVRPA------QFKEEKKIP--------VSKV---SSLGPSTLRPI---QEKAEQATGNIKE---APTGTQK 494
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 755526783  672 QAHSEFQkacaiLQRL-RDFLPTSPtSAQVSVANLLIDLSEQLL 714
Cdd:PRK14971  495 EIFTEED-----LQYYwQEFAGTRP-QEEKALKETMINCRPKLL 532
PRK10856 PRK10856
cytoskeleton protein RodZ;
361-450 1.85e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 45.40  E-value: 1.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  361 LDTPSSSSP-PQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTS 439
Cdd:PRK10856  164 LDTSTTTDPaTTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAG 243
                          90
                  ....*....|.
gi 755526783  440 ETPASSSPTQV 450
Cdd:PRK10856  244 VSTPAADPNAL 254
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
321-469 2.11e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 45.42  E-value: 2.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   321 TSDTPASSSPPQ----VTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPqgTSDTPASSSPPQGTSETPASNSPP 396
Cdd:pfam05539  191 SQVTPQSQPATQghqtATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPS--GSPQHPPSTTSQDQSTTGDGQEHT 268
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755526783   397 QGTSETPGFSSPPQVTTATLVSSSPPQVTS--ETPASSSPTQVTSeTPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:pfam05539  269 QRRKTPPATSNRRSPHSTATPPPTTKRQETgrPTPRPTATTQSGS-SPPHSSPPGVQANPTTQNLVDCKELDPPK 342
PRK10856 PRK10856
cytoskeleton protein RodZ;
522-603 2.34e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 45.02  E-value: 2.34e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  522 SSSPPQVTSDTSASISPPQVISDTPASSSP--PQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPA 599
Cdd:PRK10856  168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                  ....
gi 755526783  600 SSSP 603
Cdd:PRK10856  248 AADP 251
PHA03255 PHA03255
BDLF3; Provisional
399-559 2.40e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 44.51  E-value: 2.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  399 TSETPGFSSPPQVTTATLVSSSPPQvTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSppqGTSDTPGFSSPTQVTT 478
Cdd:PHA03255   25 TSSGSSTASAGNVTGTTAVTTPSPS-ASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST---GTTVTPVPTTSNASTI 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  479 ATLVSSSPPQVTSDTPASSSPPQVTSD--TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTS 556
Cdd:PHA03255  101 NVTTKVTAQNITATEAGTGTSTGVTSNvtTRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQPSLSY 180

                  ...
gi 755526783  557 ETP 559
Cdd:PHA03255  181 GLP 183
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
468-591 2.47e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.48  E-value: 2.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  468 PGFSSPTQVT--TATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTsdtsASISPPQVISDT 545
Cdd:PRK14951  366 PAAAAEAAAPaeKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAP----AAAAPAAAPAAA 441
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 755526783  546 PASSSPPQVTSETPAS--SSPTNMTSDTPASSSPTNMTSDTPASSSPT 591
Cdd:PRK14951  442 PAAVALAPAPPAQAAPetVAIPVRVAPEPAVASAAPAPAAAPAAARLT 489
motB PRK12799
flagellar motor protein MotB; Reviewed
373-504 2.58e-04

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 45.09  E-value: 2.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  373 TSDTPASSSPPqgTSETPASNSPPQGTSETPGfssppQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK12799  296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPS-----PAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPA 368
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 755526783  453 DTPASNSPPQGTSDTPGFSSPTQVTTATlvsSSPPQVTSDTPASSSPPQVTS 504
Cdd:PRK12799  369 AEPVNMQPQPMSTTETQQSSTGNITSTA---NGPTTSLPAAPASNIPVSPTS 417
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
487-594 2.62e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.54  E-value: 2.62e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  487 PQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVisdTPASSSPPQVtSETPASSSPTN 566
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSA---TQPAGTPPTV-SVDPPAAVPVN 438
                          90       100
                  ....*....|....*....|....*...
gi 755526783  567 MTSDTPASSSPTNMTSDTPASSSPTNMT 594
Cdd:PRK14971  439 PPSTAPQAVRPAQFKEEKKIPVSKVSSL 466
PRK13914 PRK13914
invasion associated endopeptidase;
352-588 2.91e-04

invasion associated endopeptidase;


Pssm-ID: 237555 [Multi-domain]  Cd Length: 481  Bit Score: 45.18  E-value: 2.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  352 ASSSPPQGTLDTPSSSSPPQGTsdtPASSSPPQGTSETPASNSPPQG--TSETPGFSSppQVTTATLVSSSPPQVTSETP 429
Cdd:PRK13914  143 VTSTPVAPTQEVKKETTTQQAA---PAAETKTEVKQTTQATTPAPKVaeTKETPVVDQ--NATTHAVKSGDTIWALSVKY 217
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  430 ASSSPTQVTSETPASSS---PTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDT 506
Cdd:PRK13914  218 GVSVQDIMSWNNLSSSSiyvGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKA 297
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  507 PASSSPPqvtseTPAssspPQVTSDTSASISPPQVISDTPASSSPPQVTSETpaSSSPTNMTSDTPASSSPTNMTSDTPA 586
Cdd:PRK13914  298 PTEAAKP-----APA----PSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTN--TNSNTNTNSNTNANQGSSNNNSNSSA 366

                  ..
gi 755526783  587 SS 588
Cdd:PRK13914  367 SA 368
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
435-572 3.27e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 45.15  E-value: 3.27e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  435 TQVTSETPASSSPTQVTSDT---PASNSPPQGTSDtpgfssPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPSAAAA------ASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVP 436
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 755526783  512 PPqvtsetPASSSPPQVTSDTSAS---ISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTP 572
Cdd:PRK14971  437 VN------PPSTAPQAVRPAQFKEekkIPVSKVSSLGPSTLRPIQEKAEQATGNIKEAPTGTQK 494
PRK10856 PRK10856
cytoskeleton protein RodZ;
362-476 3.44e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 44.63  E-value: 3.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  362 DTPSSSSPPQGTSDTP--ASSSPPQGTSETPASNSPPQGTSetpgfSSPPQVTTATlvsssPPQVTSETPASSSPTQVTS 439
Cdd:PRK10856  148 DQSSAELSQNSGQSVPldTSTTTDPATTPAPAAPVDTTPTN-----SQTPAVATAP-----APAVDPQQNAVVAPSQANV 217
                          90       100       110
                  ....*....|....*....|....*....|....*..
gi 755526783  440 ETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQV 476
Cdd:PRK10856  218 DTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNAL 254
PRK10856 PRK10856
cytoskeleton protein RodZ;
483-564 3.44e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 44.63  E-value: 3.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  483 SSSPPQVTSDTPASSSPPQVTSDTPASSSP--PQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPA 560
Cdd:PRK10856  168 TTTDPATTPAPAAPVDTTPTNSQTPAVATApaPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                  ....
gi 755526783  561 SSSP 564
Cdd:PRK10856  248 AADP 251
PRK08581 PRK08581
amidase domain-containing protein;
353-550 3.45e-04

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 45.16  E-value: 3.45e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  353 SSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETpgfSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:PRK08581  129 LNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPS---SNNTKPSTSNKQPNSPKPTQPNQSNSQ 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  433 SPTQVTSETPASSSPTQVTSDTPASNSPPQgtsdtpgFSSPTQVTTATLVSSSPPQVTSdTPASSSPPQVTSDTPASSSP 512
Cdd:PRK08581  206 PASDDTANQKSSSKDNQSMSDSALDSILDQ-------YSEDAKKTQKDYASQSKKDKTE-TSNTKNPQLPTQDELKHKSK 277
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 755526783  513 PQVTSETPAssspPQVTSDTSASISPPQVISDTPASSS 550
Cdd:PRK08581  278 PAQSFENDV----NQSNTRSTSLFETGPSLSNNDDSGS 311
PPE COG5651
PPE-repeat protein [Function unknown];
267-500 3.47e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 44.88  E-value: 3.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  267 QGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPqvtsATSASSSPPQG 346
Cdd:COG5651   155 AAASAAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGP----IGLNSGPGNTG 230
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  347 TSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:COG5651   231 FAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGL 310
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 755526783  427 ETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPP 500
Cdd:COG5651   311 GAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
455-573 3.66e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.86  E-value: 3.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  455 PASNSPPQGTSDTPGfsspTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSdtSA 534
Cdd:PRK07994  366 PEPEVPPQSAAPAAS----AQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK--AK 439
                          90       100       110
                  ....*....|....*....|....*....|....*....
gi 755526783  535 SISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:PRK07994  440 KSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEA 478
ARG80 COG5068
Regulator of arginine metabolism and related MADS box-containing transcription factors ...
347-604 3.68e-04

Regulator of arginine metabolism and related MADS box-containing transcription factors [Transcription];


Pssm-ID: 227400 [Multi-domain]  Cd Length: 412  Bit Score: 44.62  E-value: 3.68e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  347 TSDTPASSSPPQGTLDTPSSSSpPQGTSDTPASSSPPQGTSETPASNSPPQGtsetpgfSSPPQVTTATLVSSSPPQVts 426
Cdd:COG5068   163 PSDSSEEPSSSASFSVDPNDNN-PMGSFQHNGSPQTNFIPLQNPQTQQYQQH-------SSRKDHPTVPHSNTNNGRP-- 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  427 etPASSSPTQVTSETPASSSPTQVTSDtpaSNSPPQG-TSDTPGFSSPTQVTTATLVsSSPPQVTSDTPASSSPPQVTSD 505
Cdd:COG5068   233 --PAKFMIPELHSSHSTLDLPSDFISD---SGFPNQSsTSIFPLDSAIIQITPPHLP-NNPPQENRHELYSNDSSMVSET 306
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  506 TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:COG5068   307 PPPKNLPNGSPNQSPLNNLSRGNPASPNSIIRENNQVEDESFNGRQGSAIWNALISTTQPNSGLHTEASTAPSSTIPADP 386
                         250
                  ....*....|....*....
gi 755526783  586 ASSSPTNMTSDTPASSSPP 604
Cdd:COG5068   387 LKNAAQTNSGTRNNNFSDN 405
PHA03377 PHA03377
EBNA-3C; Provisional
408-697 3.81e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 45.04  E-value: 3.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  408 PPQVTTATLVSSSPPQVTSE----TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVS 483
Cdd:PHA03377  399 PVQQRPVMFVSRVPWRKPRTlpwpTPKTHPVKRTLVKTSGRSDEAEQAQSTPERPGPSDQPSVPVEPAHLTPVEHTTVIL 478
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  484 SSPPQVTSDTPASSSPPQV---------------------TSDTPASSSPPQVTSETPASSSppQVTSDTSASISPPQVI 542
Cdd:PHA03377  479 HQPPQSPPTVAIKPAPPPSrrrrgacvvydddiievidveTTEEEESVTQPAKPHRKVQDGF--QRSGRRQKRATPPKVS 556
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  543 -SDT--PASSSP---PQVTSETPASSSPTNMTSDTPASSSPTNMT--SDTPASSSPTNMTSDTPASSSPPWPVITEVTR- 613
Cdd:PHA03377  557 pSDRgpPKASPPvmaPPSTGPRVMATPSTGPRDMAPPSTGPRQQAkcKDGPPASGPHEKQPPSSAPRDMAPSVVRMFLRe 636
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  614 ---PESTIPAGRSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQAldetageRVPTIPDFQAHSEFQKACAILQRLRDF 690
Cdd:PHA03377  637 rllEQSTGPKPKSFWEMRAGRDGSGIQQEPSSRRQPATQSTPPRPS-------WLPSVFVLPSVDAGRAQPSEESHLSSM 709

                  ....*..
gi 755526783  691 LPTSPTS 697
Cdd:PHA03377  710 SPTQPIS 716
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
366-456 3.99e-04

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 44.84  E-value: 3.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  366 SSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSET-PGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPAS 444
Cdd:PRK11907   19 TASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEARTV 98
                          90
                  ....*....|..
gi 755526783  445 SSPTqvTSDTPA 456
Cdd:PRK11907   99 TPAA--TETSKP 108
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
471-591 4.07e-04

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 44.56  E-value: 4.07e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  471 SSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSDTPA-SSSPPQVTSETP--ASSSPPQVTSDTSASISPPQVISDTP 546
Cdd:PTZ00436  220 AAPAKAAAAPAKAAAPPAKAAAAPAkAAAAPAKAAAPPAkAAAPPAKAAAPPakAAAPPAKAAAPPAKAAAPPAKAAAAP 299
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 755526783  547 A-SSSPPQVTSETPA-SSSPTNMTSDTPASSSPTNMTSDTPASSSPT 591
Cdd:PTZ00436  300 AkAAAAPAKAAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
429-561 4.81e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.71  E-value: 4.81e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PASSSPTQVTSEtpaSSSPTQVTSDTPASNSPPQgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPA 508
Cdd:PRK14951  366 PAAAAEAAAPAE---KKTPARPEAAAPAAAPVAQ--------AAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAP 434
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 755526783  509 SSSPPQVTSETPASSSPPQVTSDTSASI-----SPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PRK14951  435 AAAPAAAPAAVALAPAPPAQAAPETVAIpvrvaPEPAVASAAPAPAAAPAAARLTPTE 492
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
390-530 4.98e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 44.47  E-value: 4.98e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  390 PASNSPPQGTSetpgfSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPG 469
Cdd:PRK07994  366 PEPEVPPQSAA-----PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKK 440
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783  470 fSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASS--SPPQVTSETPASSSPPQVTS 530
Cdd:PRK07994  441 -SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkATNPVEVKKEPVATPKALKK 502
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
235-487 5.23e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.46  E-value: 5.23e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  235 ASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSAS 314
Cdd:PRK07003  363 TGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDA 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  315 SSPPQGT-SDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTL-----DTPSSSSPPQGTSDTPASSSPPQGTSE 388
Cdd:PRK07003  443 ADGDAPVpAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFepaprAAAPSAATPAAVPDARAPAAASREDAP 522
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  389 TPASNSPPQGTSETPGFSSPPQVTT---ATL---------VSSSPPQVTSETPASSSPTQVTSETPASSSPTQVtsdtPA 456
Cdd:PRK07003  523 AAAAPPAPEARPPTPAAAAPAARAGgaaAALdvlrnagmrVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQV----PT 598
                         250       260       270
                  ....*....|....*....|....*....|.
gi 755526783  457 SNSPPQGTSDTPGFSSPTQVTTATLvSSSPP 487
Cdd:PRK07003  599 PRARAATGDAPPNGAARAEQAAESR-GAPPP 628
PHA03132 PHA03132
thymidine kinase; Provisional
354-473 5.24e-04

thymidine kinase; Provisional


Pssm-ID: 222997 [Multi-domain]  Cd Length: 580  Bit Score: 44.37  E-value: 5.24e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  354 SSPPQGTLDTPSSSSPPQGTSDTPasSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:PHA03132   65 GVATSTIYTVPRPPRGPEQTLDKP--DSLPASRELPPGPTPVPPGGFRGASSPRLGADSTSPRFLYQVNFPVILAPIGES 142
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 755526783  434 PTqvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSP 473
Cdd:PHA03132  143 NS--SSEELSEEEEHSRPPPSESLKVKNGGKVYPKGFSKH 180
PPE COG5651
PPE-repeat protein [Function unknown];
322-539 5.58e-04

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 44.11  E-value: 5.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  322 SDTPASSSPPQVTSATSASSSPPQGTSDTPAS--------SSPPQGTLDTPSSSSPPQGTsdtpASSSPPQGTSETPASN 393
Cdd:COG5651   163 ALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNpgfanlglTGLNQVGIGGLNSGSGPIGL----NSGPGNTGFAGTGAAA 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  394 SPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSP 473
Cdd:COG5651   239 GAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGA 318
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783  474 TQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPP 539
Cdd:COG5651   319 AGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
357-501 5.69e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.38  E-value: 5.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  357 PQGTLDTPSSSSPPQGTSDT---PASSSPPQgtseTPASNSPPQGTSETPgfSSPPQVTTATLVSSSPPQVtSETPASSS 433
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKPVftqPAAAPQPS----AAAAASPSPSQSSAA--AQPSAPQSATQPAGTPPTV-SVDPPAAV 435
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 755526783  434 PTQVTSETPASSSPTQVTSDTPasNSPPQGTSDTPGFSSPTQVTTAtlvsssppQVTSDTPASSSPPQ 501
Cdd:PRK14971  436 PVNPPSTAPQAVRPAQFKEEKK--IPVSKVSSLGPSTLRPIQEKAE--------QATGNIKEAPTGTQ 493
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
204-410 5.80e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 5.80e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  204 IDPASSAPPKATHRMTITSLTGRPQVTSDTLASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvts 283
Cdd:PRK07764  588 VGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD--- 664
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  284 atSASSSPPQGTSDTPASSSPPQVTsatsasssppqGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDT 363
Cdd:PRK07764  665 --GGDGWPAKAGGAAPAAPPPAPAP-----------AAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAP 731
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 755526783  364 PSSSS---PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQ 410
Cdd:PRK07764  732 SPAADdpvPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
Hamartin pfam04388
Hamartin protein; This family includes the hamartin protein which is thought to function as a ...
205-585 6.06e-04

Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.


Pssm-ID: 461287 [Multi-domain]  Cd Length: 730  Bit Score: 44.28  E-value: 6.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   205 DPASSAPPKATHRMTITSLTGrpqvTSDTLASSSPPQGTSDTPASSSPPQVTsatsasssppqGTSDTPASS-----SPP 279
Cdd:pfam04388  288 YGSSTSTPSSTPRLQLSSSSG----TSPPYLSPPSIRLKTDSFPLWSPSSVC-----------GMTTPPTSPgmvptTPS 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   280 QVTSATSASSSPPQGTSD---------TPASSSPPQvtsatsasssppqgtsdtpasSSPPqvtsatsasssppqgtsdt 350
Cdd:pfam04388  353 ELSPSSSHLSSRGSSPPEaageatpetTPAKDSPYL---------------------KQPP------------------- 392
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   351 PASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQgTSETPASNSppqgtseTPGFSSPP-QVTTATLvsSSPPQVTSETP 429
Cdd:pfam04388  393 PLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPL-SKQAPTNPN-------SRGLLEPPgDKSSVTL--SELPDFIKDLA 462
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   430 ASSSPTQVTSETPAS-----SSPTQVTSDTPASNsppqgtsdtPGFSSPTQVTTATLVSSsppQVTSDTPASSSPPQVTS 504
Cdd:pfam04388  463 LSSEDSVEGAEEEAAisqelSEITTEKNETDCSR---------GGLDMPFSRTMESLAGS---QRSRNRIASYCSSTSQS 530
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   505 DTPASSSPPQVTSETPASSSPPQVTSDTSASISPP--QVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTS 582
Cdd:pfam04388  531 DSHGPATTPESKPSALAEDGLRRTKSCSFKQSFTPieQPIESSDDCPTDEQDGENGLETSILTPSPCKIPSRQKVSTQSG 610

                   ...
gi 755526783   583 DTP 585
Cdd:pfam04388  611 QPL 613
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
356-529 6.29e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 44.15  E-value: 6.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  356 PPQGTLDTPSSSSPPQG----------TSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:PLN03209  382 PPTSPIPTPPSSSPASSksvdavakpaEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGV 461
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  426 SETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTP--ASSSPPQVT 503
Cdd:PLN03209  462 SPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVkvGNSAPPTAL 541
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 755526783  504 SDT---------PASSSPPQVTSETPASSSPPQVT 529
Cdd:PLN03209  542 ADEqhhaqpkprPLSPYTMYEDLKPPTSPTPSPVL 576
PRK10856 PRK10856
cytoskeleton protein RodZ;
431-525 6.51e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.48  E-value: 6.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  431 SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGtsdtpgfSSPTQVTTATlvssSPPQVTSDTPASSSPPQVTSDTPASS 510
Cdd:PRK10856  168 TTTDPATTPAPAAPVDTTPTNSQTPAVATAPAP-------AVDPQQNAVV----APSQANVDTAATPAPAAPATPDGAAP 236
                          90
                  ....*....|....*
gi 755526783  511 SPPQVTSETPASSSP 525
Cdd:PRK10856  237 LPTDQAGVSTPAADP 251
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
377-513 6.54e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 44.32  E-value: 6.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  377 PASSSPPQGTSETPASNSPPQGTsetPGFSSPPQVTTATLVSSSPPQVTSET---PASSSPTQVTSETPASSSPTQVTSD 453
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAAA---PAAAPVAQAAAAPAPAAAPAAAASAPaapPAAAPPAPVAAPAAAAPAAAPAAAP 442
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  454 TPASNSPPQGTSDTPGFSSPtqvttatlvsssPPQVTSDTPASSSPPQVTSDTPASSSPP 513
Cdd:PRK14951  443 AAVALAPAPPAQAAPETVAI------------PVRVAPEPAVASAAPAPAAAPAAARLTP 490
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
396-540 6.58e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 44.00  E-value: 6.58e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  396 PQGTSETPGFSSPPQVTT-ATLVSSSPPQVTSETPASSSPTQvTSETPASSSPTQVtsdTPASNSPPQGTSDTPGfSSPT 474
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIKpVFTQPAAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSA---TQPAGTPPTVSVDPPA-AVPV 437
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 755526783  475 QVTTATLVSSSPPQVTSDTPASSSppQVTSDTPASSSPPQVTSEtpasssppQVTSDTSASISPPQ 540
Cdd:PRK14971  438 NPPSTAPQAVRPAQFKEEKKIPVS--KVSSLGPSTLRPIQEKAE--------QATGNIKEAPTGTQ 493
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
378-468 6.70e-04

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 44.07  E-value: 6.70e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  378 ASSSPPQGTSETPASNSPPQGTSETPgfSSPPQVTTAT---LVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT 454
Cdd:PRK11907   18 LTASNPKLAQAEEIVTTTPATSTEAE--QTTPVESDATeeaDNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEA 95
                          90
                  ....*....|....
gi 755526783  455 PAsnSPPQGTSDTP 468
Cdd:PRK11907   96 RT--VTPAATETSK 107
PHA02732 PHA02732
hypothetical protein; Provisional
355-609 7.09e-04

hypothetical protein; Provisional


Pssm-ID: 165099 [Multi-domain]  Cd Length: 1467  Bit Score: 44.36  E-value: 7.09e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  355 SPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSP-PQGTSETPGFS-SPPQVTTATLVSSSPPQ----VTSET 428
Cdd:PHA02732 1074 SPSYIFLNSWASSYVAPGFLGSPYALPYFMNQTSALVGNTAlPKGLNVFSGYMfGAGTVASAFLYMNSTPQspvlALLLA 1153
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PASSSPTQVTSETPASSSPTQVTSDTP------ASNSPPQG----TSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PHA02732 1154 PYISYKFNALSLGFSITADAAIFSLFGipapqlLSSYIPTGsvlyQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASS 1233
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  499 PPQVTSDTPASSSPPQVTSETpASSSPP--QVTSDTSASISPPQVISdtPASSSPP--QVTSETPASSSPTNMTS-DTPA 573
Cdd:PHA02732 1234 PPAATTPTPPPSSSSSSSAQS-ISTSPGqiQIVLNGSTTIHINFLFF--PALSTPKigQILAMPIVNSSGAFISLyVNSA 1310
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 755526783  574 SSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT 609
Cdd:PHA02732 1311 ISANFNVTIEYVFSNGTVIKRFTDEPGQIFPLPLIN 1346
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
430-603 7.23e-04

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 44.22  E-value: 7.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   430 ASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgfssptQVTTAtlVSSSPPQVT-SDTPasSSPPQVTSDTPA 508
Cdd:TIGR00927   55 SSQQPIKLASRDLSNDEMMMVSSDPPKSSSEMEGEMLAP------QATVG--RDEATPSIAmENTP--SPPRRTAKITPT 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   509 SSSppqvTSETPASSSPPQVTSDTSAsiSPPQVISDTPASSSPPQVTSETPA------SSSPTNMTSDTPA-SSSPTNMT 581
Cdd:TIGR00927  125 TPK----NNYSPTAAGTERVKEDTPA--TPSRALNHYISTSGRQRVKSYTPKprgevkSSSPTQTREKVRKyTPSPLGRM 198
                          170       180
                   ....*....|....*....|..
gi 755526783   582 SDTPASSspTNMTSDTPASSSP 603
Cdd:TIGR00927  199 VNSYAPS--TFMTMPRSHGITP 218
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
460-636 7.29e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.07  E-value: 7.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  460 PPQGTSDTPGFSSPTQVTTAtlVSSSPPQVTSDTPASSSPPQvtsdTPASSSPPQVTSETPASSSPPQVTSDTSASISPP 539
Cdd:PRK07003  360 PAVTGGGAPGGGVPARVAGA--VPAPGARAAAAVGASAVPAV----TAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPP 433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  540 Q---VISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTnmTSDTPASSSPTNMTSDtPASSSPPWPVITEVTRPES 616
Cdd:PRK07003  434 AtadRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSG--SASAPASDAPPDAAFE-PAPRAAAPSAATPAAVPDA 510
                         170       180
                  ....*....|....*....|
gi 755526783  617 TIPAGRSLANITSKAQEDSP 636
Cdd:PRK07003  511 RAPAAASREDAPAAAAPPAP 530
PHA03255 PHA03255
BDLF3; Provisional
490-670 7.33e-04

BDLF3; Provisional


Pssm-ID: 165513 [Multi-domain]  Cd Length: 234  Bit Score: 42.97  E-value: 7.33e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  490 TSDTPASSSPPQVTSDTPasssppqVTSETPASSSPPQVTSDTSASISPPqvISDTPASSSPPQVTSETPASSSPTNMTS 569
Cdd:PHA03255   25 TSSGSSTASAGNVTGTTA-------VTTPSPSASGPSTNQSTTLTTTSAP--ITTTAILSTNTTTVTSTGTTVTPVPTTS 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  570 DTPASSSPTNMTSDTPASSSptnmtsdTPASSSPPwPVITEVTRPESTIPAGRSLANITSKAQEdsplgvisthpqmsfQ 649
Cdd:PHA03255   96 NASTINVTTKVTAQNITATE-------AGTGTSTG-VTSNVTTRSSSTTSATTRITNATTLAPT---------------L 152
                         170       180
                  ....*....|....*....|.
gi 755526783  650 SSTSQQALDETAGErVPTIPD 670
Cdd:PHA03255  153 SSKGTSNATKTTAE-LPTVPD 172
PRK13042 PRK13042
superantigen-like protein SSL4; Reviewed;
428-518 7.42e-04

superantigen-like protein SSL4; Reviewed;


Pssm-ID: 183854 [Multi-domain]  Cd Length: 291  Bit Score: 43.08  E-value: 7.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPpqgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTP 507
Cdd:PRK13042   18 TGVITTTTQAANATTPSSTKVEAPQSTPPSTKV----------EAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVETP 87
                          90
                  ....*....|.
gi 755526783  508 ASSSPPQVTSE 518
Cdd:PRK13042   88 QSPTTKQVPTE 98
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
364-499 7.88e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 43.90  E-value: 7.88e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  364 PSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPqvTSETPASSSPTQVTSETPa 443
Cdd:PRK14959  367 PVESLRPSGGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPA--PSAAPSPRVPWDDAPPAP- 443
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 755526783  444 sssptqvtsdtPASNSPPQGTSDTPGFSSPT--QVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:PRK14959  444 -----------PRSGIPPRPAPRMPEASPVPgaPDSVASASDAPPTLGDPSDTAEHTP 490
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
555-603 8.08e-04

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 42.30  E-value: 8.08e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 755526783  555 TSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:cd21441    65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVP 113
PHA03193 PHA03193
tegument protein VP11/12; Provisional
368-511 8.14e-04

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 43.94  E-value: 8.14e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  368 SPPQGTSDTPASSSPpqgTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPAS------SSPTQVTSET 441
Cdd:PHA03193  441 SPFQRKRAMPEDGGE---IHEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTSNEMkgdaecPAAQDAAAIL 517
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  442 PASSsptQVTSDTPASNSPPQGTSDTpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSS 511
Cdd:PHA03193  518 PASF---QIENGGAADGSGLAIPAAM---CDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSK 581
SP2_N cd22540
N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins ...
351-604 8.18e-04

N-terminal domain of transcription factor Specificity Protein (SP) 2; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP2 contains the least conserved DNA-binding domain within the SP subfamily of proteins, and its DNA sequence specificity differs from the other SP proteins. It localizes primarily within subnuclear foci associated with the nuclear matrix, and can activate, or in some cases, repress expression from different promoters. The transcription factor SP2 serves as a paradigm for indirect genomic binding. It does not require its DNA-binding domain for genomic DNA binding and occupies target promoters independently of whether they contain a cognate DNA-binding motif. SP2 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9.
The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP2.


Pssm-ID: 411776 [Multi-domain]  Cd Length: 511  Bit Score: 43.76  E-value: 8.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  351 PASSSPPQGTLDTPSSSS------PPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSS---- 420
Cdd:cd22540    66 PLPLGPGKNSIGFLSAKGniiqlqGSQLSSSAPGGQQVFAIQNPTMIIKGSQTRSSTNQQYQISPQIQAAGQINNSgqiq 145
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  421 -----------PPQVTSETPASSSPTQV---------TSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTAT 480
Cdd:cd22540   146 iipgtnqaiitPVQVLQQPQQAHKPVPIkpaplqtsnTNSASLQVPGNVIKLQSGGNVALTLPVNNLVGTQDGATQLQLA 225
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  481 LVSSSPPQVTSDTPASSSPPQVTS--------------------------DTPASSSPPQV--------TSETPASSSPP 526
Cdd:cd22540   226 AAPSKPSKKIRKKSAQAAQPAVTVaeqvetvliettadniiqagnnllivQSPGTGQPAVLqqvqvlqpKQEQQVVQIPQ 305
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  527 ------QVTSDTSASI--SPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTP 598
Cdd:cd22540   306 qalrvvQAASATLPTVpqKPLQNIQIQNSEPTPTQVYIKTPSGEVQTVLLQEAPAATATPSSSTSTVQQQVTANNGTGTS 385

                  ....*.
gi 755526783  599 ASSSPP 604
Cdd:cd22540   386 KPNYNV 391
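
The SP2_N description above quotes the loose consensus motif NNRCRCCYY using IUPAC nucleotide codes (N is any nucleotide; R is A/G; Y is C/T). As a rough illustration only, the sketch below expands such a motif into a regular expression and scans a DNA string for it. The function names `motif_to_regex` and `find_motif` and the example sequence are hypothetical and are not part of the CDD record or any NCBI tool.

```python
import re

# IUPAC codes used in the description: N = any nucleotide, R = A/G, Y = C/T.
# Only the codes needed for this motif are listed (an assumption of this sketch).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}


def motif_to_regex(motif):
    """Translate an IUPAC motif string into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in motif.upper()))


def find_motif(motif, sequence):
    """Return 0-based start positions of every match, including overlapping ones."""
    # A lookahead lets the scan report matches that overlap each other.
    lookahead = re.compile("(?=" + motif_to_regex(motif).pattern + ")")
    return [m.start() for m in lookahead.finditer(sequence.upper())]


if __name__ == "__main__":
    # Hypothetical test sequence, constructed for illustration only.
    print(find_motif("NNRCRCCYY", "TTAAGCGCCCTTT"))
```

The scan covers only the strand that is given; searching the reverse complement as well would be a natural extension.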
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
383-514 8.86e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 43.61  E-value: 8.86e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  383 PQGTSETPASNSPPQGTSetPGFS---SPPQVTTATLVSSSPPQvTSETPASSSPTQVtseTPASSSPTQVTSDTP-ASN 458
Cdd:PRK14971  363 TQKGDDASGGRGPKQHIK--PVFTqpaAAPQPSAAAAASPSPSQ-SSAAAQPSAPQSA---TQPAGTPPTVSVDPPaAVP 436
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  459 SPPQGT---SDTPGFSSPTQVTTATLVSSSPPQVTSdtPASSSPPQVTSDTPASSSPPQ 514
Cdd:PRK14971  437 VNPPSTapqAVRPAQFKEEKKIPVSKVSSLGPSTLR--PIQEKAEQATGNIKEAPTGTQ 493
CytochromB561_N pfam09786
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of ...
350-558 9.59e-04

Cytochrome B561, N terminal; Members of this family are found in the N terminal region of cytochrome B561, as well as in various other putative uncharacterized proteins.


Pssm-ID: 462899  Cd Length: 579  Bit Score: 43.66  E-value: 9.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   350 TPASSSPPQGTLDTPSssspPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFS--SPPQVTTATLVSSSPPQVTSE 427
Cdd:pfam09786  129 PPKSKSSPQSPSPVLV----PLHQSVSPSSSESRKGGDKSPAGSGKKLRSFSTSSKSpaSPSVYLRGSPVPLNSSPLPSD 204
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   428 TPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSpPQVTSDTP 507
Cdd:pfam09786  205 RNYENSVQSSPEIDSAVSTPWSRKRATIGKEIRTEKMLERFLAEVDEKITESAFGKASPSNVSGSANRSGS-TRSTPLRS 283
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 755526783   508 ASSSPPQVTSETPASSSPpqvtSDTSASISPPQVISDTPASSSPPQVTSET 558
Cdd:pfam09786  284 VRMSPGSQKFTTPPKKGE----GDLPSPMSMEENIEAFENLGIYPQIEQWR 330
PHA03379 PHA03379
EBNA-3A; Provisional
211-606 9.65e-04

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 43.89  E-value: 9.65e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  211 PP---KATHRMTITSLTGRPQVTSDTLASssPPQGTSDTPassSPPQVTSATSASSSPPQGTSDTPASS-SPPQVTSATS 286
Cdd:PHA03379  379 PPiflRRLHRLLLMRAGKLTERAREALEK--ASEPTYGTP---RPPVEKPRPEVPQSLETATSHGSAQVpEPPPVHDLEP 453
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  287 ASSSPPQGTSDTPASSSPPqvtsatsasssppqgtsdtpassSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSS 366
Cdd:PHA03379  454 GPLHDQHSMAPCPVAQLPP-----------------------GPLQDLEPGDQLPGVVQDGRPACAPVPAPAGPIVRPWE 510
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  367 SSPPQGTSDTPASSSP----------PQGTSETPASNSPP----QGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASS 432
Cdd:PHA03379  511 ASLSQVPGVAFAPVMPqpmpvepvpvPTVALERPVCPAPPliamQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMSVRD 590
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  433 SPTQVTSET-----PASSSPTQVTSDTPAS-----NSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTS---DTPASSSP 499
Cdd:PHA03379  591 RLARLRAEAqpyqaSVEVQPPQLTQVSPQQpmeypLEPEQQMFPGSPFSQVADVMRAGGVPAMQPQYFDlplQQPISQGA 670
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  500 PQVTSDTPASSSPPqVTSETPASSSPPqVTSDTSASISPPQVISDTPAssSPPQVtseTPASSSPTNMTSDTPASSSPTN 579
Cdd:PHA03379  671 PLAPLRASMGPVPP-VPATQPQYFDIP-LTEPINQGASAAHFLPQQPM--EGPLV---PERWMFQGATLSQSVRPGVAQS 743
                         410       420
                  ....*....|....*....|....*..
gi 755526783  580 MTSDTPASSSPTNMTSDTPASSSPPWP 606
Cdd:PHA03379  744 QYFDLPLTQPINHGAPAAHFLHQPPME 770
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
434-560 9.84e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 43.70  E-value: 9.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  434 PTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTS----DTPAS 509
Cdd:PRK07994  361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRaqgaTKAKK 440
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 755526783  510 SSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQ-------VTSETPA 560
Cdd:PRK07994  441 SEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATnpvevkkEPVATPK 498
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
442-578 9.92e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.55  E-value: 9.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  442 PASSSPTQVTSDTPASNSPPQGTsdtPGFSSPTQVTTATLVSSSPPQVTSdtpASSSPPqvtsdTPASSSPPQVTSETPA 521
Cdd:PRK14951  366 PAAAAEAAAPAEKKTPARPEAAA---PAAAPVAQAAAAPAPAAAPAAAAS---APAAPP-----AAAPPAPVAAPAAAAP 434
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 755526783  522 SSSPPQVTSdtSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PRK14951  435 AAAPAAAPA--AVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLT 489
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
487-620 1.02e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 43.70  E-value: 1.02e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  487 PQVTSDTPASssPPQVTSDTPASssppQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTN 566
Cdd:PRK07994  361 PAAPLPEPEV--PPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQG 434
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 755526783  567 mtSDTPASSSPTNMTSDTPASSSPTNMTSDTP-ASSSPPWPVITEVTRPESTIPA 620
Cdd:PRK07994  435 --ATKAKKSEPAAASRARPVNSALERLASVRPaPSALEKAPAKKEAYRWKATNPV 487
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
427-578 1.03e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 43.01  E-value: 1.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  427 ETPASSSPTQVTSETPASSSPTQVTSdtpASNSPPQGTSDTPGFSS--PTQVTTATLVSSSPPQVTSDTPA-SSSPPQVT 503
Cdd:PTZ00436  191 EDAAAAAAAKQKAAAKKAAAPSGKKS---AKAAAPAKAAAAPAKAAapPAKAAAAPAKAAAAPAKAAAPPAkAAAPPAKA 267
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  504 SDTPA-SSSPPQVTSETP--ASSSPPQVTSDTSASISPPQVISDTPA-SSSPPQVTSETPASSSPTNMTSDTPASSSPT 578
Cdd:PTZ00436  268 AAPPAkAAAPPAKAAAPPakAAAPPAKAAAAPAKAAAAPAKAAAAPAkAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
436-638 1.03e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.38  E-value: 1.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  436 QVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQvtSDTPASsspPQVTSDTPASSSPPQV 515
Cdd:PLN03209  304 EVIAETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPI--EEEPPQ---PKAVVPRPLSPYTAYE 378
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  516 TSETPASSSPPQVTSDTSAS-----ISPPQVISDTPASSSPPQV------TSET-----------------PASSSPTNM 567
Cdd:PLN03209  379 DLKPPTSPIPTPPSSSPASSksvdaVAKPAEPDVVPSPGSASNVpevepaQVEAkktrplspyaryedlkpPTSPSPTAP 458
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755526783  568 TSDTPASSSPT--NMTSDTPASSSPTNMTSDTPASSSP--PWPVITEVTRPESTIPAGRSLANITSKAQEDSPLG 638
Cdd:PLN03209  459 TGVSPSVSSTSsvPAVPDTAPATAATDAAAPPPANMRPlsPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVG 533
motB PRK12799
flagellar motor protein MotB; Reviewed
425-556 1.07e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 43.17  E-value: 1.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  425 TSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVts 504
Cdd:PRK12799  296 HGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQS----ATTTQASAVALSSAGVLPSDVTLPGTVALPAA-- 369
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 755526783  505 dTPASSSPPQVTSETPASSSPPQVTSDTSasiSPPQVISDTPASSSPPQVTS 556
Cdd:PRK12799  370 -EPVNMQPQPMSTTETQQSSTGNITSTAN---GPTTSLPAAPASNIPVSPTS 417
DUF612 pfam04747
Protein of unknown function, DUF612; This family includes several uncharacterized proteins ...
270-575 1.12e-03

Protein of unknown function, DUF612; This family includes several uncharacterized proteins from Caenorhabditis elegans.


Pssm-ID: 282585 [Multi-domain]  Cd Length: 511  Bit Score: 43.13  E-value: 1.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   270 SDTPASSSPpQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSppqgTSDTPASSSPPQvtSATSASSSPPQGTSD 349
Cdd:pfam04747  198 TNTPAEPAE-QVQEITGKKNKKNKKKSESEATAAPASVEQVVEQPKV----VTEEPHQQAAPQ--EKKNKKNKRKSESEN 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   350 TPASS-SPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQgtSETPGFSSPPQVTTATLVSSSPPQVTSET 428
Cdd:pfam04747  271 VPAASeTPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQAEAPK--SKKPTADDNMDFLDFVTAKEEPKDEPAET 348
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   429 PASssPTQVTSETPASSSPTQVTSdTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVT----SDTPASSSPPQVTS 504
Cdd:pfam04747  349 PAA--PVEEVVENVVENVVEKSTT-PPATENKKKNKKDKKKSESEKVTEQPVESAPAPPQVEqvveTTPPASENKKKNKK 425
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 755526783   505 DTPASSSPPQVtsETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASS 575
Cdd:pfam04747  426 DKKKSESEKAV--EEPVQAAPSSKKPTADDNMDFLDFVTAKPDKSESVEEHIAAPMIVEPAHADEETAAAA 494
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
347-551 1.15e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.52  E-value: 1.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  347 TSDTPASSSPPQgTLDTPSSSSPPQgtsdTPASSSPPQGTsETPASNSPPQgtSETPGFSspPQVTTATLVSSSppqvts 426
Cdd:PTZ00449  617 LLDIPKSPKRPE-SPKSPKRPPPPQ----RPSSPERPEGP-KIIKSPKPPK--SPKPPFD--PKFKEKFYDDYL------ 680
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  427 eTPASSSPTQVTSETPASSSPTQVTSDTPASNSPPqgtSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASS---SPPQVT 503
Cdd:PTZ00449  681 -DAAAKSKETKTTVVLDESFESILKETLPETPGTP---FTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIeffTPPEEE 756
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 755526783  504 S----DTPASSSPPQVTSETPASsspPQVTSDTSASISP------PQVISDTPASSSP 551
Cdd:PTZ00449  757 RtffhETPADTPLPDILAEEFKE---EDIHAETGEPDEAmkrpdsPSEHEDKPPGDHP 811
PRK11907 PRK11907
bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;
443-552 1.31e-03

bifunctional 2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase;


Pssm-ID: 237019 [Multi-domain]  Cd Length: 814  Bit Score: 43.30  E-value: 1.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  443 ASSSPTQVTSDTPASNSPPqgtsdtpgfSSPTQVTTATLVSSSppqVTSDTPASSSPPQVTSDTPASSSPpqvtsETPAS 522
Cdd:PRK11907   18 LTASNPKLAQAEEIVTTTP---------ATSTEAEQTTPVESD---ATEEADNTETPVAATTAAEAPSSS-----ETAET 80
                          90       100       110
                  ....*....|....*....|....*....|
gi 755526783  523 SSPPQVTSDTSASISPPQVISDTpaSSSPP 552
Cdd:PRK11907   81 SDPTSEATDTTTSEARTVTPAAT--ETSKP 108
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
404-526 1.38e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 43.16  E-value: 1.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  404 GFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVS 483
Cdd:PRK14951  369 AAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALA 448
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 755526783  484 SSPPQVTSDTPAsSSPPQVTSDTPASSSPPQVTSETPASSSPP 526
Cdd:PRK14951  449 PAPPAQAAPETV-AIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
430-554 1.43e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 42.63  E-value: 1.43e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  430 ASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPgfssPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSDTPA 508
Cdd:PTZ00436  225 AAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAPPAKAAAP----PAKAAAPPAKAAAPPAKAAAPPAkAAAPPAKAAAAPA 300
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 755526783  509 -SSSPPQVTSETPA--SSSPPQVTSDTSASISPPQVISDTPASSSPPQV 554
Cdd:PTZ00436  301 kAAAAPAKAAAAPAkaAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
542-603 1.64e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates endometrial decidualization, a prerequisite for successful implantation and the establishment of pregnancy, by transcriptionally repressing Nur77 expression. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 41.53  E-value: 1.64e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783  542 ISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSP 603
Cdd:cd21441    65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
PRK12495 PRK12495
hypothetical protein; Provisional
450-563 1.68e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 41.78  E-value: 1.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  450 VTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV-TSDTPASSSPPQVTSETPASSSPPQV 528
Cdd:PRK12495   68 VTEDGAAGDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTsATDEAATDPPATAAARDGPTPDPTAQ 147
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 755526783  529 TSDTSASISP---PQVISDTPASSSPPQVTSETPASSS 563
Cdd:PRK12495  148 PATPDERRSPrqrPPVSGEPPTPSTPDAHVAGTLQAAR 185
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
507-660 1.69e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 42.73  E-value: 1.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   507 PASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPA----SSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTS 582
Cdd:pfam05539  169 KTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATAnqrlSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQ 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   583 DTPASSSP-TNMTSD----TPASSSPPWPVITEVTRPESTIPAgrSLANITSKAQEDSPLGVISTHPQMSFQSSTSQQAL 657
Cdd:pfam05539  249 HPPSTTSQdQSTTGDgqehTQRRKTPPATSNRRSPHSTATPPP--TTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQAN 326

                   ...
gi 755526783   658 DET 660
Cdd:pfam05539  327 PTT 329
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
414-552 1.71e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.93  E-value: 1.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  414 ATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTsdtPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdt 493
Cdd:PRK07994  363 APLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVP---PPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATK-- 437
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  494 PASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAS-ISPPQVISDTPASSSPP 552
Cdd:PRK07994  438 AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYrWKATNPVEVKKEPVATP 497
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
485-609 1.71e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.78  E-value: 1.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  485 SPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSAsiSPPQVISDTPASSSPPQVTSETPASSSP 564
Cdd:PRK14951  372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAA--PPAPVAAPAAAAPAAAPAAAPAAVALAP 449
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 755526783  565 TNMTSDTPASSSPTNMTSDTPASSSPtnmtSDTPASSSPPWPVIT 609
Cdd:PRK14951  450 APPAQAAPETVAIPVRVAPEPAVASA----APAPAAAPAAARLTP 490
2A1904 TIGR00927
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
208-572 1.73e-03

K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.06  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   208 SSAPPKATHRMTITSLTGRPQVTSDtlaSSSPPQGTSDTPassSPPQVTSATSASSSPpqgTSDTPASSSPPQVtsatsa 287
Cdd:TIGR00927   76 SSDPPKSSSEMEGEMLAPQATVGRD---EATPSIAMENTP---SPPRRTAKITPTTPK---NNYSPTAAGTERV------ 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   288 sssppqgTSDTPASssPPQVTSATSASSSPPQGTSDTPA------SSSPPQVTSatsasssppQGTSDTPASSSPPQGTL 361
Cdd:TIGR00927  141 -------KEDTPAT--PSRALNHYISTSGRQRVKSYTPKprgevkSSSPTQTRE---------KVRKYTPSPLGRMVNSY 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   362 DTPSSSSPPQGTSDTPASSsppQGTSETPASNS--PPQGTSETPGFSSPpqVTTATLVSSSPPQVTS--ETPASSSPTQV 437
Cdd:TIGR00927  203 APSTFMTMPRSHGITPRTT---VKDSEITATYKmlETNPSKRTAGKTTP--TPLKGMTDNTPTFLTRevETDLLTSPRSV 277
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   438 TsETPASSSPTQVTSDTPAS-------NSP--PQGT--SDTPGfSSPTQVTTATLVSSSPPQVTSDTPA-SSSPPQVTSD 505
Cdd:TIGR00927  278 V-EKNTLTTPRRVESNSSTNhwglvgkNNLttPQGTvlEHTPA-TSEGQVTISIMTGSSPAETKASTAAwKIRNPLSRTS 355
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   506 TPA-------------SSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPaSSSPTNMTSDTP 572
Cdd:TIGR00927  356 APAvriasatfrglekNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPSLTTALFPEAP-SPSPSALPPGQP 434
motB PRK12799
flagellar motor protein MotB; Reviewed
399-530 1.81e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 42.40  E-value: 1.81e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  399 TSETPGFSSPPqvTTATLVSSSPPQVTSETPASssptqVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTT 478
Cdd:PRK12799  296 HGTVPVAAVTP--SSAVTQSSAITPSSAAIPSP-----AVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPA 368
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 755526783  479 ATLVSSSPPQVTSDTPASSSppqVTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PRK12799  369 AEPVNMQPQPMSTTETQQSS---TGNITSTANGPTTSLPAAPASNIPVSPTS 417
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
346-460 2.12e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 42.36  E-value: 2.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  346 GTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGF---SSPPQVTTATLVSSSPP 422
Cdd:PRK14959  375 GGGASAPSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVpwdDAPPAPPRSGIPPRPAP 454
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 755526783  423 QVTSETPASSSPTQVTSEtpASSSPTQVTSDTPASNSP 460
Cdd:PRK14959  455 RMPEASPVPGAPDSVASA--SDAPPTLGDPSDTAEHTP 490
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
460-647 2.18e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 42.56  E-value: 2.18e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  460 PPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASI--- 536
Cdd:PRK12323  365 PGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASArgp 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  537 ----SPPQVISDTPASSSPPQVTS-ETPASSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWP--VIT 609
Cdd:PRK12323  445 ggapAPAPAPAAAPAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAgwVAE 524
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 755526783  610 EVTRPESTIPAGRSLANITSKAQEDSPLGVISTHPQMS 647
Cdd:PRK12323  525 SIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVA 562
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
415-590 2.26e-03

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 42.23  E-value: 2.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   415 TLVSSSPPQVTSETPA-----SSSPTQVTSETPASSSPTQVT---SDTPASNSPPqgTSDTPGFSSPTQVTTATlVSSSP 486
Cdd:pfam16014    1 ALGSSPRPSILRKKPAtegakPKPDIHVAVAPPVTVAVEALPgqnSEQQTASASP--PSQHPAQAIPTILAPAA-PPSQP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   487 PQVTSDTPASS--SPPQVTSDTPASSSPPQvtsetPASSSPPQ-VTSDTSASISPPQVISDTPASSSPPQVTSETPASSs 563
Cdd:pfam16014   78 SVVLSTLPAAMavTPPIPASMANVVAPPTQ-----PAASSTAAcAVSSVLPEIKIKQEAEPMDTSQSVPPLTPTSISPA- 151
                          170       180
                   ....*....|....*....|....*..
gi 755526783   564 ptnMTSDTPASSSPtnmTSDTPASSSP 590
Cdd:pfam16014  152 ---LTSLANNLSVP---AGDLLPGASP 172
PRK10856 PRK10856
cytoskeleton protein RodZ;
419-512 2.27e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 41.94  E-value: 2.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  419 SSPPQVTSETPASSSPTQVtseTPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS 498
Cdd:PRK10856  161 SVPLDTSTTTDPATTPAPA---APVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPL 237
                          90
                  ....*....|....
gi 755526783  499 PPQVTSDTPASSSP 512
Cdd:PRK10856  238 PTDQAGVSTPAADP 251
PLN03131 PLN03131
hypothetical protein; Provisional
378-609 2.48e-03

hypothetical protein; Provisional


Pssm-ID: 178677 [Multi-domain]  Cd Length: 705  Bit Score: 42.46  E-value: 2.48e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  378 ASSSPPQGTSETPA-SNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTS---ETPASSSPTqvtSD 453
Cdd:PLN03131  356 APMAPPIDLFQLPAtSPAPPVDLFEIPPLDPAPAINAYQPPQTSLPSSIDLFGGITQQQSINSldeKSPELSIPK---NE 432
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  454 TPASNSPPQGTSDTPGFS--SPTQVTTATLVSSSPPQVTSDTPASSSPP-QVTSDTPASSSPPQVTSETPASSSPPQVTS 530
Cdd:PLN03131  433 GWATFDGIQPIASTPGNEnlTPFSIGPSMAGSANFDQVPSLDKGMQWPPfQNSSDEESASGPAPWLGDLHNVEAPDNTSA 512
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  531 DTSASISPPQVISDTPASSSPPQVTSETPASSSPT-NMTSDTPASSSPT---------NMTSDTPASSSPTNMTSDtPAS 600
Cdd:PLN03131  513 QNWNAFEFDDSVAGIPLEGIKQSSEPQTAANMPPTaDQLIGCKALEDFNkdgikrtapHGQGELPGLDEPSDILAE-PSY 591

                  ....*....
gi 755526783  601 SSPPWPVIT 609
Cdd:PLN03131  592 TPPAHPIME 600
PRK10856 PRK10856
cytoskeleton protein RodZ;
418-499 2.68e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 41.55  E-value: 2.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  418 SSSPPQVTSETPA--SSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPA 495
Cdd:PRK10856  168 TTTDPATTPAPAApvDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247

                  ....
gi 755526783  496 SSSP 499
Cdd:PRK10856  248 AADP 251
DUF3583 pfam12126
Protein of unknown function (DUF3583); This domain is found in eukaryotes, and is typically ...
388-571 2.77e-03

Protein of unknown function (DUF3583); This domain is found in eukaryotes, and is typically between 302 and 338 amino acids in length. It is found in association with pfam00097 and pfam00643. Most members are promyelocytic leukemia proteins, and this family lies towards the C-terminus.


Pssm-ID: 432347 [Multi-domain]  Cd Length: 329  Bit Score: 41.72  E-value: 2.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   388 ETPASnspPQGTSETPGF-----------SSPPQVTTATLVSSSPPQvTSETPASSSptqvTSETPASSSPTQVTSDTPA 456
Cdd:pfam12126  123 EEPQN---LQAAVRTDGFdefkvrlqdlvSCITQGTDAAVSRRASPE-AASTPRDPS----DVDLPEEVQRVQAQALGLA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   457 SNSPPQGTSDTPGfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSET---PASSSPPQVTSDTS 533
Cdd:pfam12126  195 ETQPVAVVQSVPG-AHPVPVYAFSIKDPSYREEVSNTVTPQKRKSCQTECPRKVIKMESEEEKearLARSSPEQPRPSTS 273
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 755526783   534 ASISPPQVisDTPASSSPPQVTSETPASSSpTNMTSDT 571
Cdd:pfam12126  274 KAVSPPHL--DGPPSPESPIVGKEVLLPNS-NHVTSDP 308
PHA03193 PHA03193
tegument protein VP11/12; Provisional
347-476 2.97e-03

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 42.01  E-value: 2.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  347 TSDTPASSSPPQGTLDTPSSSSPPqgtsdtpASSSPPQGTSETPASNSPPQGTsETPGFSSPPQVTTATLVSSSppQVTS 426
Cdd:PHA03193  456 IHEALANNGQAIFPECFSGDLPPI-------AQALLSADELPNDTTASTSNEM-KGDAECPAAQDAAAILPASF--QIEN 525
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 755526783  427 ETPASSSPTQVTSetpASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQV 476
Cdd:PHA03193  526 GGAADGSGLAIPA---AMCDATAVESPSTVAETPPERLLAAESGPRCKAT 572
PHA03291 PHA03291
envelope glycoprotein I; Provisional
350-435 3.05e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 41.86  E-value: 3.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  350 TPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASnspPQGTSETPGFSSPPqvttatlvsssPPQVTSETP 429
Cdd:PHA03291  204 VPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQ---AGTTPEAEGTPAPP-----------TPGGGEAPP 269

                  ....*.
gi 755526783  430 ASSSPT 435
Cdd:PHA03291  270 ANATPA 275
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
345-444 3.12e-03

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 42.00  E-value: 3.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  345 QGTSDTPASSSP---------PQGTLDTPSSSSPPQ---GTSDTPASSSPPQGTSETPASNSPPQGTSE--TPGFSSP-- 408
Cdd:PLN02217  545 QGDAWIPGKGVPyipglfagnPGSTNSTPTGSAASSnttFSSDSPSTVVAPSTSPPAGHLGSPPATPSKivSPSTSPPas 624
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 755526783  409 ----PQVTTATLVSSSPPQVTSETPASSSPTQVTSETPAS 444
Cdd:PLN02217  625 hlgsPSTTPSSPESSIKVASTETASPESSIKVASTESSVS 664
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
494-650 3.41e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.01  E-value: 3.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  494 PASSSPPQVTSdtpASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDT---PASSSPPQVTSETPASSSPTNMTSD 570
Cdd:PRK14951  366 PAAAAEAAAPA---EKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPaapPAAAPPAPVAAPAAAAPAAAPAAAP 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  571 TPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVIT--EVTRPESTIPAGRSLANITSKAQEDSPLGVISthpQMSF 648
Cdd:PRK14951  443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAapAAARLTPTEEGDVWHATVQQLAAAEAITALAR---ELAL 519

                  ..
gi 755526783  649 QS 650
Cdd:PRK14951  520 QS 521
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
325-532 3.45e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 41.87  E-value: 3.45e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  325 PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPG 404
Cdd:PRK14948  364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPTPPANAANAPPSLNLEE 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  405 -----FSSPPQVTT-------ATLVSSSPPQVT------------SETP--------ASSSPTQVTSETPASSSPTQVTS 452
Cdd:PRK14948  444 lwqqiLAKLELPSTrmllsqqAELVSLDSNRAViavspnwlgmvqSRKPlleqafakVLGRSIKLNLESQSGSASNTAKT 523
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  453 DTPASNSPPQGTSdTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPqvtsdtPASSSPPQVTSETPASSSPPQVTSDT 532
Cdd:PRK14948  524 PPPPQKSPPPPAP-TPPLPQPTATAPPPTPPPPPPTATQASSNAPAQI------PADSSPPPPIPEEPTPSPTKDSSPEE 596
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
239-599 3.49e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.98  E-value: 3.49e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  239 PPQGTSDTPASSSPPQvtsatsasssppqGtsdtPASSSPPQVTSATSASSSPPQGTSDtpaSSSPPQVTSATSASSSPP 318
Cdd:PTZ00449  497 APIEEEDSDKHDEPPE-------------G----PEASGLPPKAPGDKEGEEGEHEDSK---ESDEPKEGGKPGETKEGE 556
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  319 QGTSDTPASSSPPqvtsatsasssppqgtSDTPASSSPPQGTLDTPSSSSPpqgtsDTPASSSPPQgTSETPASNSPPQg 398
Cdd:PTZ00449  557 VGKKPGPAKEHKP----------------SKIPTLSKKPEFPKDPKHPKDP-----EEPKKPKRPR-SAQRPTRPKSPK- 613
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  399 tseTPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPtqvtsdtPASNSPP----------QGTSDTP 468
Cdd:PTZ00449  614 ---LPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKP-------PKSPKPPfdpkfkekfyDDYLDAA 683
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  469 GFSSPTqVTTATLVSSSPPQVTSDTPASSSPPQVTSDT--PASSSPPQVTSETPASSSPPQvtSDTSASISPPQ----VI 542
Cdd:PTZ00449  684 AKSKET-KTTVVLDESFESILKETLPETPGTPFTTPRPlpPKLPRDEEFPFEPIGDPDAEQ--PDDIEFFTPPEeertFF 760
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 755526783  543 SDTPASSSPPQVTSETPASSsptNMTSDTPASSSPTNmTSDTPASSSPTNmTSDTPA 599
Cdd:PTZ00449  761 HETPADTPLPDILAEEFKEE---DIHAETGEPDEAMK-RPDSPSEHEDKP-PGDHPS 812
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
354-502 3.54e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 41.47  E-value: 3.54e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  354 SSPPQGTlDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPgfssPPQVTTATLVSSSPPQVTSETPASSS 433
Cdd:PTZ00436  208 AAAPSGK-KSAKAAAPAKAAAAPAKAAAPPAKAAAAPAKAAAAPAKAAAP----PAKAAAPPAKAAAPPAKAAAPPAKAA 282
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 755526783  434 ptqvTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSS--PTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:PTZ00436  283 ----APPAKAAAPPAKAAAAPAKAAAAPAKAAAAPAKAAapPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349
PPE COG5651
PPE-repeat protein [Function unknown];
295-526 3.62e-03

PPE-repeat protein [Function unknown];


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.42  E-value: 3.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  295 TSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSppqvtSATSASSSPPQGTSDTPASSSPPQGTldtpSSSSPPQGTS 374
Cdd:COG5651   162 VALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPG-----FANLGLTGLNQVGIGGLNSGSGPIGL----NSGPGNTGFA 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  375 DTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSETPASSSPTQVTSDT 454
Cdd:COG5651   233 GTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGA 312
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783  455 PASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSdTPASSSPPQVTSDTPASSSPPQVTSETPASSSPP 526
Cdd:COG5651   313 GGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAA-AAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGA 383
PHA03193 PHA03193
tegument protein VP11/12; Provisional
433-583 3.62e-03

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 41.63  E-value: 3.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  433 SPTQVTSETPASSSPTqvtSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSD-----TPASSSPPQ-VTSDT 506
Cdd:PHA03193  441 SPFQRKRAMPEDGGEI---HEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTsnemkGDAECPAAQdAAAIL 517
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  507 PASSsppQVTSETPASSSPPQVTSDtsaSISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPT------NM 580
Cdd:PHA03193  518 PASF---QIENGGAADGSGLAIPAA---MCDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEeilrrlRM 591

                  ...
gi 755526783  581 TSD 583
Cdd:PHA03193  592 ASD 594
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
347-501 3.77e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.77  E-value: 3.77e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  347 TSDTPASSSPPQG----TLDTPSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTAtlVSSSPP 422
Cdd:PRK07994  367 EPEVPPQSAAPAAsaqaTAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKA--KKSEPA 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASN---SPPQGTSDTPGfSSPTQVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:PRK07994  445 AASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwkaTNPVEVKKEPV-ATPKALKKALEHEKTPELAAKLAAEAIER 523

                  ..
gi 755526783  500 PQ 501
Cdd:PRK07994  524 DP 525
CLECT smart00034
C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function ...
35-142 3.80e-03

C-type lectin (CTL) or carbohydrate-recognition domain (CRD); Many of these domains function as calcium-dependent carbohydrate binding modules.


Pssm-ID: 214480 [Multi-domain]  Cd Length: 124  Bit Score: 38.73  E-value: 3.80e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783     35 GNSCYQLNRLFCDFQEADNYCHAQRGRLA-------HTWnpkLRGFLKSFLNEETVW------WVRGNLTlpgshpGINQ 101
Cdd:smart00034    9 GGKCYKFSTEKKTWEDAQAFCQSLGGHLAsihseaeNDF---VASLLKNSGSSDYYWiglsdpDSNGSWQ------WSDG 79
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*..
gi 755526783    102 TGGDDVLrNQKPGEcPSVVTHSNAVFS----RWNL--CIEKHHFICQ 142
Cdd:smart00034   80 SGPVSYS-NWAPGE-PNNSSGDCVVLStsggKWNDvsCTSKLPFVCE 124
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
484-607 3.92e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 41.09  E-value: 3.92e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  484 SSPPQVTSDTPA-SSSPPQVTSDTPASSSPPQVTSETP---ASSSPPQVTSDTSASISPPQVISDTPA-SSSPPQVTSET 558
Cdd:PTZ00436  219 AAAPAKAAAAPAkAAAPPAKAAAAPAKAAAAPAKAAAPpakAAAPPAKAAAPPAKAAAPPAKAAAPPAkAAAPPAKAAAA 298
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 755526783  559 PA--SSSPTNMTSDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPV 607
Cdd:PTZ00436  299 PAkaAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAAAPV 349
PRK10856 PRK10856
cytoskeleton protein RodZ;
391-485 3.97e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 41.17  E-value: 3.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  391 ASNSPPQGTSE--TPGFSSPPQVTTATLVSSSPPQVTSETPASSSPT--QVTSETPASSSPTQVTSDTPASNSPPQGTSD 466
Cdd:PRK10856  152 AELSQNSGQSVplDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPapAVDPQQNAVVAPSQANVDTAATPAPAAPATP 231
                          90       100
                  ....*....|....*....|
gi 755526783  467 TPGFSSPT-QVTTATLVSSS 485
Cdd:PRK10856  232 DGAAPLPTdQAGVSTPAADP 251
dnaA PRK14086
chromosomal replication initiator protein DnaA;
363-565 3.97e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.73  E-value: 3.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  363 TPSSSSPPQGTSDTPASSSP--PQGTSETPASNSPPQGTSETPGFSSPPQVTTAtlVSSSPPQVTSETPASSSPTqvtse 440
Cdd:PRK14086   89 DPSAGEPAPPPPHARRTSEPelPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTA--RPAYPAYQQRPEPGAWPRA----- 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  441 tPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPpqvtsetP 520
Cdd:PRK14086  162 -ADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRPRRDRTDRPEPP-------P 233
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 755526783  521 ASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PRK14086  234 GAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEPT 278
CytochromB561_N pfam09786
Cytochrome B561, N terminal; Members of this family are found in the N terminal region of ...
417-577 4.02e-03

Cytochrome B561, N terminal; Members of this family are found in the N terminal region of cytochrome B561, as well as in various other putative uncharacterized proteins.


Pssm-ID: 462899  Cd Length: 579  Bit Score: 41.73  E-value: 4.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   417 VSSSPPQVTSETPASSSPTQVTS---ETPASSSPTQVTSDTPASNSPPQGTSDTPGfssptqvttatlvssspPQVTSDT 493
Cdd:pfam09786   89 VQSKSPSKGTKTPSRLTNQQLGLlglKPNDSSFVTTHRKKPPKSKSSPQSPSPVLV-----------------PLHQSVS 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   494 PASSSPPQVTSDTPASSSPPQVTSETPASSSppqvtsdtsasISPPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPA 573
Cdd:pfam09786  152 PSSSESRKGGDKSPAGSGKKLRSFSTSSKSP-----------ASPSVYLRGSPVPLNSSPLPSDRNYENSVQSSPEIDSA 220

                   ....
gi 755526783   574 SSSP 577
Cdd:pfam09786  221 VSTP 224
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
345-510 4.11e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 41.77  E-value: 4.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  345 QGTSDTPAssSPPQGTLDTPSSssppQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQV 424
Cdd:PRK07994  362 AAPLPEPE--VPPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGA 435
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  425 TSetPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFS-SPTQVTTATLVSSSPPQVTSDTPASSSPPQVT 503
Cdd:PRK07994  436 TK--AKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKKEAYRwKATNPVEVKKEPVATPKALKKALEHEKTPELA 513

                  ....*..
gi 755526783  504 SDTPASS 510
Cdd:PRK07994  514 AKLAAEA 520
PHA03193 PHA03193
tegument protein VP11/12; Provisional
409-570 4.15e-03

tegument protein VP11/12; Provisional


Pssm-ID: 177555  Cd Length: 594  Bit Score: 41.63  E-value: 4.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  409 PQVTTATLVSSSPPqvTSETPASSSPTQVTSETPASSSPtqvtsdtPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQ 488
Cdd:PHA03193  442 PFQRKRAMPEDGGE--IHEALANNGQAIFPECFSGDLPP-------IAQALLSADELPNDTTASTSNEMKGDAECPAAQD 512
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  489 VTSDTPASSsppQVTSDTPASSSPPQVTSetpASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPT--- 565
Cdd:PHA03193  513 AAAILPASF---QIENGGAADGSGLAIPA---AMCDATAVESPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEeil 586

                  ....*...
gi 755526783  566 ---NMTSD 570
Cdd:PHA03193  587 rrlRMASD 594
PRK11901 PRK11901
hypothetical protein; Reviewed
345-530 4.62e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 40.82  E-value: 4.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  345 QGTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQgtsetPASNSPPQGTS--ETPGFssppqvttatlVSSSPP 422
Cdd:PRK11901   91 NQSSPSAANNTSDGHDASGVKNTAPPQDISAPPISPTPTQ-----AAPPQTPNGQQriELPGN-----------ISDALS 154
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  423 QVTSETPASSSPTQvtseTPASSSPTqvtsdTPASNSPPQGTSdtPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:PRK11901  155 QQQGQVNAASQNAQ----GNTSTLPT-----APATVAPSKGAK--VPATAETHPTPPQKPATKKPAVNHHKTATVAVPPA 223
                         170       180
                  ....*....|....*....|....*....
gi 755526783  503 TSDTPASSSPPQ-VTSETPASSSPPQVTS 530
Cdd:PRK11901  224 TSGKPKSGAASArALSSAPASHYTLQLSS 252
PHA03291 PHA03291
envelope glycoprotein I; Provisional
506-577 4.90e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 41.09  E-value: 4.90e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783  506 TPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSP-TNMTSDTPASSSP 577
Cdd:PHA03291  204 VPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPgGGEAPPANATPAP 276
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
211-573 4.90e-03

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 41.20  E-value: 4.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  211 PPKATHRMTITSLTgRPQVTSDTLASSSPPQGTSDT----PASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATS 286
Cdd:COG5180   187 EPRDALKDSPEKLD-RPKVEVKDEAQEEPPDLTGGAdhprPEAASSPKVDPPSTSEARSRPATVDAQPEMRPPADAKERR 265
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  287 AsssppqgtsdTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQvtsatsasssppqgTSDTPASSSPPQGTLDTPSS 366
Cdd:COG5180   266 R----------AAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARP--------------IDVKGVASAPPATRPVRPPG 321
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  367 SSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSETPGFSSP--PQVTTATLVSSSPPQVTSETPASSSPTQVTSETPAS 444
Cdd:COG5180   322 GARDPGTPRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPgkPLEQGAPRPGSSGGDGAPFQPPNGAPQPGLGRRGAP 401
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  445 SSPtqvtsdtPASNSPPQGTSD--TPGFSSPTQVTTATLVSSSPPQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPAS 522
Cdd:COG5180   402 GPP-------MGAGDLVQAALDggGRETASLGGAAGGAGQGPKADFVPGDAESVSGPAGLADQAGAAASTAMADFVAPVT 474
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|.
gi 755526783  523 SSPPQVTSDTSASISPPQVISDTPASSSPPQVTseTPASSSPTNMTSDTPA 573
Cdd:COG5180   475 DATPVDVADVLGVRPDAILGGNVAPASGLDAET--RIIEAEGAPATEDFVA 523
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
423-575 5.07e-03

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 41.12  E-value: 5.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  423 QVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPPQV 502
Cdd:PRK13108  289 EYVVDEALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAV---KAEVAEVTDEVAAESVVQVADRDGESTPAVEE 365
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755526783  503 TSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSP--PQVTSETPASSSPTNMTSDTPASS 575
Cdd:PRK13108  366 TSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPevPEKAAPIPDPAKPDELAVAGPGDD 440
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
364-455 5.25e-03

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 40.88  E-value: 5.25e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  364 PSSSSPPQGTSDTPASSSPPQGTSET---PASNSPPQGTSETpgfssPPQVTTATLVSSSPPQVTSETPASSSPtqvtSE 440
Cdd:PRK13335   80 PNTNEEKTSASKIEKISQPKQEEQKSlniSATPAPKQEQSQT-----TTESTTPKTKVTTPPSTNTPQPMQSTK----SD 150
                          90
                  ....*....|....*.
gi 755526783  441 TPASSSPTQVTSD-TP 455
Cdd:PRK13335  151 TPQSPTIKQAQTDmTP 166
Mating_C pfam12737
C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine ...
361-598 5.35e-03

C-terminal domain of homeodomain 1; Mating in fungi is controlled by the loci that determine the mating type of an individual, and only individuals with differing mating types can mate. Basidiomycete fungi have evolved a unique mating system, termed tetrapolar or bifactorial incompatibility, in which mating type is determined by two unlinked loci; compatibility at both loci is required for mating to occur. The multi-allelic tetrapolar mating system is considered to be a novel innovation that could have only evolved once, and is thus unique to the mushroom fungi. This domain is C-terminal to the homeodomain transcription factor region.


Pssm-ID: 372279 [Multi-domain]  Cd Length: 412  Bit Score: 41.13  E-value: 5.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   361 LDTPSSSSPPQGTSDTPASSSpPQGTSETPASNSPpqgtseTPGFSSPPQVTTATLVSSSPPQVTSEtpassspTQVTSE 440
Cdd:pfam12737  123 LDSPSSSSSPEKCLPSPAPSE-QEALSEISAACGP------TPSTLTPLNVAPSLTPSKKRKRCLSD-------GFDGPK 188
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   441 TPASSSPT---QVTSDT-PASNSPPQGTSDTPGFSSPtqvtTATLVSSSPPQVTSDTPASSSP-----------PQVTSD 505
Cdd:pfam12737  189 RPPNKRVQprpQTVSDPfPTSTSIPEWDEWLQNHMSP----SLTLHGDIPPPVSVEAPDSNTPldieifnfpyhPDLTPS 264
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783   506 TPASSSPPQVTSETPASSSPPQVTSDTSASIspPQVISDTPASSSPPQVTSETPASSSPTNMTSDTPASSSPTNMTSDTP 585
Cdd:pfam12737  265 PAPSLSDSVIEVATPTTESDYMCNGTLRQTF--SWFEFDFPELIQPTNTPASNNELSLPFDPSTDIVVSRTILPLLDWRS 342
                          250
                   ....*....|...
gi 755526783   586 ASSSPTNMTSDTP 598
Cdd:pfam12737  343 QSFLSQTFASPPH 355
PHA03249 PHA03249
DNA packaging tegument protein UL25; Provisional
429-561 5.89e-03

DNA packaging tegument protein UL25; Provisional


Pssm-ID: 223023  Cd Length: 653  Bit Score: 41.15  E-value: 5.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  429 PASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTsDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSS--PPQVTSDT 506
Cdd:PHA03249   33 PRPRAPTEDLDRMEAGLSSYSSSSDNKSSFEVVSET-DSGSEAEAERGRRAGMGGRNKATKPSRRNKTTQcrPTSLALAT 111
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783  507 PASSSPPQVTSETPASSSPPQVTSDTS-------ASISPPQVISDTPASSSPPQVTSETPAS 561
Cdd:PHA03249  112 AATMPATPSSGKSPKVSSPPSIPSLSEedegaerNSGGDDSSHTDNESTQSQPEADDEPDLA 173
PRK10856 PRK10856
cytoskeleton protein RodZ;
346-437 6.03e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 40.39  E-value: 6.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  346 GTSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSSPPQGTSETPASnsppQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:PRK10856  167 STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPS----QANVDTAATPAPAAPATPDGAAPLPTDQA 242
                          90
                  ....*....|..
gi 755526783  426 SETPASSSPTQV 437
Cdd:PRK10856  243 GVSTPAADPNAL 254
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
268-495 6.21e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.01  E-value: 6.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  268 GTSDTPASSSPpqvtsATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGTSDTPASSSPPQVTSATSASSSPPQGT 347
Cdd:PRK12323  371 GAGPATAAAAP-----VAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPG 445
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  348 SDTPASSSPPQgtldTPSSSSPPQGTS-DTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVTS 426
Cdd:PRK12323  446 GAPAPAPAPAA----APAAAARPAAAGpRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  427 ETPASSSPTQVTSETPASSSPTQvtsdtPASNSPPQGTSDTPGFSSPTQVTTATlvSSSPPQVTSDTPA 495
Cdd:PRK12323  522 VAESIPDPATADPDDAFETLAPA-----PAAAPAPRAAAATEPVVAPRPPRASA--SGLPDMFDGDWPA 583
PRK13335 PRK13335
superantigen-like protein SSL3; Reviewed;
347-425 6.75e-03

superantigen-like protein SSL3; Reviewed;


Pssm-ID: 139494 [Multi-domain]  Cd Length: 356  Bit Score: 40.49  E-value: 6.75e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  347 TSDTPASSSPPQGTLDTPSSSSPPQGTSDTPASSspPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQVT 425
Cdd:PRK13335   89 ASKIEKISQPKQEEQKSLNISATPAPKQEQSQTT--TESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMT 165
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
421-565 8.07e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 8.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  421 PPQVTSETPAsssptqvtsetPASSSPTQVTSDTPASNSPPQgtsdtpgfSSPTQVTTATLVSSSPPQVTSDTPASSSPP 500
Cdd:PRK14951  366 PAAAAEAAAP-----------AEKKTPARPEAAAPAAAPVAQ--------AAAAPAPAAAPAAAASAPAAPPAAAPPAPV 426
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755526783  501 QVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASIsPPQVISDTPASSSPPQVTSETPASSSPT 565
Cdd:PRK14951  427 AAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAI-PVRVAPEPAVASAAPAPAAAPAAARLTP 490
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
420-539 8.07e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 8.07e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  420 SPPQVTSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSPPQVTSDTPASSSP 499
Cdd:PRK14951  372 AAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAP 451
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 755526783  500 PQVTSDTPAsSSPPQVTSETPASSSPPQVTSDTSASISPP 539
Cdd:PRK14951  452 PAQAAPETV-AIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
370-521 8.40e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 40.62  E-value: 8.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  370 PQGTSDTPASssPPQGTSETPASnsppQGTSETPGFSSPPQVTTATLVSSSPPQVTSETPASSSPTQVTSetpASSSPTQ 449
Cdd:PRK07994  361 PAAPLPEPEV--PPQSAAPAASA----QATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLA---ARQQLQR 431
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 755526783  450 VTSDTPASNSPPQGTSDTPgfssPTQVTTATLVSSSP-PQVTSDTPASSSPPQVTSDTPASSSPPQVTSETPA 521
Cdd:PRK07994  432 AQGATKAKKSEPAAASRAR----PVNSALERLASVRPaPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKAL 500
PHA03269 PHA03269
envelope glycoprotein C; Provisional
489-627 8.63e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 40.48  E-value: 8.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  489 VTSDTPASSSPPQVTSDTPASSSPPQVTSETPASSSPPQVTSDTSASISPPQV-ISDTPASSSP--PQVTSETPASSSPT 565
Cdd:PHA03269   18 IIANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLaQAPTPAASEKfdPAPAPHQAASRAPD 97
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 755526783  566 NMTSDTPAsSSPTNMTSDTPASSS---PTNMTSDTPASSSPPWPVITEVTRPESTIpAGRSLANI 627
Cdd:PHA03269   98 PAVAPQLA-AAPKPDAAEAFTSAAqahEAPADAGTSAASKKPDPAAHTQHSPPPFA-YTRSMEHI 160
SP4_N cd22536
N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins ...
345-655 8.92e-03

N-terminal domain of transcription factor Specificity Protein (SP) 4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. Human SP4 is a risk gene of multiple psychiatric disorders including schizophrenia, bipolar disorder, and major depression. SP4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP4.
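
The IUPAC notation in the consensus motif above (N = any nucleotide, R = A/G, Y = C/T) can be expanded mechanically into a regular expression. The following is a minimal Python sketch of that expansion only; the helper names `motif_to_regex` and `find_motif` and the example sequence are hypothetical and are not part of the CDD record.

```python
import re

# IUPAC nucleotide codes used in the motif description (N, R, Y) plus the plain bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}

def motif_to_regex(motif):
    """Translate an IUPAC motif such as 'NNRCRCCYY' into a compiled regex.

    The pattern is wrapped in a lookahead so that overlapping
    occurrences are all reported by finditer().
    """
    body = "".join(IUPAC[ch] for ch in motif.upper())
    return re.compile("(?=(" + body + "))")

def find_motif(motif, sequence):
    """Return the 0-based start positions of every motif occurrence in sequence."""
    pattern = motif_to_regex(motif)
    return [m.start() for m in pattern.finditer(sequence.upper())]

# Hypothetical example sequence, chosen only to illustrate the matching.
print(find_motif("NNRCRCCYY", "TTACACCCTAA"))
```

This scans one strand only; a real promoter search would usually also check the reverse complement.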


Pssm-ID: 411773 [Multi-domain]  Cd Length: 623  Bit Score: 40.67  E-value: 8.92e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  345 QGTSDTPASSSPPQGTldtpSSSSPPQGTSDTPASSSPPQGTSETPASNSPPQGTSetpgfsspPQVTTATLVSSSPPQV 424
Cdd:cd22536    90 QGVSAATSSAAPSSSN----NGSTSPTKVKAGNSNASAPGQFQVIQVQNMQNPSGS--------VQYQVIPQIQTVEGQQ 157
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  425 TSETPASSSPTQVTSET----PASS--SPTQVTSDTPASNSPPQGTSDT-------PGFSSPTQVTTatlVSSSPPQVTS 491
Cdd:cd22536   158 IQISPANATALQDLQGQiqliPAGNnqAILTTPNRTASGNIIAQNLANQtvpvqirPGVSIPLQLQT---IPGAQAQVVT 234
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  492 DTPASSSppQVTSDTPasssppqVTSETPASSSPPQVTSDTSASISPPQVISDTPASSSPPQVTSETPASSSPTNMT--- 568
Cdd:cd22536   235 TLPINIG--GVTLALP-------VINNVAAGGGSGQLVQPSDGGVSNGNQLVSTPITTASVSTMPESPSSSTTCTTTast 305
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  569 ----SDTPASSSPTNMTSDTPASSSPTNMTSDTPASSSPPWPVI-TEVTRPESTIPAGRSLANITSKAQE--------DS 635
Cdd:cd22536   306 sltsSDTLVSSAETGQYASTAASSERTEEEPQTSAAESEAQSSSqLQSNGLQNVQDQSNSLQQVQIVGQPilqqiqiqQP 385
                         330       340
                  ....*....|....*....|
gi 755526783  636 PLGVISTHPQMSFQSSTSQQ 655
Cdd:cd22536   386 QQQIIQAIQPQSFQLQSGQT 405
KLF12_N cd21441
N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as ...
425-486 9.18e-03

N-terminal domain of Kruppel-like factor 12; Kruppel-like factor 12 (also known as Krueppel-like transcription factor 12, KLF12) regulates, by transcriptionally repressing Nur77 expression, endometrial decidualization, which is a prerequisite for successful implantation and the establishment of pregnancy. It is involved in the maturation processes of kidney collecting ducts after birth, and is able to increase the promoter activity of the UT-A1 urea transporter promoter by binding to the CACCC motif. KLF12 has also been found to promote colorectal cancer growth and is also involved in the invasion and apoptosis of basal-like breast carcinoma. KLF12 belongs to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Although these factors bind to similar elements in vitro, they have distinct activities in vivo depending on their expression profile and the sequence of the N-terminal activation/repression domain, which differ between members. KLF12 contains an N-terminal domain that is related to the N-terminal repression domain of KLF8.


Pssm-ID: 410608 [Multi-domain]  Cd Length: 197  Bit Score: 39.22  E-value: 9.18e-03
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 755526783  425 TSETPASSSPTQVTSETPASSSPTQVTSDTPASNSPPQGTSDTPGFSSPTQVTTATLVSSSP 486
Cdd:cd21441    65 TSPTAVSSSPVSMTASASPSSSSSSSSSSSRPASSPTVITSVSSASSVPTVLTPGPLVASAS 126
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
365-554 9.62e-03

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 39.82  E-value: 9.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  365 SSSSPpqgtsdTPASSSPPQGTSETPASNSPPQGTSETPGFSSPPQVTTATLVSSSPPQvTSETPASSSPTQVTSETPAS 444
Cdd:PLN02983    3 SLSVP------CAKTAAAAANVGSRLSRSSFRLQPKPNISFPSKGPNPKRSAVPKVKAQ-LNEVAVDGSSNSAKSDDPKS 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  445 SSPTQVTSDTPASNSPPQGTSDTPGFSSP--TQVTT------------------------------------ATLVSSSP 486
Cdd:PLN02983   76 EVAPSEPKDEPPSNSSSKPNLPDEESISEfmTQVSSlvklvdsrdivelqlkqldcelvirkkealpqppppAPVVMMQP 155
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  487 PQVTSDTPASSSPPQVTSDTPASSSPPqvtseTPASSSPPQVTSDTSASISPPQ--VISDTPASSSPPQV 554
Cdd:PLN02983  156 PPPHAMPPASPPAAQPAPSAPASSPPP-----TPASPPPAKAPKSSHPPLKSPMagTFYRSPAPGEPPFV 220
PHA03269 PHA03269
envelope glycoprotein C; Provisional
471-600 9.73e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 40.48  E-value: 9.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 755526783  471 SSPTQVTTATLVSS-----SPPQVTSDTPASSSP-PQVTSDTPASSSPPQVTSETPASSSP--PQVTSDTSASISPPQVI 542
Cdd:PHA03269   21 NLNTNIPIPELHTSaatqkPDPAPAPHQAASRAPdPAVAPTSAASRKPDLAQAPTPAASEKfdPAPAPHQAASRAPDPAV 100
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 755526783  543 SDTPASSSPPQvtsetpASSSPTNMTSDTPASS-SPTNMTSDTPassSPTNMTSDTPAS 600
Cdd:PHA03269  101 APQLAAAPKPD------AAEAFTSAAQAHEAPAdAGTSAASKKP---DPAAHTQHSPPP 150
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
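
Each hit in the listing above carries a summary line of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", and the preset E-value threshold is 0.01. The following is a minimal Python sketch of how those summary lines might be parsed and filtered against a threshold. The regular expression is inferred from the lines on this page rather than from any official format specification, and the function names `parse_summary` and `hits_below_threshold` are hypothetical.

```python
import re

# Pattern inferred from summary lines on this page, for example:
# "Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 8.07e-03"
# The "[Multi-domain]" tag is optional; some hits omit it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("kind") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }

def hits_below_threshold(lines, threshold=0.01):
    """Keep parsed hits whose E-value is at or below the threshold.

    Whether the server treats the threshold as inclusive is an assumption here.
    """
    parsed = (parse_summary(line) for line in lines)
    return [hit for hit in parsed if hit is not None and hit["e_value"] <= threshold]

if __name__ == "__main__":
    sample = [
        "Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 40.47  E-value: 8.07e-03",
        "Pssm-ID: 223023  Cd Length: 653  Bit Score: 41.15  E-value: 5.89e-03",
        "PRK14951 PRK14951",  # not a summary line; skipped
    ]
    for hit in hits_below_threshold(sample):
        print(hit["pssm_id"], hit["bit_score"], hit["e_value"])
```

Lowering `threshold` (for instance to 0.005) would keep only the stronger hits from the same listing.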

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.