NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|33859580|ref|NP_034835|]
View 

galectin-3 [Mus musculus]

Protein Classification

galectin family protein( domain architecture ID 10658251)

galectin family protein may exclusively bind beta-galactosides such as lactose in a manner independent of metal ions

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Gal-bind_lectin smart00908
Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are ...
137-258 1.22e-54

Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are classified according to the carbohydrate-recognition domain (CRD) of which there are two main types, S-type and C-type. Galectins (previously S-lectins) bind exclusively beta-galactosides like lactose. They do not require metal ions for activity. Galectins are found predominantly, but not exclusively in mammals. Their function is unclear. They are developmentally regulated and may be involved in differentiation, cellular regulation and tissue construction.


:

Pssm-ID: 214904  Cd Length: 122  Bit Score: 172.01  E-value: 1.22e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    137 PGGVMPRMLITIMGTVKPNANRIVLDFRRG--NDVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESGKPFKI 214
Cdd:smart00908   1 PGGLSPGSSITIRGIVLPDAKRFSINLQCGpnADIALHFNPRFDEG---TIVRNSKQNGKWGKEERSGGFPFQPGQPFEL 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 33859580    215 QVLVEADHFKVAVNDAHLLQYNHRMKnLREISQLGISGDITLTS 258
Cdd:smart00908  78 EILVEEDEFKVAVNGQHFLEFPHRLP-LESIDTLEISGDVQLTS 120
PRK07764 super family cl35613
DNA polymerase III subunits gamma and tau; Validated
9-139 2.56e-08

DNA polymerase III subunits gamma and tau; Validated


The actual alignment was detected with superfamily member PRK07764:

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.22  E-value: 2.56e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    9 DALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGA 88
Cdd:PRK07764 619 AAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP 698
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 33859580   89 YPGSTAPGAFPGQPGAPGAYPSAPG-GYPAAGPYGVPAGPLTVPYDLPLPGG 139
Cdd:PRK07764 699 AQPAPAPAATPPAGQADDPAAQPPQaAQGASAPSPAADDPVPLPPEPDDPPD 750
 
Name Accession Description Interval E-value
Gal-bind_lectin smart00908
Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are ...
137-258 1.22e-54

Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are classified according to the carbohydrate-recognition domain (CRD) of which there are two main types, S-type and C-type. Galectins (previously S-lectins) bind exclusively beta-galactosides like lactose. They do not require metal ions for activity. Galectins are found predominantly, but not exclusively in mammals. Their function is unclear. They are developmentally regulated and may be involved in differentiation, cellular regulation and tissue construction.


Pssm-ID: 214904  Cd Length: 122  Bit Score: 172.01  E-value: 1.22e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    137 PGGVMPRMLITIMGTVKPNANRIVLDFRRG--NDVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESGKPFKI 214
Cdd:smart00908   1 PGGLSPGSSITIRGIVLPDAKRFSINLQCGpnADIALHFNPRFDEG---TIVRNSKQNGKWGKEERSGGFPFQPGQPFEL 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 33859580    215 QVLVEADHFKVAVNDAHLLQYNHRMKnLREISQLGISGDITLTS 258
Cdd:smart00908  78 EILVEEDEFKVAVNGQHFLEFPHRLP-LESIDTLEISGDVQLTS 120
GLECT cd00070
Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as ...
131-258 7.17e-54

Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.


Pssm-ID: 238025  Cd Length: 127  Bit Score: 170.51  E-value: 7.17e-54
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580 131 PYDLPLPGGVMPRMLITIMGTVKPNANRIVLDFRRGN-DVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESG 209
Cdd:cd00070   1 PYKLPLPGGLKPGSTLTVKGRVLPNAKRFSINLGTGSsDIALHFNPRFDEN---VIVRNSFLNGNWGPEERSGGFPFQPG 77
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*....
gi 33859580 210 KPFKIQVLVEADHFKVAVNDAHLLQYNHRMKnLREISQLGISGDITLTS 258
Cdd:cd00070  78 QPFELTILVEEDKFQIFVNGQHFFSFPHRLP-LESIDYLSINGDVSLTS 125
Gal-bind_lectin pfam00337
Galactoside-binding lectin; This family contains galactoside binding lectins. The family also ...
137-258 1.03e-49

Galactoside-binding lectin; This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).


Pssm-ID: 459768  Cd Length: 124  Bit Score: 159.73  E-value: 1.03e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   137 PGGVMPRMLITIMGTVKPNANRIVLDFRRG----NDVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESGKPF 212
Cdd:pfam00337   1 PGGLQPGSSLTIKGIVLPDAQRFSINLQTGvgpsDDIALHFNPRFDEN---VIVRNSRQNGQWGQEEREGGFPFQPGQPF 77
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 33859580   213 KIQVLVEADHFKVAVNDAHLLQYNHRMKNlREISQLGISGDITLTS 258
Cdd:pfam00337  78 ELTILVGDDHFKIYVNGQHFTTFKHRLPP-EDIDALQVRGDVKLTS 122
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-139 2.56e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.22  E-value: 2.56e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    9 DALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGA 88
Cdd:PRK07764 619 AAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP 698
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 33859580   89 YPGSTAPGAFPGQPGAPGAYPSAPG-GYPAAGPYGVPAGPLTVPYDLPLPGG 139
Cdd:PRK07764 699 AQPAPAPAATPPAGQADDPAAQPPQaAQGASAPSPAADDPVPLPPEPDDPPD 750
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-127 8.42e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 43.36  E-value: 8.42e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   15 GNPNPQGYPGAWGNQPGAG--------GYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAP 86
Cdd:NF038329 132 GEQGPRGDRGETGPAGPAGppgpqgerGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGP 211
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 33859580   87 GAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGP 127
Cdd:NF038329 212 AGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGP 252
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
27-112 2.03e-04

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 42.63  E-value: 2.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    27 GNQPGAGGYPGAAYPGA--YPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQP-- 102
Cdd:pfam03157 440 GQQPGQGQQPGQEQPGQgqQPGQGQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYYPTSPlq 519
                          90
                  ....*....|...
gi 33859580   103 ---GAPGAYPSAP 112
Cdd:pfam03157 520 pgqGQPGYYPTSP 532
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
12-143 2.80e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 41.97  E-value: 2.80e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580  12 AGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAY----PGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPG 87
Cdd:COG5180 345 ASDAGQPPSAYPPAEEAVPGKPLEQGAPRPGSSggdgAPFQPPNGAPQPGLGRRGAPGPPMGAGDLVQAALDGGGRETAS 424
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 33859580  88 AYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGPLTVPYDLPLPGGVMPR 143
Cdd:COG5180 425 LGGAAGGAGQGPKADFVPGDAESVSGPAGLADQAGAAASTAMADFVAPVTDATPVD 480
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
45-133 3.89e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondria pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1 that are specific for ligation post-U-insertion. KREPA2, is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 41.39  E-value: 3.89e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580  45 PGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGP---TAPGAYPGSTAPGAFPGQPGAPGayPSAPGGYPAAGPY 121
Cdd:cd23959 156 FGQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASpfaTATDTAPSSGAPDGFPAEASAPS--PFAAPASAASFPA 233
                        90
                ....*....|..
gi 33859580 122 GVPAGPLTVPYD 133
Cdd:cd23959 234 APVANGEAATPT 245
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
36-162 6.38e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 40.76  E-value: 6.38e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   36 PGAAYPGAYPGQAPPGAypgQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPgqpgAPGAYPSAPGgy 115
Cdd:NF041121  16 GRAAAPPSPEGPAPTAA---SQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPP----PPGPAGAAPG-- 86
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 33859580  116 pAAGPYGVPAGPltvpydlPLPGGV-MPRMLITIMGTVkPNANRIVLD 162
Cdd:NF041121  87 -AALPVRVPAPP-------ALPNPLeLARALRPLKRRV-PSPRRVELD 125
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
36-137 1.53e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 39.79  E-value: 1.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    36 PGAAYPGAYPGQAPPGAYPGQapPGAYPGQAPpsaYPGPTAPGAYPGPTAPGAYPGSTAPG----AFPGQPgAPGAYPSA 111
Cdd:TIGR01628 392 GGAMGQPPYYGQGPQQQFNGQ--PLGWPRMSM---MPTPMGPGGPLRPNGLAPMNAVRAPSrnaqNAAQKP-PMQPVMYP 465
                          90       100
                  ....*....|....*....|....*.
gi 33859580   112 PGGYPAAGPYGVPAGPLTVPYDLPLP 137
Cdd:TIGR01628 466 PNYQSLPLSQDLPQPQSTASQGGQNK 491
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-106 5.97e-03

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 37.58  E-value: 5.97e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   15 GNPNPQGYPGAWGnQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPgAYPGPTAPGAYPGSTA 94
Cdd:NF038329 168 GEAGPQGPAGKDG-EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGP-AGDGQQGPDGDPGPTG 245
                         90
                 ....*....|..
gi 33859580   95 PGAFPGQPGAPG 106
Cdd:NF038329 246 EDGPQGPDGPAG 257
 
Name Accession Description Interval E-value
Gal-bind_lectin smart00908
Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are ...
137-258 1.22e-54

Galactoside-binding lectin; Animal lectins display a wide variety of architectures. They are classified according to the carbohydrate-recognition domain (CRD) of which there are two main types, S-type and C-type. Galectins (previously S-lectins) bind exclusively beta-galactosides like lactose. They do not require metal ions for activity. Galectins are found predominantly, but not exclusively in mammals. Their function is unclear. They are developmentally regulated and may be involved in differentiation, cellular regulation and tissue construction.


Pssm-ID: 214904  Cd Length: 122  Bit Score: 172.01  E-value: 1.22e-54
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    137 PGGVMPRMLITIMGTVKPNANRIVLDFRRG--NDVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESGKPFKI 214
Cdd:smart00908   1 PGGLSPGSSITIRGIVLPDAKRFSINLQCGpnADIALHFNPRFDEG---TIVRNSKQNGKWGKEERSGGFPFQPGQPFEL 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 33859580    215 QVLVEADHFKVAVNDAHLLQYNHRMKnLREISQLGISGDITLTS 258
Cdd:smart00908  78 EILVEEDEFKVAVNGQHFLEFPHRLP-LESIDTLEISGDVQLTS 120
GLECT cd00070
Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as ...
131-258 7.17e-54

Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.


Pssm-ID: 238025  Cd Length: 127  Bit Score: 170.51  E-value: 7.17e-54
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580 131 PYDLPLPGGVMPRMLITIMGTVKPNANRIVLDFRRGN-DVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESG 209
Cdd:cd00070   1 PYKLPLPGGLKPGSTLTVKGRVLPNAKRFSINLGTGSsDIALHFNPRFDEN---VIVRNSFLNGNWGPEERSGGFPFQPG 77
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*....
gi 33859580 210 KPFKIQVLVEADHFKVAVNDAHLLQYNHRMKnLREISQLGISGDITLTS 258
Cdd:cd00070  78 QPFELTILVEEDKFQIFVNGQHFFSFPHRLP-LESIDYLSINGDVSLTS 125
Gal-bind_lectin pfam00337
Galactoside-binding lectin; This family contains galactoside binding lectins. The family also ...
137-258 1.03e-49

Galactoside-binding lectin; This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).


Pssm-ID: 459768  Cd Length: 124  Bit Score: 159.73  E-value: 1.03e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   137 PGGVMPRMLITIMGTVKPNANRIVLDFRRG----NDVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESGKPF 212
Cdd:pfam00337   1 PGGLQPGSSLTIKGIVLPDAQRFSINLQTGvgpsDDIALHFNPRFDEN---VIVRNSRQNGQWGQEEREGGFPFQPGQPF 77
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 33859580   213 KIQVLVEADHFKVAVNDAHLLQYNHRMKNlREISQLGISGDITLTS 258
Cdd:pfam00337  78 ELTILVGDDHFKIYVNGQHFTTFKHRLPP-EDIDALQVRGDVKLTS 122
GLECT smart00276
Galectin; Galectin - galactose-binding lectin
132-260 1.97e-49

Galectin; Galectin - galactose-binding lectin


Pssm-ID: 214596  Cd Length: 128  Bit Score: 158.93  E-value: 1.97e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    132 YDLPLPGGVMPRMLITIMGTVKPNANRIVLDFRRGN-DVAFHFNPRFNENnrrVIVCNTKQDNNWGKEERQSAFPFESGK 210
Cdd:smart00276   1 FTLPIPGGLKPGQTLTVRGIVLPDAKRFSINLLTGGdDIALHFNPRFNEN---KIVCNSKLNGSWGSEEREGGFPFQPGQ 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 33859580    211 PFKIQVLVEADHFKVAVNDAHLLQYNHRMKNLrEISQLGISGDITLTSAN 260
Cdd:smart00276  78 PFDLTIIVQPDHFQIFVNGVHITTFPHRLPLE-SIDYLSINGDVQLTSVS 126
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-139 2.56e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.22  E-value: 2.56e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    9 DALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGA 88
Cdd:PRK07764 619 AAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAP 698
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 33859580   89 YPGSTAPGAFPGQPGAPGAYPSAPG-GYPAAGPYGVPAGPLTVPYDLPLPGG 139
Cdd:PRK07764 699 AQPAPAPAATPPAGQADDPAAQPPQaAQGASAPSPAADDPVPLPPEPDDPPD 750
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
6-125 1.12e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 52.37  E-value: 1.12e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    6 SLNDALAGSGNPNPQGYPGawgnqPGAGGYPGAAYPGAypgQAPPGAYP--GQAPPGAYPGQAPPSAYPGPTAP--GAYP 81
Cdd:PRK14959 370 SLRPSGGGASAPSGSAAEG-----PASGGAATIPTPGT---QGPQGTAPaaGMTPSSAAPATPAPSAAPSPRVPwdDAPP 441
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*.
gi 33859580   82 GPTAPG--AYPGSTAPGAFPGqPGAPGAYPSAPGGYPAAGPYGVPA 125
Cdd:PRK14959 442 APPRSGipPRPAPRMPEASPV-PGAPDSVASASDAPPTLGDPSDTA 486
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
10-127 1.49e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.83  E-value: 1.49e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   10 ALAGSGNPNPQGY-PGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYP-------GQAPPGAYPGQAPPSAYPGPTAPGAYP 81
Cdd:PRK07764 594 AAGGEGPPAPASSgPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEAsaapapgVAAPEHHPKHVAVPDASDGGDGWPAKA 673
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
gi 33859580   82 GPTAPGAYPGSTAPGAFPGQPGAPGAYPSA-----PGGYPAAGPYGVPAGP 127
Cdd:PRK07764 674 GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPapaatPPAGQADDPAAQPPQA 724
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-185 1.97e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.44  E-value: 1.97e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    9 DALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAP---PSAYPGPTAPGAYPGPTA 85
Cdd:PRK07764 379 ERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPapaPPSPAGNAPAGGAPSPPP 458
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   86 PGAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGPLTVPYDLPLPG---------GVMPRMLITIMGTVKPNA 156
Cdd:PRK07764 459 AAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATlrerwpeilAAVPKRSRKTWAILLPEA 538
                        170       180       190
                 ....*....|....*....|....*....|....*.
gi 33859580  157 -------NRIVLDFRRGNDVAFHFNPRFNENNRRVI 185
Cdd:PRK07764 539 tvlgvrgDTLVLGFSTGGLARRFASPGNAEVLVTAL 574
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
10-142 3.88e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 3.88e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   10 ALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYpGQAPPSAYPGPTAPGAYPGPTAPGAY 89
Cdd:PRK07764 642 APAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPA-GAAPAQPAPAPAATPPAGQADDPAAQ 720
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 33859580   90 PGSTAPGAFPGQPGAPGAYP--SAPGGYPAAGPYGVPAGPLTVPYDLPLPGGVMP 142
Cdd:PRK07764 721 PPQAAQGASAPSPAADDPVPlpPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPP 775
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
31-137 5.17e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.18  E-value: 5.17e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   31 GAGGYPGAAYPGAYP---GQAPPGAYPGQAPPGAYPGQAPPSAYPGPtAPGAYPGPTAPGAYPGSTAPGAFPGQPGAPGA 107
Cdd:PRK12323 371 GAGPATAAAAPVAQPapaAAAPAAAAPAPAAPPAAPAAAPAAAAAAR-AVAAAPARRSPAPEALAAARQASARGPGGAPA 449
                         90       100       110
                 ....*....|....*....|....*....|
gi 33859580  108 YPSAPGGYPAAGPYGVPAGPLTVPYDLPLP 137
Cdd:PRK12323 450 PAPAPAAAPAAAARPAAAGPRPVAAAAAAA 479
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
9-120 5.73e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.29  E-value: 5.73e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    9 DALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGA 88
Cdd:PRK07764 668 GWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDD 747
                         90       100       110
                 ....*....|....*....|....*....|..
gi 33859580   89 YPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGP 120
Cdd:PRK07764 748 PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPP 779
PHA03378 PHA03378
EBNA-3B; Provisional
17-145 8.06e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.98  E-value: 8.06e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   17 PNPQGYPGAwgNQPGAGGYPGAAYPGAYPGQA-PPGAYPGQA-PPGAYPGQA-PPSAYPGPTAPgaypgptaPGAYPGST 93
Cdd:PHA03378 705 RPPAAPPGR--AQRPAAATGRARPPAAAPGRArPPAAAPGRArPPAAAPGRArPPAAAPGRARP--------PAAAPGAP 774
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 33859580   94 APGAFPGQPGAPGAYPSApGGYPAAGPYGVPAGPLTVPYDLPLPGGVMPRML 145
Cdd:PHA03378 775 TPQPPPQAPPAPQQRPRG-APTPQPPPQAGPTSMQLMPRAAPGQQGPTKQIL 825
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
12-127 8.07e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.90  E-value: 8.07e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   12 AGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAyPGQAPPSAYPGPTAPGAYPGPTAPGAYPG 91
Cdd:PRK07764 586 AVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAA-PAEASAAPAPGVAAPEHHPKHVAVPDASD 664
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 33859580   92 STAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGP 127
Cdd:PRK07764 665 GGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQ 700
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
36-138 1.11e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.41  E-value: 1.11e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   36 PGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTA-----------PGAYPGPT-APGAYPGSTAPGAFPGQPG 103
Cdd:PRK12323 392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEAlaaarqasargPGGAPAPApAPAAAPAAAARPAAAGPRP 471
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 33859580  104 APGAYPSAPG-GYPAAGPYGVPAGPL---TVPYDLPLPG 138
Cdd:PRK12323 472 VAAAAAAAPArAAPAAAPAPADDDPPpweELPPEFASPA 510
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
30-152 1.42e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 45.86  E-value: 1.42e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   30 PGAGGY---PGAAYPGAYPGQAPPGAYP-GQAPPGAYPGQAPPSAYPGPTAPGAYPGPtAPGAYPGSTAPGAFPGQPGAP 105
Cdd:PRK14951 366 PAAAAEaaaPAEKKTPARPEAAAPAAAPvAQAAAAPAPAAAPAAAASAPAAPPAAAPP-APVAAPAAAAPAAAPAAAPAA 444
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 33859580  106 GAYPSAPGGYPAAGPYGVPAG-----PLTVPYDLPLPGGVMPRMLITIMGTV 152
Cdd:PRK14951 445 VALAPAPPAQAAPETVAIPVRvapepAVASAAPAPAAAPAAARLTPTEEGDV 496
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
32-139 1.54e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.75  E-value: 1.54e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   32 AGGYPGAAYPGAYPGQAPPGAYPGQ---APPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFP-------GQ 101
Cdd:PRK07764 588 VGPAPGAAGGEGPPAPASSGPPEEAarpAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAvpdasdgGD 667
                         90       100       110
                 ....*....|....*....|....*....|....*...
gi 33859580  102 PGAPGAYPSAPGGYPAAGPYGVPAGPLTVPYDLPLPGG 139
Cdd:PRK07764 668 GWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAP 705
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
30-135 3.53e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 44.84  E-value: 3.53e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   30 PGAGGYPGAAYPGAYPGQAPPgayPGQAPPGAYPGQAPPSAYPGPTAPGAYPGP--TAPGAYPGSTAPGAFPGQPGAPG- 106
Cdd:PRK07003 362 VTGGGAPGGGVPARVAGAVPA---PGARAAAAVGASAVPAVTAVTGAAGAALAPkaAAAAAATRAEAPPAAPAPPATADr 438
                         90       100
                 ....*....|....*....|....*....
gi 33859580  107 AYPSAPGGYPAAGPYGVPAGPLTVPYDLP 135
Cdd:PRK07003 439 GDDAADGDAPVPAKANARASADSRCDERD 467
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
17-143 4.45e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.48  E-value: 4.45e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   17 PNPQGYPGAWGNQPGAGGYPGAAYPG----AYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGS 92
Cdd:PRK12323 404 AAPAAAPAAAAAARAVAAAPARRSPApealAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARA 483
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 33859580   93 TAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGPL-TVPYDLPLPGGVMPR 143
Cdd:PRK12323 484 APAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAgWVAESIPDPATADPD 535
dnaA PRK14086
chromosomal replication initiator protein DnaA;
41-138 5.19e-05

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.05  E-value: 5.19e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   41 PGAYPGQAPPGAYPG-QAPPGAYPGQAPPSAYPGPTA----PGAYPGPTAPgaypgsTAPGAFPG--QPGAPGAYPSAPG 113
Cdd:PRK14086  90 PSAGEPAPPPPHARRtSEPELPRPGRRPYEGYGGPRAddrpPGLPRQDQLP------TARPAYPAyqQRPEPGAWPRAAD 163
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 33859580  114 ---------GYPAAGPYGVPAGPLTVP-YDLPLPG 138
Cdd:PRK14086 164 dygwqqqrlGFPPRAPYASPASYAPEQeRDREPYD 198
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-127 8.42e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 43.36  E-value: 8.42e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   15 GNPNPQGYPGAWGNQPGAG--------GYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAP 86
Cdd:NF038329 132 GEQGPRGDRGETGPAGPAGppgpqgerGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGP 211
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 33859580   87 GAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGP 127
Cdd:NF038329 212 AGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGP 252
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
48-143 8.65e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 8.65e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   48 APPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGP 127
Cdd:PRK07764 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDG 668
                         90
                 ....*....|....*.
gi 33859580  128 LTVPYDLPLPGGVMPR 143
Cdd:PRK07764 669 WPAKAGGAAPAAPPPA 684
PHA03247 PHA03247
large tegument protein UL36; Provisional
17-137 9.12e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 9.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    17 PNPQGYPGAWGNQ---PGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAyPGST 93
Cdd:PHA03247 2706 PTPEPAPHALVSAtplPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP-RRLT 2784
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 33859580    94 APGAFPGQPGAPGA-YPSAPGGYPAAGPYGVPAGPLTVPYDLPLP 137
Cdd:PHA03247 2785 RPAVASLSESRESLpSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
PHA03247 PHA03247
large tegument protein UL36; Provisional
41-127 1.35e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.35e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    41 PGAYPGQAPPGAYPGQAPPGAYPGQAPP---SAYPGPTAP----GAYPGPTAPGAYPGSTAPGAFPGQPGAPGAyPSAPG 113
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPPPTPEPAPHalvSATPLPPGPaaarQASPALPAAPAPPAVPAGPATPGGPARPAR-PPTTA 2764
                          90
                  ....*....|....
gi 33859580   114 GYPAAGPYGVPAGP 127
Cdd:PHA03247 2765 GPPAPAPPAAPAAG 2778
PHA03247 PHA03247
large tegument protein UL36; Provisional
49-136 1.40e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 1.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    49 PPGAYPGQAPPGAYPG-QAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGP 127
Cdd:PHA03247 2703 PPPPTPEPAPHALVSAtPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRR 2782

                  ....*....
gi 33859580   128 LTVPYDLPL 136
Cdd:PHA03247 2783 LTRPAVASL 2791
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
27-112 2.03e-04

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 42.63  E-value: 2.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    27 GNQPGAGGYPGAAYPGA--YPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQP-- 102
Cdd:pfam03157 440 GQQPGQGQQPGQEQPGQgqQPGQGQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYYPTSPlq 519
                          90
                  ....*....|...
gi 33859580   103 ---GAPGAYPSAP 112
Cdd:pfam03157 520 pgqGQPGYYPTSP 532
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
10-112 2.32e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 2.32e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   10 ALAGSGNPNPQGYPGAWGNQPGAGGYPGA-------AYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPG 82
Cdd:PRK07764 671 AKAGGAAPAAPPPAPAPAAPAAPAGAAPAqpapapaATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPD 750
                         90       100       110
                 ....*....|....*....|....*....|
gi 33859580   83 PTAPGAYPGSTAPGAFPGQPGAPGAYPSAP 112
Cdd:PRK07764 751 PAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
12-143 2.80e-04

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 41.97  E-value: 2.80e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580  12 AGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAY----PGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPG 87
Cdd:COG5180 345 ASDAGQPPSAYPPAEEAVPGKPLEQGAPRPGSSggdgAPFQPPNGAPQPGLGRRGAPGPPMGAGDLVQAALDGGGRETAS 424
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 33859580  88 AYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGPLTVPYDLPLPGGVMPR 143
Cdd:COG5180 425 LGGAAGGAGQGPKADFVPGDAESVSGPAGLADQAGAAASTAMADFVAPVTDATPVD 480
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
27-104 3.44e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 3.44e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 33859580    27 GNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPT-APGAYPGPTAPGAYPGSTAPGAFPGQPGA 104
Cdd:PRK12270   39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAaAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
54-138 3.47e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.80  E-value: 3.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    54 PGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGPLTVPYD 133
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117

                  ....*
gi 33859580   134 LPLPG 138
Cdd:PRK12270  118 TPLRG 122
dnaA PRK14086
chromosomal replication initiator protein DnaA;
17-143 3.54e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.73  E-value: 3.54e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   17 PNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQA-PPGAYPGQAPPSaypgptAPGAYPGPTAPgaYPGSTAP 95
Cdd:PRK14086 100 PHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPtARPAYPAYQQRP------EPGAWPRAADD--YGWQQQR 171
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 33859580   96 GAFPgqPGAPgaYPSAPGGYPAAGPYGVPAGPLTVPYDLPLPGGVMPR 143
Cdd:PRK14086 172 LGFP--PRAP--YASPASYAPEQERDREPYDAGRPEYDQRRRDYDHPR 215
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
45-133 3.89e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondria pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1 that are specific for ligation post-U-insertion. KREPA2, is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 41.39  E-value: 3.89e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580  45 PGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGP---TAPGAYPGSTAPGAFPGQPGAPGayPSAPGGYPAAGPY 121
Cdd:cd23959 156 FGQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASpfaTATDTAPSSGAPDGFPAEASAPS--PFAAPASAASFPA 233
                        90
                ....*....|..
gi 33859580 122 GVPAGPLTVPYD 133
Cdd:cd23959 234 APVANGEAATPT 245
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
36-162 6.38e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 40.76  E-value: 6.38e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   36 PGAAYPGAYPGQAPPGAypgQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPgqpgAPGAYPSAPGgy 115
Cdd:NF041121  16 GRAAAPPSPEGPAPTAA---SQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPP----PPGPAGAAPG-- 86
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 33859580  116 pAAGPYGVPAGPltvpydlPLPGGV-MPRMLITIMGTVkPNANRIVLD 162
Cdd:NF041121  87 -AALPVRVPAPP-------ALPNPLeLARALRPLKRRV-PSPRRVELD 125
PHA03247 PHA03247
large tegument protein UL36; Provisional
32-143 7.12e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 7.12e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    32 AGGYPGAAYPGAYPGQA-----PPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPtaPGAYPGSTAPGAFPGQPGAPG 106
Cdd:PHA03247 2548 AGDPPPPLPPAAPPAAPdrsvpPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDD--RGDPRGPAPPSPLPPDTHAPD 2625
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 33859580   107 ayPSAPGGYPAA----GPYGVPAGPLTVPYDLPLPGGVMPR 143
Cdd:PHA03247 2626 --PPPPSPSPAAnepdPHPPPTVPPPERPRDDPAPGRVSRP 2664
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
36-131 7.52e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 40.64  E-value: 7.52e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    36 PGAAYPGAYPGQAPPGAYPGQAP-PGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQPGAPGAYPSAPGG 114
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAaPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEV 117
                          90       100
                  ....*....|....*....|.
gi 33859580   115 YPAAGPYGVPA----GPLTVP 131
Cdd:PRK12270  118 TPLRGAAAAVAknmdASLEVP 138
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
47-131 8.27e-04

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondria pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1 that are specific for ligation post-U-insertion. KREPA2, is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 40.24  E-value: 8.27e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580  47 QAPPGAYPGQAPpgaYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQPGAPGAYPSAPGGYP--------AA 118
Cdd:cd23959 143 QTAPVTPFGQLP---MFGQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASPFATATDTAPSSGAPdgfpaeasAP 219
                        90
                ....*....|...
gi 33859580 119 GPYGVPAGPLTVP 131
Cdd:cd23959 220 SPFAAPASAASFP 232
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
12-137 8.44e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 40.60  E-value: 8.44e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   12 AGSGNPNPQGYPGAwgnQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGA--YPGQAPPSAYPGPTAPGAYPGPTAPGAY 89
Cdd:PRK07003 362 VTGGGAPGGGVPAR---VAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAalAPKAAAAAAATRAEAPPAAPAPPATADR 438
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*....
gi 33859580   90 PGSTAPGAFPGQPGAPG-AYPSAPGGYPAAGPYGVPAGPLTVPYDLPLP 137
Cdd:PRK07003 439 GDDAADGDAPVPAKANArASADSRCDERDAQPPADSGSASAPASDAPPD 487
PHA03378 PHA03378
EBNA-3B; Provisional
10-103 8.84e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.44  E-value: 8.84e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   10 ALAGSGNPNPQGYPGAwgNQPGAGGYPGAAYPGAYPGQA-PPGAYPGQA-PPGAYPGQAPPSAYP--GPTA---PGAYPG 82
Cdd:PHA03378 718 AAATGRARPPAAAPGR--ARPPAAAPGRARPPAAAPGRArPPAAAPGRArPPAAAPGAPTPQPPPqaPPAPqqrPRGAPT 795
                         90       100
                 ....*....|....*....|....
gi 33859580   83 PTAP---GAYPGSTAPGAFPGQPG 103
Cdd:PHA03378 796 PQPPpqaGPTSMQLMPRAAPGQQG 819
PHA03247 PHA03247
large tegument protein UL36; Provisional
30-142 9.21e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.69  E-value: 9.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    30 PGAGGYPGA-AYPGAYPGQAPPGAYPGQAPPGAypgqAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQPGAPGAY 108
Cdd:PHA03247 2751 PGGPARPARpPTTAGPPAPAPPAAPAAGPPRRL----TRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAG 2826
                          90       100       110
                  ....*....|....*....|....*....|....
gi 33859580   109 PSAPGGYPAAGPYGVPAGPltVPYDLPLPGGVMP 142
Cdd:PHA03247 2827 PLPPPTSAQPTAPPPPPGP--PPPSLPLGGSVAP 2858
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
15-114 1.25e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 40.05  E-value: 1.25e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   15 GNPNPQGYPGAWGNQPGAGGyPGAAYPGAYPGQAPPGaypGQAPPGayPGQAPPSAYPGPTAPGAYPGPTAPGAYpgSTA 94
Cdd:PRK14959 401 GTQGPQGTAPAAGMTPSSAA-PATPAPSAAPSPRVPW---DDAPPA--PPRSGIPPRPAPRMPEASPVPGAPDSV--ASA 472
                         90       100
                 ....*....|....*....|
gi 33859580   95 PGAFPGQPGAPGAYPSAPGG 114
Cdd:PRK14959 473 SDAPPTLGDPSDTAEHTPSG 492
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
30-134 1.26e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 39.72  E-value: 1.26e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   30 PGAGGYPGAAYPGAYPGQAPP-GAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAfpgqpgaPGAY 108
Cdd:PRK14965 382 PAPPSAAWGAPTPAAPAAPPPaAAPPVPPAAPARPAAARPAPAPAPPAAAAPPARSADPAAAASAGDRW-------RAFV 454
                         90       100
                 ....*....|....*....|....*.
gi 33859580  109 PSAPGGYPAAGPYGVPAGPLTVPYDL 134
Cdd:PRK14965 455 AFVKGKKPALGASLEQGSPLGVSAGL 480
PHA03378 PHA03378
EBNA-3B; Provisional
23-131 1.40e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 40.05  E-value: 1.40e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   23 PGAWGNQPGAGGYPGAAYPGAYPGQAPPgayPGQAPPGAYPGQAPPSAYPGP---TAPGAYPGPTAPGAYPGSTAPGAFP 99
Cdd:PHA03378 669 IGHIPYQPSPTGANTMLPIQWAPGTMQP---PPRAPTPMRPPAAPPGRAQRPaaaTGRARPPAAAPGRARPPAAAPGRAR 745
                         90       100       110
                 ....*....|....*....|....*....|...
gi 33859580  100 GQPGAPG-AYPSAPGGYPAAGPYGVPAGPLTVP 131
Cdd:PHA03378 746 PPAAAPGrARPPAAAPGRARPPAAAPGAPTPQP 778
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
36-137 1.53e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 39.79  E-value: 1.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    36 PGAAYPGAYPGQAPPGAYPGQapPGAYPGQAPpsaYPGPTAPGAYPGPTAPGAYPGSTAPG----AFPGQPgAPGAYPSA 111
Cdd:TIGR01628 392 GGAMGQPPYYGQGPQQQFNGQ--PLGWPRMSM---MPTPMGPGGPLRPNGLAPMNAVRAPSrnaqNAAQKP-PMQPVMYP 465
                          90       100
                  ....*....|....*....|....*.
gi 33859580   112 PGGYPAAGPYGVPAGPLTVPYDLPLP 137
Cdd:TIGR01628 466 PNYQSLPLSQDLPQPQSTASQGGQNK 491
PHA03247 PHA03247
large tegument protein UL36; Provisional
10-128 1.58e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.92  E-value: 1.58e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    10 ALAGSGNPNPQGYPgawgnqPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAY 89
Cdd:PHA03247 2747 GPATPGGPARPARP------PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 33859580    90 PGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYG---VPAGPL 128
Cdd:PHA03247 2821 AASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDV 2862
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
13-112 1.79e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 39.55  E-value: 1.79e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    13 GSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYP------GQAPPGAYPG---QAPPSAYPGPTAPGAYPGP 83
Cdd:pfam03157 315 GQEQQDQQPGQGRQGQQPGQGQQGQQPAQGQQPGQGQPGYYPtspqqpGQGQPGYYPTsqqQPQQGQQPEQGQQGQQQGQ 394
                          90       100
                  ....*....|....*....|....*....
gi 33859580    84 TAPGAYPGStapGAFPGQpGAPGAYPSAP 112
Cdd:pfam03157 395 GQQGQQPGQ---GQQPGQ-GQPGYYPTSP 419
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
36-144 2.03e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 39.37  E-value: 2.03e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    36 PGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTApgayPGSTAPGAFPGQPGAPGAYPSAPGGY 115
Cdd:pfam03154 263 PQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPG----PSPAAPGQSQQRIHTPPSQSQLQSQQ 338
                          90       100
                  ....*....|....*....|....*....
gi 33859580   116 PAAgPYGVPAGPLTVPYDLPLPGGVMPRM 144
Cdd:pfam03154 339 PPR-EQPLPPAPLSMPHIKPPPTTPIPQL 366
PHA03247 PHA03247
large tegument protein UL36; Provisional
23-143 2.08e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.54  E-value: 2.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    23 PGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAyPGPTAPGAYPGPTAPGAYP-GSTAPG---AF 98
Cdd:PHA03247 2786 PAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS-AQPTAPPPPPGPPPPSLPLgGSVAPGgdvRR 2864
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 33859580    99 PGQPGAPGAYPSAPGGYPAAGPYGVPAGPLTVPYDLPLPGGVMPR 143
Cdd:PHA03247 2865 RPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
37-118 2.10e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 39.34  E-value: 2.10e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   37 GAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAyPGPTAPGAyPGSTAPGAfpgqpgAPGAYPSAPGGYP 116
Cdd:PRK14965 380 GAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPA-PAPAPPAA-AAPPARSA------DPAAAASAGDRWR 451

                 ..
gi 33859580  117 AA 118
Cdd:PRK14965 452 AF 453
PHA03247 PHA03247
large tegument protein UL36; Provisional
12-131 2.30e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 39.15  E-value: 2.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580    12 AGSGNPNPQGYPGAWGNQPGAGGYPGAAYP---GAYPGQAPPGAYPGQAP--PGAYPGQAPPSAYPGPTAPGAYPGPTAP 86
Cdd:PHA03247 2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSRprrARRLGRAAQASSPPQRPrrRAARPTVGSLTSLADPPPPPPTPEPAPH 2713
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 33859580    87 GAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGPLTVP 131
Cdd:PHA03247 2714 ALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA 2758
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
10-128 2.38e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 39.09  E-value: 2.38e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   10 ALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGP----TAPGAYPGP-- 83
Cdd:PRK12323 432 ALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPppweELPPEFASPap 511
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 33859580   84 --TAPGAYPGSTAPGAFPGQPGAPGAYPSAPGGyPAAGPYGVPAGPL 128
Cdd:PRK12323 512 aqPDAAPAGWVAESIPDPATADPDDAFETLAPA-PAAAPAPRAAAAT 557
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
13-127 3.53e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 38.54  E-value: 3.53e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   13 GSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGS 92
Cdd:PRK14951 367 AAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVA 446
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....
gi 33859580   93 TAP---------GAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGP 127
Cdd:PRK14951 447 LAPappaqaapeTVAIPVRVAPEPAVASAAPAPAAAPAAARLTP 490
PHA02682 PHA02682
ORF080 virion core protein; Provisional
46-135 3.58e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 37.92  E-value: 3.58e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   46 GQAPPGAYPG-QAPPGAYPGQAPPSAYPGPTAPGayPGPTAPGAYPGSTAPGAFPGQPGAPG-AYPSAPGGYPaagpygv 123
Cdd:PHA02682  78 GQSPLAPSPAcAAPAPACPACAPAAPAPAVTCPA--PAPACPPATAPTCPPPAVCPAPARPApACPPSTRQCP------- 148
                         90
                 ....*....|..
gi 33859580  124 PAGPLTVPYDLP 135
Cdd:PHA02682 149 PAPPLPTPKPAP 160
PRK12373 PRK12373
NADH-quinone oxidoreductase subunit E;
41-144 4.24e-03

NADH-quinone oxidoreductase subunit E;


Pssm-ID: 237082 [Multi-domain]  Cd Length: 400  Bit Score: 38.24  E-value: 4.24e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   41 PGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQP--GAPGAYPSAPGGYPAA 118
Cdd:PRK12373 232 APWQGDAAPVPPSEAARPKSADAETNAALKTPATAPKAAAKNAKAPEAQPVSGTAAAEPAPKeaAKAAAAAAKPALEDKP 311
                         90       100
                 ....*....|....*....|....*...
gi 33859580  119 GPYGV--PAGpltvPYDLPLPGGVMPRM 144
Cdd:PRK12373 312 RPLGIarPGG----ADDLKLISGVGPKI 335
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
9-93 4.37e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 38.33  E-value: 4.37e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580     9 DALAGSGNPNPQGYPGAWGNQPGAGGYPgAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGA 88
Cdd:PRK12270   35 DYGPGSTAAPTAAAAAAAAAASAPAAAP-AAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAV 113

                  ....*
gi 33859580    89 YPGST 93
Cdd:PRK12270  114 EDEVT 118
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
15-106 5.97e-03

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 37.58  E-value: 5.97e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   15 GNPNPQGYPGAWGnQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPgAYPGPTAPGAYPGSTA 94
Cdd:NF038329 168 GEAGPQGPAGKDG-EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGP-AGDGQQGPDGDPGPTG 245
                         90
                 ....*....|..
gi 33859580   95 PGAFPGQPGAPG 106
Cdd:NF038329 246 EDGPQGPDGPAG 257
PHA02682 PHA02682
ORF080 virion core protein; Provisional
27-131 6.99e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 37.15  E-value: 6.99e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   27 GNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGqAPPSAYPGPTAPGAYPGPTAPG-AYPGST--APGA--FPGQ 101
Cdd:PHA02682  78 GQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPA-CPPATAPTCPPPAVCPAPARPApACPPSTrqCPPAppLPTP 156
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 33859580  102 PGAPGAYPS------APGGYPAAGPYGVPAGPLTVP 131
Cdd:PHA02682 157 KPAPAAKPIflhnqlPPPDYPAASCPTIETAPAASP 192
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
23-119 7.82e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 37.55  E-value: 7.82e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580   23 PGAWGNQPGAGGYPGAAYPGAYPGQAPPGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYPGPTAPGAYPGSTAPGAFPGQP 102
Cdd:PRK12323 467 AGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPA 546
                         90
                 ....*....|....*..
gi 33859580  103 GAPGAYPSAPGGYPAAG 119
Cdd:PRK12323 547 AAPAPRAAAATEPVVAP 563
PHA03247 PHA03247
large tegument protein UL36; Provisional
6-129 9.52e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 37.23  E-value: 9.52e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 33859580     6 SLNDALAGSGNPNPQGYPGAWGNQPGAGGYPGAAYPGAYPGQAP----PGAYPGQAPPGAYPGQAPPSAYPGPTAPGAYP 81
Cdd:PHA03247  364 SLEDLSAGRHHPKRASLPTRKRRSARHAATPFARGPGGDDQTRPaapvPASVPTPAPTPVPASAPPPPATPLPSAEPGSD 443
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 33859580    82 GPTAPGAYPGSTAPGAFPGQPGAPGAYPSAPGGYPAAGPYGVPAGPLT 129
Cdd:PHA03247  444 DGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPPEPPGADLA 491
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH