Conserved domains on [gi|1034578030|ref|XP_016874317|]

collagen alpha-1(II) chain isoform X1 [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1299-1534 1.40e-162

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.



Pssm-ID: 460199  Cd Length: 233  Bit Score: 489.54  E-value: 1.40e-162
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1299 HDAEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPAN 1378
Cdd:pfam01410    1 RDEEVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETGETCIYPTKAS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1379 VPKKNWWSsksKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKA 1458
Cdd:pfam01410   81 IPRKNWWT---KESKHVWFGEFMNGGSQFSYGVDGVGPSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQATGNLKKA 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034578030 1459 LLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534
Cdd:pfam01410  158 LLLQGSNDEEIRAEGNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF 233
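
Each alignment block above pairs a query row (gi 1034578030) with a domain-model row (Cdd:pfam01410), where "-" marks a gap and lowercase letters mark unaligned residues. As a rough illustration only, the following Python sketch computes percent identity over the columns where both rows carry a residue. The function name `percent_identity` is hypothetical and is not part of any NCBI tool, and this simple column count is not how CD-Search derives its bit score or E-value.

```python
def percent_identity(query_row: str, model_row: str) -> float:
    """Percent of gap-free columns where query and model residues match.

    Assumption: both rows come from the same alignment block, so they
    have the same number of columns; '-' marks a gap in either row.
    """
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have the same number of columns")
    compared = 0
    matches = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # skip columns with a gap in either row
        compared += 1
        if q.upper() == m.upper():  # lowercase = unaligned, still a residue
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Last block of the pfam01410 alignment (query 1459-1534, model 158-233).
query = "LLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF"
model = "LLLQGSNDEEIRAEGNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF"
print(f"{percent_identity(query, model):.1f}% identity in this block")
```

Identity computed this way is only a quick sanity check on a block; the reported Bit Score and E-value remain the authoritative measures of the hit.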
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
849-1059 6.94e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 6.94e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  849 GEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAG---QKGDAGAPGPQGPSGAPGPQGP 925
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGpqgPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  926 TGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNP--------GPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGP 997
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGqqgpdgdpGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  998 AGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPG 1059
Cdd:NF038329   277 DGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
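
The description above attributes these hits to collagen-like Gly-Xaa-Xaa repeats. A minimal Python sketch of that periodicity check follows, assuming the repeat starts at the first residue of the segment; `gly_periodicity` is a hypothetical helper, not an NCBI function. It strips gap characters and reports the fraction of every-third positions occupied by glycine.

```python
def gly_periodicity(segment: str, frame: int = 0) -> float:
    """Fraction of positions frame, frame+3, frame+6, ... that are Gly.

    Gap characters ('-') are dropped first, and lowercase (unaligned)
    residues are treated like any other residue.
    """
    residues = [c for c in segment.upper() if c.isalpha()]
    every_third = residues[frame::3]
    if not every_third:
        return 0.0
    return sum(r == "G" for r in every_third) / len(every_third)


# Query row 849-925 from the alignment above, gaps included as shown.
row_849_925 = (
    "GEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAG"
    "---QKGDAGAPGPQGPSGAPGPQGP"
)
print(gly_periodicity(row_849_925))
```

A value near 1.0 in one frame and much lower values in the other two frames is what an uninterrupted Gly-Xaa-Xaa run would be expected to show.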
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
422-685 1.07e-29

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 124.25  E-value: 1.07e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  422 KGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAgapgfpgprgppgpqgatGPLGPKG 501
Cdd:NF038329   119 KGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQ------------------GPAGKDG 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  502 QTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGpigppgergaPGNRGFPGQDGLAGPKGAPGER 581
Cdd:NF038329   181 EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAG----------PAGDGQQGPDGDPGPTGEDGPQ 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  582 GPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPsgapgedgrpgppgpqgaRGQPGVMGFPGPKGANGE 661
Cdd:NF038329   251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGK------------------DGLPGKDGKDGQNGKDGL 312
                          250       260
                   ....*....|....*....|....
gi 1034578030  662 PGKAGEKGLPGAPGLRGLPGKDGE 685
Cdd:NF038329   313 PGKDGKDGQPGKDGLPGKDGKDGQ 336
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
663-925 5.78e-26

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 113.08  E-value: 5.78e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  663 GKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPsgfqglpgppgppgeggkpgdqgvPGEAGAPGL 742
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGP------------------------AGPQGEAGP 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  743 VGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGeRGAAGIAGPKGDRGDVGEKG 822
Cdd:NF038329   173 QGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQG 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  823 PEGAPGKDGGRGltgpigppgpagANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEA 902
Cdd:NF038329   252 PDGPAGKDGPRG------------DRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKD 319
                          250       260
                   ....*....|....*....|...
gi 1034578030  903 GQKGDAGAPGPQGPSGAPGPQGP 925
Cdd:NF038329   320 GQPGKDGLPGKDGKDGQPGKPAP 342
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
987-1182 2.18e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.15  E-value: 2.18e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  987 GRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGP 1066
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1067 PGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGvKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMG 1146
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDG 275
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1034578030 1147 PSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRG 1182
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDG 311
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
81-136 1.83e-23

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.



Pssm-ID: 278520  Cd Length: 57  Bit Score: 94.80  E-value: 1.83e-23
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034578030   81 CVQDGQRYNDKDVWKPEPCRICVCDTGTVLCDDIICEDVkDCLSP--EIPFGECCPIC 136
Cdd:pfam00093    1 CVQNGVVYENGETWKPDLCTICTCDDGKVLCDKIICPPL-DCPNPrlEIPPGECCPVC 57
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
261-474 1.56e-16

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 84.19  E-value: 1.56e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  261 GPAGAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGY 340
Cdd:NF038329   126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  341 PGLDGAKGEAGAPGVKGESGSPGENGSpGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPG 420
Cdd:NF038329   206 QGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVG 284
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034578030  421 AKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPG 474
Cdd:NF038329   285 PAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1134-1243 8.54e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 46.82  E-value: 8.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1134 GDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPGPSGDQGASGPAGPSGPRGP 1213
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110
                   ....*....|....*....|....*....|
gi 1034578030 1214 PGPVGPSGKDGANGIPGPIGPPGPRGRSGE 1243
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGP 226
 
Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1299-1534 1.40e-162

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 489.54  E-value: 1.40e-162
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1299 HDAEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPAN 1378
Cdd:pfam01410    1 RDEEVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETGETCIYPTKAS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1379 VPKKNWWSsksKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKA 1458
Cdd:pfam01410   81 IPRKNWWT---KESKHVWFGEFMNGGSQFSYGVDGVGPSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQATGNLKKA 157
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034578030 1459 LLIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF 1534
Cdd:pfam01410  158 LLLQGSNDEEIRAEGNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF 233
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1300-1535 2.45e-151

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 459.63  E-value: 2.45e-151
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  1300 DAEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANV 1379
Cdd:smart00038    1 DEEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFETGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  1380 PKKNWWSSKSKekkHIWFGETINGGFHFSYGDDNLAPnTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKAL 1459
Cdd:smart00038   81 PRKTWYSGKSK---HVWFGETMNGGFKFSYGDSEGPP-VGVVQLTFLRLLSTEAHQNITYHCKNSVAYMDEATGNLKKAL 156
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034578030  1460 LIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL 1535
Cdd:smart00038  157 RLRGSNDVELSAEGNSKFTYEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIVDIAPSDIGGPDQEFGVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
849-1059 6.94e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 6.94e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  849 GEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAG---QKGDAGAPGPQGPSGAPGPQGP 925
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGpqgPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  926 TGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNP--------GPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGP 997
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGqqgpdgdpGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  998 AGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPG 1059
Cdd:NF038329   277 DGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
870-1104 1.31e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 135.80  E-value: 1.31e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  870 GERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGA 949
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  950 AGRVGPPGSNGNPGPPGPPGPSGKDGPKGA-----RGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGL 1024
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPagpagDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1025 AGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASgdrgppgpvgppgltgpaGEPGREGSPGADGPPGRDGAAGVKG 1104
Cdd:NF038329   277 DGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLP------------------GKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
422-685 1.07e-29

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 124.25  E-value: 1.07e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  422 KGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAgapgfpgprgppgpqgatGPLGPKG 501
Cdd:NF038329   119 KGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQ------------------GPAGKDG 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  502 QTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGpigppgergaPGNRGFPGQDGLAGPKGAPGER 581
Cdd:NF038329   181 EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAG----------PAGDGQQGPDGDPGPTGEDGPQ 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  582 GPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPsgapgedgrpgppgpqgaRGQPGVMGFPGPKGANGE 661
Cdd:NF038329   251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGK------------------DGLPGKDGKDGQNGKDGL 312
                          250       260
                   ....*....|....*....|....
gi 1034578030  662 PGKAGEKGLPGAPGLRGLPGKDGE 685
Cdd:NF038329   313 PGKDGKDGQPGKDGLPGKDGKDGQ 336
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
751-1021 1.43e-29

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 123.86  E-value: 1.43e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  751 FPGERGSPGAQGLQGPRGLPGTPGTDGPKGAsgpagppgaqgppglQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKD 830
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGP---------------AGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKD 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  831 GgrgltgpigppgpagangEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEaGQKGDAGA 910
Cdd:NF038329   180 G------------------EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGD 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  911 PGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAG 990
Cdd:NF038329   241 PGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDG 320
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1034578030  991 EPGLQGPAGPPGEKGEPGDDGPSGAEGPPGP 1021
Cdd:NF038329   321 QPGKDGLPGKDGKDGQPGKPAPKTPEVPQKP 351
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
663-925 5.78e-26

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 113.08  E-value: 5.78e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  663 GKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPsgfqglpgppgppgeggkpgdqgvPGEAGAPGL 742
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGP------------------------AGPQGEAGP 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  743 VGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGeRGAAGIAGPKGDRGDVGEKG 822
Cdd:NF038329   173 QGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQG 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  823 PEGAPGKDGGRGltgpigppgpagANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEA 902
Cdd:NF038329   252 PDGPAGKDGPRG------------DRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKD 319
                          250       260
                   ....*....|....*....|...
gi 1034578030  903 GQKGDAGAPGPQGPSGAPGPQGP 925
Cdd:NF038329   320 GQPGKDGLPGKDGKDGQPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
987-1182 2.18e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.15  E-value: 2.18e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  987 GRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGP 1066
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1067 PGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGvKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMG 1146
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDG 275
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1034578030 1147 PSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRG 1182
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDG 311
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
488-709 3.84e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 110.38  E-value: 3.84e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  488 PGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPG 567
Cdd:NF038329   128 AGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQG 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  568 QDGLAGPKGAPGERGPSGLAGPKGangdPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 647
Cdd:NF038329   208 PAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPV 283
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  648 GVMGFPGPKGANGEPGKAGEKGLPGAPglrGLPGKDGETGAAGPPGPAGPAGERGEQGAPGP 709
Cdd:NF038329   284 GPAGKDGQNGKDGLPGKDGKDGQNGKD---GLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
904-1163 4.37e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 110.38  E-value: 4.37e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  904 QKGDAGAPGPQGPSGAPGPQGPtgvTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDS 983
Cdd:NF038329   112 QLKGDGEKGEPGPAGPAGPAGE---QGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPA 188
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  984 GPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQrGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGD 1063
Cdd:NF038329   189 GEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGE 267
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1064 RGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGdrgetgavgapgapgppgspgpagPTGKQGDRGEAGAQG 1143
Cdd:NF038329   268 AGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNG------------------------KDGLPGKDGKDGQPG 323
                          250       260
                   ....*....|....*....|
gi 1034578030 1144 PMGPSGPAGARGIQGPQGPR 1163
Cdd:NF038329   324 KDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
511-780 3.99e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 107.30  E-value: 3.99e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  511 FKGEQGPKGEPGPAGPQGAPGPAGEEGKRGargepggvgpigppgERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPK 590
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETG---------------PAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKD 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  591 GANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGaPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGL 670
Cdd:NF038329   180 GEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDG-EAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGK 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  671 PGAPGLRGLPGKDGetgaagppgpagPAGERGEQGAPGPSGFqglpgppgppgeggkpgdQGVPGEAGAPGLVGPRGERG 750
Cdd:NF038329   259 DGPRGDRGEAGPDG------------PDGKDGERGPVGPAGK------------------DGQNGKDGLPGKDGKDGQNG 308
                          250       260       270
                   ....*....|....*....|....*....|
gi 1034578030  751 FPGERGSPGAQGLQGPRGLPGTPGTDGPKG 780
Cdd:NF038329   309 KDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
700-943 7.02e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 106.53  E-value: 7.02e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  700 ERGEQGAPGPSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPK 779
Cdd:NF038329   118 EKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPR 197
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  780 GASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGD--RGDVGEKGPEGAPGKDGGRGLTGPigppgpagaNGEKGEVGPP 857
Cdd:NF038329   198 GETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGK---------DGPRGDRGEA 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  858 GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGA 937
Cdd:NF038329   269 GPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTPEVP 348

                   ....*.
gi 1034578030  938 QGPPGA 943
Cdd:NF038329   349 QKPDTA 354
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
569-827 1.47e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 105.76  E-value: 1.47e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  569 DGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPG 648
Cdd:NF038329   116 DGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQG 195
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  649 VMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGEtgaagppgpagpaGERGEQGAPGPsgfqglpgppgppgeggkp 728
Cdd:NF038329   196 PRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------------GQQGPDGDPGP------------------- 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  729 gdQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGI 808
Cdd:NF038329   244 --TGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
                          250
                   ....*....|....*....
gi 1034578030  809 AGPKGDRGDVGEKGPEGAP 827
Cdd:NF038329   322 PGKDGLPGKDGKDGQPGKP 340
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
81-136 1.83e-23

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 94.80  E-value: 1.83e-23
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034578030   81 CVQDGQRYNDKDVWKPEPCRICVCDTGTVLCDDIICEDVkDCLSP--EIPFGECCPIC 136
Cdd:pfam00093    1 CVQNGVVYENGETWKPDLCTICTCDDGKVLCDKIICPPL-DCPNPrlEIPPGECCPVC 57
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
304-605 2.34e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 104.99  E-value: 2.34e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  304 EAGKPGKAGERGPPGPQGARgfpgtpglpgvkghrgypGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERgrt 383
Cdd:NF038329   118 EKGEPGPAGPAGPAGEQGPR------------------GDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPA--- 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  384 gpagaagargndgqpgpagppgpvgpaggpgfpgapgakGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGI 463
Cdd:NF038329   177 ---------------------------------------GKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGE 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  464 PGAKGSAGAPGIAGApgfpgprgppgpqgatgplGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARG 543
Cdd:NF038329   218 AGPAGEDGPAGPAGD-------------------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDG 278
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  544 EPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLP 605
Cdd:NF038329   279 ERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
VWC smart00214
von Willebrand factor (vWF) type C domain;
81-136 5.07e-23

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 93.35  E-value: 5.07e-23
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034578030    81 CVQDGQRYNDKDVWKPEPCRICVCDTG-TVLCDDIICEDVKDCLSPE--IPFGECCPIC 136
Cdd:smart00214    1 CVHNGRVYNDGETWKPDPCQICTCLDGtTVLCDPVECPPPPDCPNPErvKPPGECCPRC 59
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
261-474 1.56e-16

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 84.19  E-value: 1.56e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  261 GPAGAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGY 340
Cdd:NF038329   126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  341 PGLDGAKGEAGAPGVKGESGSPGENGSpGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPG 420
Cdd:NF038329   206 QGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVG 284
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034578030  421 AKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPG 474
Cdd:NF038329   285 PAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
855-1060 1.89e-09

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 61.97  E-value: 1.89e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  855 GPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGA 934
Cdd:COG5164      7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  935 RGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSG 1014
Cdd:COG5164     87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTT 166
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1034578030 1015 AEGPPGPQGLAGQRGiVGLPGQRGERGFPGLPGPSGEPGKQGAPGA 1060
Cdd:COG5164    167 PPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKK 211
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
852-908 1.35e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 52.50  E-value: 1.35e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  852 GEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDA 908
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
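The G-X-Y periodicity described for pfam01391 can be checked directly on the query segment aligned above (residues 852-908). The following is a minimal sketch, not part of the CDD output: the helper names `gxy_triplets` and `glycine_fraction` are hypothetical, and the check assumes a gap-free segment whose first residue is in the glycine position of the repeat.

```python
def gxy_triplets(segment: str) -> list[str]:
    """Split a gap-free peptide segment into consecutive 3-residue frames.

    Trailing residues that do not fill a complete triplet are dropped.
    """
    usable = len(segment) - len(segment) % 3
    return [segment[i:i + 3] for i in range(0, usable, 3)]


def glycine_fraction(segment: str) -> float:
    """Fraction of triplets whose first residue is glycine (G)."""
    triplets = gxy_triplets(segment)
    if not triplets:
        return 0.0
    return sum(t[0] == "G" for t in triplets) / len(triplets)


# Query residues 852-908 of gi 1034578030, copied from the pfam01391 alignment.
QUERY_852_908 = "GEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDA"

print(len(gxy_triplets(QUERY_852_908)), glycine_fraction(QUERY_852_908))
```

A fraction near 1.0 is what the family description predicts for a triple-helix region. A segment that starts out of frame, or one containing alignment gaps, would need to be re-phased or cleaned before this check is meaningful.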
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
816-1024 1.08e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.92  E-value: 1.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  816 GDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGA 895
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  896 KGEQGEAGQKGDAGAP---GPQGPSGAPGPQG-PTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPS 971
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPapaAPAAPAGAAPAQPaPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPP 749
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034578030  972 GKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDgpsgAEGPPGPQGL 1024
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED----DAPSMDDEDR 798
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
734-945 1.39e-07

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 56.19  E-value: 1.39e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  734 PGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKG 813
Cdd:COG5164      6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQ 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  814 DRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGP------------------PGPAGSAGARGAPGERGET 875
Cdd:COG5164     86 NQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPpsggsttppgdggstppgPGSTGPGGSTTPPGDGGST 165
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034578030  876 GPPGPAGFAGPPGADGQ--PGAKGEQGEAGQKGDAGAPGPQGPSGAPG-------PQGPTGVTGPKGARGAQGPPGATG 945
Cdd:COG5164    166 TPPGPGGSTTPPDDGGSttPPNKGETGTDIPTGGTPRQGPDGPVKKDDkngkgnpPDDRGGKTGPKDQRPKTNPIERRG 244
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
309-365 4.34e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.26  E-value: 4.34e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  309 GKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGEN 365
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
148-378 9.79e-07

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 52.99  E-value: 9.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  148 PGPKGQKGEPGdikdivgPKGPPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGppgppgppglggnf 227
Cdd:NF038329   161 KGPAGPQGEAG-------PQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG-------------- 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  228 aAQMAGGFDEKAGGAQLGVmqgpmgpmgprgppgpagapgpqgfQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGK 307
Cdd:NF038329   220 -PAGEDGPAGPAGDGQQGP-------------------------DGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGP 273
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034578030  308 PGKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDgakgeaGAPGVKGESGSPGENGSPGPMGPRGLPG 378
Cdd:NF038329   274 DGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKD------GLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
422-617 1.60e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.07  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  422 KGEAGPTGARGPEGAQGPR--GEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAgapgfPGPRGPPGPQGATGPLGP 499
Cdd:PRK07764   592 PGAAGGEGPPAPASSGPPEeaARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVA-----APEHHPKHVAVPDASDGG 666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  500 KGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPG 579
Cdd:PRK07764   667 DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPD 746
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1034578030  580 ErgPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDA 617
Cdd:PRK07764   747 D--PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE 782
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
493-743 2.07e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 52.34  E-value: 2.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  493 ATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVgpigppgerGAPGNRGFPGQDGLA 572
Cdd:COG5164      8 KTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGT---------RPAGNQGATGPAQNQ 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  573 GPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGF 652
Cdd:COG5164     79 GGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTP 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  653 PGPKGANGEPGKAGEKGLPGAPGlRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGDQG 732
Cdd:COG5164    159 PGDGGSTTPPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQRPKT 237
                          250
                   ....*....|.
gi 1034578030  733 VPGEAGAPGLV 743
Cdd:COG5164    238 NPIERRGPERP 248
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
556-608 1.90e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 1.90e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034578030  556 ERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGAR 608
Cdd:pfam01391    5 PPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PRK12678 PRK12678
transcription termination factor Rho; Provisional
976-1181 6.40e-05

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 47.59  E-value: 6.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  976 PKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGlpGQRGERGFPGLPGPSGEPGKQ 1055
Cdd:PRK12678    77 ARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRER--GEAARRGAARKAGEGGEQPAT 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1056 GAPGASGDRGPPGpvgppgltgpagEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGD 1135
Cdd:PRK12678   155 EARADAAERTEEE------------ERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRD 222
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1034578030 1136 RGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHR 1181
Cdd:PRK12678   223 GGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRD 268
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
737-944 7.11e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 7.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  737 AGAPGLVGPRGERGfPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRG 816
Cdd:PRK07764   589 GPAPGAAGGEGPPA-PASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGD 667
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  817 DVGEKGPEGAPGKDGGRGltgpigppgpagangekGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPgadgqPGAK 896
Cdd:PRK07764   668 GWPAKAGGAAPAAPPPAP-----------------APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP-----PQAA 725
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1034578030  897 GEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGAT 944
Cdd:PRK07764   726 QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAA 773
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1134-1243 8.54e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 46.82  E-value: 8.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1134 GDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPGPSGDQGASGPAGPSGPRGP 1213
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110
                   ....*....|....*....|....*....|
gi 1034578030 1214 PGPVGPSGKDGANGIPGPIGPPGPRGRSGE 1243
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGP 226
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
735-780 3.76e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 3.76e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1034578030  735 GEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKG 780
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPG 46
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
1330-1367 6.03e-04

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, ficolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, found in the PF00147 homology region of those proteins but also as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 39.08  E-value: 6.03e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034578030 1330 TCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMET 1367
Cdd:NF040941     1 SCWEILQAGPSAPSGVYWIDPDGMGGLAPFQVYCDMTT 38
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1130-1176 5.32e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 5.32e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1034578030 1130 TGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERG 1176
Cdd:pfam01391    9 PGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1300-1535 2.45e-151

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 459.63  E-value: 2.45e-151
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  1300 DAEVDATLKSLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETGETCVYPNPANV 1379
Cdd:smart00038    1 DEEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFETGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  1380 PKKNWWSSKSKekkHIWFGETINGGFHFSYGDDNLAPnTANVQMTFLRLLSTEGSQNITYHCKNSIAYLDEAAGNLKKAL 1459
Cdd:smart00038   81 PRKTWYSGKSK---HVWFGETMNGGFKFSYGDSEGPP-VGVVQLTFLRLLSTEAHQNITYHCKNSVAYMDEATGNLKKAL 156
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034578030  1460 LIQGSNDVEIRAEGNSRFTYTALKDGCTKHTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCFL 1535
Cdd:smart00038  157 RLRGSNDVELSAEGNSKFTYEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIVDIAPSDIGGPDQEFGVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
849-1059 6.94e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 6.94e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  849 GEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAG---QKGDAGAPGPQGPSGAPGPQGP 925
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGpqgPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  926 TGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNP--------GPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGP 997
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGqqgpdgdpGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  998 AGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPG 1059
Cdd:NF038329   277 DGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
870-1104 1.31e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 135.80  E-value: 1.31e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  870 GERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGA 949
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  950 AGRVGPPGSNGNPGPPGPPGPSGKDGPKGA-----RGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGL 1024
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPagpagDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1025 AGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASgdrgppgpvgppgltgpaGEPGREGSPGADGPPGRDGAAGVKG 1104
Cdd:NF038329   277 DGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLP------------------GKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
422-685 1.07e-29

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 124.25  E-value: 1.07e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  422 KGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAgapgfpgprgppgpqgatGPLGPKG 501
Cdd:NF038329   119 KGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQ------------------GPAGKDG 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  502 QTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGpigppgergaPGNRGFPGQDGLAGPKGAPGER 581
Cdd:NF038329   181 EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAG----------PAGDGQQGPDGDPGPTGEDGPQ 250
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  582 GPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPsgapgedgrpgppgpqgaRGQPGVMGFPGPKGANGE 661
Cdd:NF038329   251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGK------------------DGLPGKDGKDGQNGKDGL 312
                          250       260
                   ....*....|....*....|....
gi 1034578030  662 PGKAGEKGLPGAPGLRGLPGKDGE 685
Cdd:NF038329   313 PGKDGKDGQPGKDGLPGKDGKDGQ 336
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
751-1021 1.43e-29

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 123.86  E-value: 1.43e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  751 FPGERGSPGAQGLQGPRGLPGTPGTDGPKGAsgpagppgaqgppglQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPGKD 830
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGP---------------AGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKD 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  831 GgrgltgpigppgpagangEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEaGQKGDAGA 910
Cdd:NF038329   180 G------------------EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGD 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  911 PGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAG 990
Cdd:NF038329   241 PGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDG 320
                          250       260       270
                   ....*....|....*....|....*....|.
gi 1034578030  991 EPGLQGPAGPPGEKGEPGDDGPSGAEGPPGP 1021
Cdd:NF038329   321 QPGKDGLPGKDGKDGQPGKPAPKTPEVPQKP 351
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
663-925 5.78e-26

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 113.08  E-value: 5.78e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  663 GKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPsgfqglpgppgppgeggkpgdqgvPGEAGAPGL 742
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGP------------------------AGPQGEAGP 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  743 VGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGeRGAAGIAGPKGDRGDVGEKG 822
Cdd:NF038329   173 QGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQG 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  823 PEGAPGKDGGRGltgpigppgpagANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEA 902
Cdd:NF038329   252 PDGPAGKDGPRG------------DRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKD 319
                          250       260
                   ....*....|....*....|...
gi 1034578030  903 GQKGDAGAPGPQGPSGAPGPQGP 925
Cdd:NF038329   320 GQPGKDGLPGKDGKDGQPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
987-1182 2.18e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.15  E-value: 2.18e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  987 GRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGP 1066
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1067 PGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGvKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMG 1146
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDG 275
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1034578030 1147 PSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRG 1182
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDG 311
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
488-709 3.84e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 110.38  E-value: 3.84e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  488 PGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPG 567
Cdd:NF038329   128 AGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQG 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  568 QDGLAGPKGAPGERGPSGLAGPKGangdPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 647
Cdd:NF038329   208 PAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPV 283
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  648 GVMGFPGPKGANGEPGKAGEKGLPGAPglrGLPGKDGETGAAGPPGPAGPAGERGEQGAPGP 709
Cdd:NF038329   284 GPAGKDGQNGKDGLPGKDGKDGQNGKD---GLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
904-1163 4.37e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 110.38  E-value: 4.37e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  904 QKGDAGAPGPQGPSGAPGPQGPtgvTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDS 983
Cdd:NF038329   112 QLKGDGEKGEPGPAGPAGPAGE---QGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPA 188
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  984 GPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQrGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGD 1063
Cdd:NF038329   189 GEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGE 267
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1064 RGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGdrgetgavgapgapgppgspgpagPTGKQGDRGEAGAQG 1143
Cdd:NF038329   268 AGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNG------------------------KDGLPGKDGKDGQPG 323
                          250       260
                   ....*....|....*....|
gi 1034578030 1144 PMGPSGPAGARGIQGPQGPR 1163
Cdd:NF038329   324 KDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
511-780 3.99e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 107.30  E-value: 3.99e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  511 FKGEQGPKGEPGPAGPQGAPGPAGEEGKRGargepggvgpigppgERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPK 590
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETG---------------PAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKD 179
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  591 GANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGaPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGL 670
Cdd:NF038329   180 GEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDG-EAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGK 258
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  671 PGAPGLRGLPGKDGetgaagppgpagPAGERGEQGAPGPSGFqglpgppgppgeggkpgdQGVPGEAGAPGLVGPRGERG 750
Cdd:NF038329   259 DGPRGDRGEAGPDG------------PDGKDGERGPVGPAGK------------------DGQNGKDGLPGKDGKDGQNG 308
                          250       260       270
                   ....*....|....*....|....*....|
gi 1034578030  751 FPGERGSPGAQGLQGPRGLPGTPGTDGPKG 780
Cdd:NF038329   309 KDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
700-943 7.02e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 106.53  E-value: 7.02e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  700 ERGEQGAPGPSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPK 779
Cdd:NF038329   118 EKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPR 197
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  780 GASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGD--RGDVGEKGPEGAPGKDGGRGLTGPigppgpagaNGEKGEVGPP 857
Cdd:NF038329   198 GETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGK---------DGPRGDRGEA 268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  858 GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGA 937
Cdd:NF038329   269 GPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTPEVP 348

                   ....*.
gi 1034578030  938 QGPPGA 943
Cdd:NF038329   349 QKPDTA 354
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
569-827 1.47e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 105.76  E-value: 1.47e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  569 DGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPG 648
Cdd:NF038329   116 DGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQG 195
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  649 VMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGEtgaagppgpagpaGERGEQGAPGPsgfqglpgppgppgeggkp 728
Cdd:NF038329   196 PRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------------GQQGPDGDPGP------------------- 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  729 gdQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGI 808
Cdd:NF038329   244 --TGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
                          250
                   ....*....|....*....
gi 1034578030  809 AGPKGDRGDVGEKGPEGAP 827
Cdd:NF038329   322 PGKDGLPGKDGKDGQPGKP 340
VWC pfam00093
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ...
81-136 1.83e-23

von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.


Pssm-ID: 278520  Cd Length: 57  Bit Score: 94.80  E-value: 1.83e-23
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1034578030   81 CVQDGQRYNDKDVWKPEPCRICVCDTGTVLCDDIICEDVkDCLSP--EIPFGECCPIC 136
Cdd:pfam00093    1 CVQNGVVYENGETWKPDLCTICTCDDGKVLCDKIICPPL-DCPNPrlEIPPGECCPVC 57
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
304-605 2.34e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 104.99  E-value: 2.34e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  304 EAGKPGKAGERGPPGPQGARgfpgtpglpgvkghrgypGLDGAKGEAGAPGVKGESGSPGENGSPGPMGPRGLPGERgrt 383
Cdd:NF038329   118 EKGEPGPAGPAGPAGEQGPR------------------GDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPA--- 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  384 gpagaagargndgqpgpagppgpvgpaggpgfpgapgakGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGI 463
Cdd:NF038329   177 ---------------------------------------GKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGE 217
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  464 PGAKGSAGAPGIAGApgfpgprgppgpqgatgplGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARG 543
Cdd:NF038329   218 AGPAGEDGPAGPAGD-------------------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDG 278
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  544 EPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLP 605
Cdd:NF038329   279 ERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
VWC smart00214
von Willebrand factor (vWF) type C domain;
81-136 5.07e-23

von Willebrand factor (vWF) type C domain;


Pssm-ID: 214564  Cd Length: 59  Bit Score: 93.35  E-value: 5.07e-23
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 1034578030    81 CVQDGQRYNDKDVWKPEPCRICVCDTG-TVLCDDIICEDVKDCLSPE--IPFGECCPIC 136
Cdd:smart00214    1 CVHNGRVYNDGETWKPDPCQICTCLDGtTVLCDPVECPPPPDCPNPErvKPPGECCPRC 59
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
261-474 1.56e-16

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 84.19  E-value: 1.56e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  261 GPAGAPGPQGFQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGY 340
Cdd:NF038329   126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  341 PGLDGAKGEAGAPGVKGESGSPGENGSpGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFPGAPG 420
Cdd:NF038329   206 QGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVG 284
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034578030  421 AKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPG 474
Cdd:NF038329   285 PAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
855-1060 1.89e-09

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 61.97  E-value: 1.89e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  855 GPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGA 934
Cdd:COG5164      7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  935 RGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSG 1014
Cdd:COG5164     87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTT 166
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1034578030 1015 AEGPPGPQGLAGQRGiVGLPGQRGERGFPGLPGPSGEPGKQGAPGA 1060
Cdd:COG5164    167 PPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKK 211
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
800-1060 1.03e-08

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 59.66  E-value: 1.03e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  800 PGERGAAGIAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPG 879
Cdd:COG5164      6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQ 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  880 PAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGpTGVTGPKGARGAQGPPGATGFPGAAgrVGPPGSN 959
Cdd:COG5164     86 NQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPS-GGSTTPPGDGGSTPPGPGSTGPGGS--TTPPGDG 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  960 GNPGPPGPPGPSGKDGPKGArGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAG----QRGIVGLPG 1035
Cdd:COG5164    163 GSTTPPGPGGSTTPPDDGGS-TTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTgpkdQRPKTNPIE 241
                          250       260
                   ....*....|....*....|....*
gi 1034578030 1036 QRGERGFPGLPGPSGEPGKQGAPGA 1060
Cdd:COG5164    242 RRGPERPEAAALPAELTALEAENRA 266
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
852-908 1.35e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 52.50  E-value: 1.35e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  852 GEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDA 908
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
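
The G-X-Y periodicity described in the family summary can be checked directly on the aligned query segment. The following is a minimal sketch, not part of the CDD output: the function name `glycine_triplet_fraction` and its `frame` parameter are hypothetical, chosen here for illustration. It assumes the segment is read in non-overlapping triplets and reports the fraction of triplets whose first residue is glycine.

```python
def glycine_triplet_fraction(seq: str, frame: int = 0) -> float:
    """Return the fraction of complete triplets, read from `frame`, that start with G."""
    residues = seq.upper()[frame:]
    usable = len(residues) - len(residues) % 3  # ignore a trailing partial triplet
    triplets = [residues[i:i + 3] for i in range(0, usable, 3)]
    if not triplets:
        return 0.0
    return sum(t[0] == "G" for t in triplets) / len(triplets)


# Query residues 852-908, copied from the pfam01391 alignment above.
collagen_segment = "GEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDA"

print(len(collagen_segment))                       # 57 residues, i.e. 19 triplets
print(glycine_triplet_fraction(collagen_segment))  # 1.0: every triplet starts with G
```

Shifting the reading frame by one residue (`frame=1`) puts glycine at the third position of every triplet, so the fraction falls to 0.0. The check is therefore sensitive to where the repeat is assumed to start.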
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
816-1024 1.08e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.92  E-value: 1.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  816 GDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGA 895
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  896 KGEQGEAGQKGDAGAP---GPQGPSGAPGPQG-PTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPS 971
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPapaAPAAPAGAAPAQPaPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPP 749
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034578030  972 GKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDgpsgAEGPPGPQGL 1024
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAED----DAPSMDDEDR 798
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
900-1107 1.08e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 56.92  E-value: 1.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  900 GEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGA 979
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  980 RGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPG 1059
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPP 749
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1034578030 1060 ASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRG 1107
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDED 797
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
734-945 1.39e-07

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 56.19  E-value: 1.39e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  734 PGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKG 813
Cdd:COG5164      6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQ 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  814 DRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGP------------------PGPAGSAGARGAPGERGET 875
Cdd:COG5164     86 NQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPpsggsttppgdggstppgPGSTGPGGSTTPPGDGGST 165
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034578030  876 GPPGPAGFAGPPGADGQ--PGAKGEQGEAGQKGDAGAPGPQGPSGAPG-------PQGPTGVTGPKGARGAQGPPGATG 945
Cdd:COG5164    166 TPPGPGGSTTPPDDGGSttPPNKGETGTDIPTGGTPRQGPDGPVKKDDkngkgnpPDDRGGKTGPKDQRPKTNPIERRG 244
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
858-913 2.93e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.64  E-value: 2.93e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034578030  858 GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGP 913
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
876-931 3.90e-07


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.26  E-value: 3.90e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1034578030  876 GPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGP 931
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
309-365 4.34e-07


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.26  E-value: 4.34e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  309 GKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGEN 365
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
855-911 5.18e-07


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 47.87  E-value: 5.18e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  855 GPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAP 911
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
885-941 7.74e-07


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 47.49  E-value: 7.74e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  885 GPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPP 941
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
849-1028 8.35e-07


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.84  E-value: 8.35e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  849 GEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPG--------AKGEQGEAGQKGDAGAPGP-QGPSGA 919
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAApaeasaapAPGVAAPEHHPKHVAVPDAsDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  920 PGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPsgkdgPKGARGDSGPPGRAGEPGLQGPAG 999
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQ-----PPQAAQGASAPSPAADDPVPLPPE 744
                          170       180
                   ....*....|....*....|....*....
gi 1034578030 1000 PPGEKGEPGDDGPSGAEGPPGPQGLAGQR 1028
Cdd:PRK07764   745 PDDPPDPAGAPAQPPPPPAPAPAAAPAAA 773
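
Each query row in these alignments carries a start coordinate, the aligned string, and an end coordinate. As a sanity check on a gapped row, the sketch below parses one query row and confirms that the number of residues equals end minus start plus one. It assumes that `-` marks a gap column and that every letter, upper or lower case, is one query residue. The regular expression and the function name `check_query_row` are illustrative, not part of any NCBI tool.

```python
import re

# Matches a query row: "gi <id> <start> <aligned string> <end>".
ROW = re.compile(r"^gi\s+\d+\s+(\d+)\s+(\S+)\s+(\d+)$")


def check_query_row(line: str) -> tuple[int, int, int]:
    """Return (start, end, residue_count) for one query alignment row."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError("not a query alignment row")
    start, aligned, end = int(match.group(1)), match.group(2), int(match.group(3))
    # Gap columns ('-') consume no query residues.
    residues = sum(1 for ch in aligned if ch != "-")
    return start, end, residues


# First query row of the PRK07764 alignment above (interval 849-1028).
row = ("gi 1034578030  849 GEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPG"
       "--------AKGEQGEAGQKGDAGAPGP-QGPSGA 919")

start, end, residues = check_query_row(row)
print(start, end, residues, end - start + 1 == residues)
```

The same check applies to the `Cdd:` rows if the pattern's leading token is adjusted accordingly.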
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
148-378 9.79e-07

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2, streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 52.99  E-value: 9.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  148 PGPKGQKGEPGdikdivgPKGPPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGppgppgppglggnf 227
Cdd:NF038329   161 KGPAGPQGEAG-------PQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG-------------- 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  228 aAQMAGGFDEKAGGAQLGVmqgpmgpmgprgppgpagapgpqgfQGNPGEPGEPGVSGPMGPRGPPGPPGKPGDDGEAGK 307
Cdd:NF038329   220 -PAGEDGPAGPAGDGQQGP-------------------------DGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGP 273
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034578030  308 PGKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDgakgeaGAPGVKGESGSPGENGSPGPMGPRGLPG 378
Cdd:NF038329   274 DGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKD------GLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
984-1040 1.25e-06


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 46.72  E-value: 1.25e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  984 GPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGER 1040
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
422-617 1.60e-06


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 53.07  E-value: 1.60e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  422 KGEAGPTGARGPEGAQGPR--GEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAgapgfPGPRGPPGPQGATGPLGP 499
Cdd:PRK07764   592 PGAAGGEGPPAPASSGPPEeaARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVA-----APEHHPKHVAVPDASDGG 666
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  500 KGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPG 579
Cdd:PRK07764   667 DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPD 746
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1034578030  580 ErgPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDA 617
Cdd:PRK07764   747 D--PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEE 782
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
493-743 2.07e-06


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 52.34  E-value: 2.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  493 ATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVgpigppgerGAPGNRGFPGQDGLA 572
Cdd:COG5164      8 KTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGT---------RPAGNQGATGPAQNQ 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  573 GPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGF 652
Cdd:COG5164     79 GGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTP 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  653 PGPKGANGEPGKAGEKGLPGAPGlRGLPGKDGETGAAGPPGPAGPAGERGEQGAPGPSGFQGLPGPPGPPGEGGKPGDQG 732
Cdd:COG5164    159 PGDGGSTTPPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQRPKT 237
                          250
                   ....*....|.
gi 1034578030  733 VPGEAGAPGLV 743
Cdd:COG5164    238 NPIERRGPERP 248
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
864-920 2.25e-06


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.95  E-value: 2.25e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  864 GARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAP 920
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
975-1029 6.01e-06


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 6.01e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1034578030  975 GPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRG 1029
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
894-950 9.08e-06


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.41  E-value: 9.08e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  894 GAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAA 950
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
662-934 1.10e-05


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 50.03  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  662 PGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERG---EQGAPGPSGFQGLPGPPGPPGEGGKPGDQGVPGEAG 738
Cdd:COG5164      6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRpaqNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQ 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  739 APGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDV 818
Cdd:COG5164     86 NQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGST 165
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  819 GEKGPEGAPGKDGGRGLTGPigppgpaganGEKGEVGPPGPAGSAgarGAPGERGETGPPGPAGFAGPPgaDGQPGAKGE 898
Cdd:COG5164    166 TPPGPGGSTTPPDDGGSTTP----------PNKGETGTDIPTGGT---PRQGPDGPVKKDDKNGKGNPP--DDRGGKTGP 230
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1034578030  899 QGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGA 934
Cdd:COG5164    231 KDQRPKTNPIERRGPERPEAAALPAELTALEAENRA 266
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
848-898 1.13e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.02  E-value: 1.13e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1034578030  848 NGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 898
Cdd:pfam01391    6 PGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
996-1052 1.39e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.02  E-value: 1.39e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  996 GPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEP 1052
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
312-368 1.84e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 1.84e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  312 GERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAPGVKGESGSPGENGSP 368
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
556-608 1.90e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 1.90e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1034578030  556 ERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGAR 608
Cdd:pfam01391    5 PPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
422-473 4.00e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 4.00e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  422 KGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAP 473
Cdd:pfam01391    6 PGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
423-476 4.20e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 4.20e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1034578030  423 GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIA 476
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAP 54
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
519-709 4.81e-05


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.06  E-value: 4.81e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  519 GEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGR 598
Cdd:PRK07764   590 PAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  599 PGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLPGAPGLRG 678
Cdd:PRK07764   670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPP 749
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1034578030  679 LPGKDGETGAAGPPGPAGPAGERGEQGAPGP 709
Cdd:PRK07764   750 DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
PRK12678 PRK12678
transcription termination factor Rho; Provisional
976-1181 6.40e-05


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 47.59  E-value: 6.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  976 PKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGlpGQRGERGFPGLPGPSGEPGKQ 1055
Cdd:PRK12678    77 ARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRER--GEAARRGAARKAGEGGEQPAT 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1056 GAPGASGDRGPPGpvgppgltgpagEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGD 1135
Cdd:PRK12678   155 EARADAAERTEEE------------ERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDRREERGRRD 222
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1034578030 1136 RGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHR 1181
Cdd:PRK12678   223 GGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRD 268
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
737-944 7.11e-05


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 47.67  E-value: 7.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  737 AGAPGLVGPRGERGfPGERGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRG 816
Cdd:PRK07764   589 GPAPGAAGGEGPPA-PASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGD 667
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  817 DVGEKGPEGAPGKDGGRGltgpigppgpagangekGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPgadgqPGAK 896
Cdd:PRK07764   668 GWPAKAGGAAPAAPPPAP-----------------APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQP-----PQAA 725
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 1034578030  897 GEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGAT 944
Cdd:PRK07764   726 QGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAA 773
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
903-952 7.88e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 7.88e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 1034578030  903 GQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGR 952
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGA 50
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
495-541 8.20e-05


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 8.20e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1034578030  495 GPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGA 541
Cdd:pfam01391    7 GPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGA 53
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1134-1243 8.54e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2, streptococcal collagen-like protein 2) is an LPXTG-anchored surface adhesin with a variable-length region of triple-helix-forming, collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 46.82  E-value: 8.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1134 GDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQGLPGPPGPSGDQGASGPAGPSGPRGP 1213
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110
                   ....*....|....*....|....*....|
gi 1034578030 1214 PGPVGPSGKDGANGIPGPIGPPGPRGRSGE 1243
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGP 226
PRK12678 PRK12678
transcription termination factor Rho; Provisional
855-1041 1.73e-04



Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 46.05  E-value: 1.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  855 GPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPtgvtgpkgA 934
Cdd:PRK12678    61 GGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERR--------E 132
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  935 RGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSG 1014
Cdd:PRK12678   133 RGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQG 212
                          170       180
                   ....*....|....*....|....*..
gi 1034578030 1015 AEGPPGPQGLAGQRGivGLPGQRGERG 1041
Cdd:PRK12678   213 DRREERGRRDGGDRR--GRRRRRDRRD 237
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
855-1176 2.03e-04



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.13  E-value: 2.03e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  855 GPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPG--PQGPSGAPGPQGPTGVTGPK 932
Cdd:PRK07764   434 APAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAaaPAAPAAPAAPAGADDAATLR 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  933 GA--------------------RGAQ--GPPGAT---GF--PGAAGRVGPPGSNG------------------NPGPPGP 967
Cdd:PRK07764   514 ERwpeilaavpkrsrktwaillPEATvlGVRGDTlvlGFstGGLARRFASPGNAEvlvtalaeelggdwqveaVVGPAPG 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  968 PGPSGKDGPKGARGDSGPPGRAGEPGlqGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPG 1047
Cdd:PRK07764   594 AAGGEGPPAPASSGPPEEAARPAAPA--APAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA 671
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1048 PSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPA 1127
Cdd:PRK07764   672 KAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDP 751
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1034578030 1128 GPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERG 1176
Cdd:PRK07764   752 AGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRD 800
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
519-589 3.31e-04



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.31e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034578030  519 GEPGPAGPQGAPGPAGEEGKRGArgepggvgpigppgeRGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGP 589
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGP---------------PGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
561-617 3.34e-04



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.34e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1034578030  561 GNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDA 617
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
735-780 3.76e-04



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 3.76e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1034578030  735 GEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGTDGPKG 780
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPG 46
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
859-1175 4.67e-04



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.98  E-value: 4.67e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  859 PAGSAGARG--APGERGETGPPgPAGFAGPPGADGQPGAkgeqgeagqkgdAGAPGPQGPSGAPGPQGPTGVTGPKGARG 936
Cdd:PRK07764   365 PSASDDERGllARLERLERRLG-VAGGAGAPAAAAPSAA------------AAAPAAAPAPAAAAPAAAAAPAPAAAPQP 431
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  937 AQGPPgatgfPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPgddgpSGAE 1016
Cdd:PRK07764   432 APAPA-----PAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAP-----AAPA 501
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1017 GPPGPQGLAGQRG----IVGLPGQRGER---------------------GF--PGLPGPSGEPG---------------- 1053
Cdd:PRK07764   502 APAGADDAATLRErwpeILAAVPKRSRKtwaillpeatvlgvrgdtlvlGFstGGLARRFASPGnaevlvtalaeelggd 581
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1054 -----KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGD-RGETGAVGAPGAPGPPGSPGPA 1127
Cdd:PRK07764   582 wqveaVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEaSAAPAPGVAAPEHHPKHVAVPD 661
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1034578030 1128 GPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGER 1175
Cdd:PRK07764   662 ASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATP 709
GGGWT_bact NF040941
fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, ...
1330-1367 6.03e-04

fibrinogen-like bacterial YCDxxxxGGGW domain; Pfam model PF00147, about 220 amino acids long, describes a conserved domain found in eukaryotic proteins such as fibrinogen beta and gamma chains, ficolin, and angiopoietin. This model describes a small homology domain, about 46 amino acids long, that is found within the PF00147 homology region of those proteins and also occurs as a much shorter homology domain in bacterial proteins that may lack homology to those proteins, or to each other, outside this region. The signature motif, at the C-terminus of this domain, is YCDxTTDGGGWxLV.


Pssm-ID: 468872 [Multi-domain]  Cd Length: 46  Bit Score: 39.08  E-value: 6.03e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1034578030 1330 TCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMET 1367
Cdd:NF040941     1 SCWEILQAGPSAPSGVYWIDPDGMGGLAPFQVYCDMTT 38
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
849-893 6.22e-04



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 6.22e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1034578030  849 GEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQP 893
Cdd:pfam01391   13 GPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
862-950 6.95e-04



Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 44.29  E-value: 6.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  862 SAGARGAPGerGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPP 941
Cdd:PRK14959   374 SGGGASAPS--GSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPR 451

                   ....*....
gi 1034578030  942 GATGFPGAA 950
Cdd:PRK14959   452 PAPRMPEAS 460
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
495-540 9.40e-04



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 9.40e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1034578030  495 GPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRG 540
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPG 46
PHA03169 PHA03169
hypothetical protein; Provisional
797-945 1.06e-03



Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.42  E-value: 1.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  797 QGMPGERGAAGIAGPKGDRGDVGEKGPEGApGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETG 876
Cdd:PHA03169    81 HGEKEERGQGGPSGSGSESVGSPTPSPSGS-AEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSP 159
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034578030  877 PPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATG 945
Cdd:PHA03169   160 NQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQ 228
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
525-599 1.25e-03



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.25e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034578030  525 GPQGAPGPAGEEGKrgargepggvgpigppgeRGAPGNRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRP 599
Cdd:pfam01391    1 GPPGPPGPPGPPGP------------------PGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
305-353 2.07e-03



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.86  E-value: 2.07e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 1034578030  305 AGKPGKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGAP 353
Cdd:pfam01391    9 PGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
705-1101 2.44e-03



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 2.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  705 GAPGPSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTPGtDGPKGASGP 784
Cdd:PRK07764   400 SAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPA-PAAAPEPTA 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  785 AGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPE--GAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPaGS 862
Cdd:PRK07764   479 APAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPEilAAVPKRSRKTWAILLPEATVLGVRGDTLVLGFSTG-GL 557
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  863 AGARGAPGE------------------RGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQG 924
Cdd:PRK07764   558 ARRFASPGNaevlvtalaeelggdwqvEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPA 637
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  925 PTGVTGPKGARGAQGPPGATGFPGAAGRVgppgsngnpgppgpPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEK 1004
Cdd:PRK07764   638 EASAAPAPGVAAPEHHPKHVAVPDASDGG--------------DGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP 703
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1005 GEPGDDGPSGAEGPPGPQGLAGQRGivGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPG- 1083
Cdd:PRK07764   704 APAATPPAGQADDPAAQPPQAAQGA--SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSe 781
                          410
                   ....*....|....*...
gi 1034578030 1084 REGSPGADGPPGRDGAAG 1101
Cdd:PRK07764   782 EEEMAEDDAPSMDDEDRR 799
PHA03169 PHA03169
hypothetical protein; Provisional
803-1023 3.35e-03



Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 41.88  E-value: 3.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  803 RGAAGIAGPKGDrgDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSA-----GARGAPGERGETGP 877
Cdd:PHA03169    38 GTAARAAKPAPP--APTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGsesvgSPTPSPSGSAEELA 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  878 PGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPgaagrvgppg 957
Cdd:PHA03169   116 SGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEP---------- 185
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034578030  958 sngnpgppgppGPSGKDGPKGARGDSGPPgragepglqgPAGPPGEKGEPGDDGPSGAEGPPGPQG 1023
Cdd:PHA03169   186 -----------EPDSPGPPQSETPTSSPP----------PQSPPDEPGEPQSPTPQQAPSPNTQQA 230
PPE COG5651
PPE-repeat protein [Function unknown];
858-1034 3.53e-03



Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 41.42  E-value: 3.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  858 GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGPKGAR-- 935
Cdd:COG5651    206 NQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAAtn 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  936 -GAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSG 1014
Cdd:COG5651    286 lGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAG 365
                          170       180
                   ....*....|....*....|
gi 1034578030 1015 AEGPPGPQGLAGQRGIVGLP 1034
Cdd:COG5651    366 GGGGSAGAAAGAASGGGAAA 385
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
852-1163 3.73e-03



Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 3.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  852 GEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGEQGEAGQKGDAGAPGPQGPSGAPGPQGPTGVTGP 931
Cdd:PRK07764   395 AAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAP 474
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  932 KGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPPGPPGPSGKD------------------------------------- 974
Cdd:PRK07764   475 EPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADDAATLRERWPeilaavpkrsrktwaillpeatvlgvrgdtlvlgfst 554
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  975 --------GPKGAR--------------------GDSGPPGRAGEPGLQGPAGPPGEKGEPGDDGPSGAEGPPGPQGLAG 1026
Cdd:PRK07764   555 gglarrfaSPGNAEvlvtalaeelggdwqveavvGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAA 634
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1027 QRGIVGLPGQRGERGFPGLPGPSGEP----GKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGV 1102
Cdd:PRK07764   635 APAEASAAPAPGVAAPEHHPKHVAVPdasdGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQA 714
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034578030 1103 KGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPR 1163
Cdd:PRK07764   715 DDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPP 775
PHA03169 PHA03169
hypothetical protein; Provisional
746-926 3.84e-03



Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 41.49  E-value: 3.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  746 RGERGFPGErGSPGAQGLQGPRGLPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEG 825
Cdd:PHA03169    81 HGEKEERGQ-GGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSP 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  826 APGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERG--ETGPPGPAGFAGPPGADGQPGAKGEQGEAG 903
Cdd:PHA03169   160 NQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPpdEPGEPQSPTPQQAPSPNTQQAVEHEDEPTE 239
                          170       180
                   ....*....|....*....|....
gi 1034578030  904 -QKGDAGAPGPQGPSGAPGPQGPT 926
Cdd:PHA03169   240 pEREGPPFPGHRSHSYTVVGWKPS 263
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
427-680 4.69e-03

ARC105 or Med15 subunit of Mediator complex, non-fungal; The approximately 70-residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 41.53  E-value: 4.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  427 PTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGIAGAPGFPGPRGPPGPQGATG----------- 495
Cdd:pfam09606  164 GQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAGAQMGQQAQANGGMNPQqmggapnqvam 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  496 ---PLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPAGEEGKRGARGEPGGVGPIGPPGERgaPGNRGFPGQDGLA 572
Cdd:pfam09606  244 qqqQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGDQNNYQQ--QQTRQQQQQQGGN 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030  573 GPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPGVMGF 652
Cdd:pfam09606  322 HPAAHQQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMSSPSPVPGQQVRQVTPNQFMRQSPQPSV 401
                          250       260
                   ....*....|....*....|....*...
gi 1034578030  653 PGPKGANGEPGKAGEKGLPGAPGLRGLP 680
Cdd:pfam09606  402 PSPQGPGSQPPQSHPGGMIPSPALIPSP 429
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1130-1176 5.32e-03



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 5.32e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1034578030 1130 TGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERG 1176
Cdd:pfam01391    9 PGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
730-773 6.10e-03



Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 6.10e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1034578030  730 DQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRGLPGTP 773
Cdd:pfam01391   14 PPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
597-665 6.35e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in the formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by prolyl hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 6.35e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034578030  597 GRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGedgrpgppgpqgARGQPGVMGFPGPKGANGEPGKA 665
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPG------------PPGPPGPPGPPGPPGAPGAPGPP 57
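The alignment above contains a run of lowercase query residues opposite `-` characters in the domain model row. A minimal sketch of how such a pair of rows could be summarised follows. It assumes (this is an interpretation, not something stated on the page) that a `-` in the model row marks query residues that are not aligned to the model, and the function name `alignment_stats` is hypothetical.

```python
def alignment_stats(query: str, subject: str) -> dict[str, int]:
    """Count aligned, identical, and gapped columns of one pairwise alignment."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    aligned = identical = query_only = subject_only = 0
    for q, s in zip(query, subject):
        if s == "-":          # query residue with no model position
            query_only += 1
        elif q == "-":        # model position with no query residue
            subject_only += 1
        else:
            aligned += 1
            identical += q.upper() == s.upper()
    return {
        "aligned": aligned,
        "identical": identical,
        "query_only": query_only,
        "subject_only": subject_only,
    }


# Rows copied from the alignment above (query 597-665 vs pfam01391 1-57).
query_row = "GRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGedgrpgppgpqgARGQPGVMGFPGPKGANGEPGKA"
model_row = "GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPG------------PPGPPGPPGPPGPPGAPGAPGPP"

stats = alignment_stats(query_row, model_row)
print(stats)
print(f"identity over aligned columns: {stats['identical'] / stats['aligned']:.1%}")
```

The aligned column count should equal the span of model positions covered (1-57), and the query-only count should equal the length of the lowercase run.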
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1013-1182 6.43e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 41.04  E-value: 6.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1013 SGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGspgadG 1092
Cdd:PRK12678    60 GGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRER-----G 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1093 PPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEP 1172
Cdd:PRK12678   135 EAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDR 214
                          170
                   ....*....|
gi 1034578030 1173 GERGLKGHRG 1182
Cdd:PRK12678   215 REERGRRDGG 224
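Long alignments such as the one above are wrapped into blocks of up to 80 columns, each with its own ruler lines. The sketch below shows one way to stitch the blocks back into a single row per sequence. The regular expression and the function name `stitch_blocks` are assumptions based only on the layout visible on this page, not on a documented CD-Search format.

```python
import re

# One alignment row: label, start coordinate, residues, end coordinate.
ROW = re.compile(r"^\s*(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def stitch_blocks(text: str) -> dict[str, tuple[int, str, int]]:
    """Join wrapped alignment blocks into (start, residues, end) per row label."""
    rows: dict[str, tuple[int, str, int]] = {}
    for line in text.splitlines():
        match = ROW.match(line)
        if match is None:
            continue  # ruler and column-number lines are skipped
        label, seq = match.group(1), match.group(3)
        start, end = int(match.group(2)), int(match.group(4))
        if label in rows:
            first_start, joined, _ = rows[label]
            rows[label] = (first_start, joined + seq, end)
        else:
            rows[label] = (start, seq, end)
    return rows


# The last two blocks of the PRK12678 alignment, copied from above.
blocks = """\
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034578030 1093 PPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAGPTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEP 1172
Cdd:PRK12678   135 EAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDDRDRRDRREQGDR 214
                   ....*....|
gi 1034578030 1173 GERGLKGHRG 1182
Cdd:PRK12678   215 REERGRRDGG 224
"""

stitched = stitch_blocks(blocks)
for label, (start, seq, end) in stitched.items():
    print(label, start, end, len(seq))
```

Because these two blocks contain no gap characters, the stitched length of each row should equal `end - start + 1`, which is a quick consistency check on the parse.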
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
810-881 9.70e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in the formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine; the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post-translationally modified by prolyl hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple-helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 35.93  E-value: 9.70e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034578030  810 GPKGDRGDVGEKGPEGAPGKDGGRGltgpigppgpagangEKGEVGPPGPAGSAGARGAPGERGETGPPGPA 881
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPG---------------PPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
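The E-value threshold of 0.01 listed above determines which hits are reported. The sketch below applies that threshold to the last five hits in the list and flags hits whose query intervals overlap. The `Hit` type and both function names are hypothetical, the interval of the first hit is taken from its alignment coordinates, and treating the threshold as inclusive is an assumption.

```python
from itertools import combinations
from typing import NamedTuple


class Hit(NamedTuple):
    accession: str
    start: int   # first query residue of the hit
    end: int     # last query residue of the hit
    evalue: float


# Values copied from the hit list above.
HITS = [
    Hit("pfam01391", 1130, 1176, 5.32e-03),
    Hit("pfam01391", 730, 773, 6.10e-03),
    Hit("pfam01391", 597, 665, 6.35e-03),
    Hit("PRK12678", 1013, 1182, 6.43e-03),
    Hit("pfam01391", 810, 881, 9.70e-03),
]

E_VALUE_THRESHOLD = 0.01  # from the preset options


def passing(hits: list[Hit], threshold: float) -> list[Hit]:
    """Keep hits whose E-value does not exceed the threshold (assumed inclusive)."""
    return [h for h in hits if h.evalue <= threshold]


def overlapping_pairs(hits: list[Hit]) -> list[tuple[Hit, Hit]]:
    """Return pairs of hits whose query intervals share at least one residue."""
    ordered = sorted(hits, key=lambda h: h.start)
    # After sorting by start, two intervals overlap when the later one
    # begins at or before the earlier one ends.
    return [(a, b) for a, b in combinations(ordered, 2) if b.start <= a.end]


kept = passing(HITS, E_VALUE_THRESHOLD)
print(f"{len(kept)} of {len(HITS)} hits pass E <= {E_VALUE_THRESHOLD}")
for a, b in overlapping_pairs(kept):
    print(f"overlap: {a.accession} {a.start}-{a.end} with {b.accession} {b.start}-{b.end}")
```

All five hits fall just under the threshold, and the PRK12678 interval (1013-1182) fully contains the pfam01391 hit at 1130-1176, so the two describe the same stretch of the query.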
