Conserved domains on  [gi|2009632981|ref|XP_040092241|]
nidogen-1 isoform X1 [Oryx dammah]

Protein Classification

List of domain hits

Name Accession Description Interval E-value
nidG2 cd00255
Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an ...
427-658 8.40e-116

Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an extracellular sheet-like matrix. Nidogen is a multifunctional protein that interacts with many other basement membrane proteins, such as collagen, perlecan, and laminin, and has a potential role in the assembly and connection of networks. Nidogen consists of 3 globular domains (G1-G3); G3 is the laminin-binding domain, while G2 binds collagen IV and perlecan. Also found in hemicentin, a protein that functions at various cell-cell and cell-matrix junctions and might assist in refining broad regions of cell contact into oriented, line-shaped junctions. Nidogen G2 consists of an N-terminal EGF-like domain (excluded from this alignment model) and an 11-stranded beta-barrel with a central helix, a topology that exhibits high structural similarity to the green fluorescent proteins of Cnidaria.


Pssm-ID: 238158  Cd Length: 224  Bit Score: 359.70  E-value: 8.40e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  427 GSPQRVNGKVKGRIFVGdsQVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPIGGIIGWMFAVEQDGFKNG 506
Cdd:cd00255      1 GIPQRVNGKVSGNINVG--QSPVEFGDADLHSYVVTSDGRAYTAISNIPESLGPSLRPLAPIGGTIGWLFALEQGGAKNG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  507 FSITGGEFTRQAEVTFVGHPDKLIIKQQFSGIDEHGHLTIDTELEGRVPQIAFGSSVHIEPYTELYHYSRQ-VITSSSTR 585
Cdd:cd00255     79 FSLTGGEFTRQAEVTFYTGGEKLRITQVARGLDSHGHLLLDTVISGRVPQVPAGATVHIEDYTELYHYTGPgVLTSSSTR 158
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2009632981  586 EYTVTEPeqhgtAPSHAHTYQWRQTITFQECVHDDsqPVLPSTQQLSVDSVFVLYNQEERILRYALSNSIGPV 658
Cdd:cd00255    159 EYTVDEG-----GESQTLSYQWNQTITYEECPHDD--EAAPDLQQLLVARIFALYNPEEEILRFAITNSIGPG 224
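
Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch, assuming that line layout is stable across hits, the fields could be pulled out with a regular expression. The names `SUMMARY_RE` and `parse_summary` are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool.

```python
import re

# Hypothetical pattern for the per-hit summary line; the "[Multi-domain]"
# tag is optional and only appears on some hits.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Summary line of the nidG2 (cd00255) hit, copied from the listing.
nidg2 = parse_summary(
    "Pssm-ID: 238158  Cd Length: 224  Bit Score: 359.70  E-value: 8.40e-116"
)
print(nidg2)
```

Keeping the E-value as a float makes it easy to sort or filter hits by significance, which is the order the listing itself appears to follow.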
NIDO smart00539
Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins;
106-269 1.93e-44

Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins;


Pssm-ID: 214712  Cd Length: 152  Bit Score: 157.97  E-value: 1.93e-44
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   106 PFLADLDTtDGLGKVYYREDLSPSVTQLAAECVQRGFPE-VSFKPSSAVVVTWESVAPYQGPSKDPTlegkrNTFQCILA 184
Cdd:smart00539    1 PFWADADT-EGTGKVYYRETTDHAILDRATESVREGFTDmGGFRAKSVVIVTWENVAAYGSQSSDGT-----NTFQAVLA 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   185 SSDSSTYAIFLYPEDGLQFYTTFSKkeENQVPAVVAFSQGLvgliwkSDGAYNIFANDRESIGNLAKSSNSGLQGIWVFE 264
Cdd:smart00539   75 TDGSRTYAIFLYPSLGWTSDTTAGG--DDGVRARAGFNGGD------GTFSYTLPASGEENIKNLAEGSNVGIPGRWMFR 146

                    ....*
gi 2009632981   265 IGSPA 269
Cdd:smart00539  147 VDGAE 151
Thyroglobulin_1 pfam00086
Thyroglobulin type-1 repeat; Thyroglobulin type 1 repeats are thought to be involved in the ...
847-914 1.19e-27

Thyroglobulin type-1 repeat; Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteine 1 pairs with 2, 3 with 4, and 5 with 6.


Pssm-ID: 459665  Cd Length: 66  Bit Score: 106.62  E-value: 1.19e-27
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2009632981  847 CQLERERIVGTADSPRPqPPGLFVPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGSGMrPPC 914
Cdd:pfam00086    1 CERERARALEQAASGRP-ASGLYIPNCDEDGFYKPVQCHGSTGYCWCVDPEGQEIPGTRTRGGD-PDC 66
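
The description of this hit states that the six conserved cysteines pair 1-2, 3-4 and 5-6. As a rough sketch, the cysteines in the aligned query row can be mapped back to protein coordinates and paired in sequence order. This only applies the stated pairing rule to the sequence; it is not a structural verification. The function names are hypothetical, and the treatment of lowercase letters (query residues not aligned to the model, which still consume a residue number) is an assumption about the alignment notation.

```python
def cysteine_positions(aligned_query, start):
    """Return protein coordinates of cysteines in an aligned query row.

    Gap characters ('-') do not consume a residue number; lowercase
    letters are assumed to be real residues and do consume one.
    """
    positions = []
    pos = start
    for ch in aligned_query:
        if ch == "-":
            continue
        if ch.upper() == "C":
            positions.append(pos)
        pos += 1
    return positions


def pair_consecutive(positions):
    """Pair cysteines 1-2, 3-4, 5-6, ... in order of appearance."""
    return list(zip(positions[0::2], positions[1::2]))


# Query row of the Thyroglobulin_1 (pfam00086) hit, interval 847-914.
query_row = "CQLERERIVGTADSPRPqPPGLFVPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGSGMrPPC"
cys = cysteine_positions(query_row, 847)
pairs = pair_consecutive(cys)
print(cys)    # [847, 873, 884, 891, 893, 914]
print(pairs)  # [(847, 873), (884, 891), (893, 914)]
```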
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1009-1050 2.10e-13

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 65.32  E-value: 2.10e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2009632981  1009 TIIRQDLGSPEGIALDHLGRNIFWTDSQLDRIEVAKLDGTQR 1050
Cdd:smart00135    2 TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
809-837 2.97e-11

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 59.15  E-value: 2.97e-11
                           10        20
                   ....*....|....*....|....*....
gi 2009632981  809 CHPDAFCYNTPGSFVCRCKYGYQGDGFHC 837
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
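
This hit aligns the query to model positions 8-36 of a 36-residue model, so it covers only part of the EGF_3 family model. A short sketch of how percent identity and model coverage could be computed from the two aligned rows is given below. The helper names are hypothetical, and skipping any column that contains a gap is one convention among several.

```python
def alignment_identity(query_row, model_row):
    """Count identical residues over columns where neither row has a gap."""
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == m.upper():
            identical += 1
    return identical, aligned


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the aligned model interval."""
    return (model_end - model_start + 1) / cd_length


# Rows of the EGF_3 (pfam12947) hit at query positions 809-837.
ident, aligned = alignment_identity(
    "CHPDAFCYNTPGSFVCRCKYGYQGDGFHC",
    "CHPNATCTNTGGSFTCTCNDGYTGDGVTC",
)
coverage = model_coverage(8, 36, 36)
print(ident, aligned)      # 18 29
print(round(coverage, 2))  # 0.81
```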
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1051-1095 3.34e-11

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 59.15  E-value: 3.34e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 2009632981  1051 RVLFETDLVNPRGIVTDSMRGNLYWTDWNRDnpKIETSYMDGTNR 1095
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLD--VIEVANLDGTNR 43
Ldl_recept_b pfam00058
Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif ...
987-1025 2.19e-09

Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.


Pssm-ID: 459654  Cd Length: 42  Bit Score: 54.09  E-value: 2.19e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2009632981  987 IYWTDISQ-PSIGRASLHGGEPTTIIRQDLGSPEGIALDH 1025
Cdd:pfam00058    3 LYWTDSSLrASISSADLNGSDRKTLFTDDLQHPNAIAVDP 42
FXa_inhibition pfam14670
Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is ...
1207-1231 2.69e-09

Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is found to be the target for a potent inhibitor of coagulation, TAK-442.


Pssm-ID: 464251 [Multi-domain]  Cd Length: 36  Bit Score: 53.40  E-value: 2.69e-09
                           10        20
                   ....*....|....*....|....*
gi 2009632981 1207 CSVNNGGCTHLCLATPGSRTCRCPD 1231
Cdd:pfam14670    1 CSVNNGGCSHLCLNTPGGYTCSCPE 25
EGF_CA smart00179
Calcium-binding EGF-like domain;
708-741 1.51e-08

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 51.48  E-value: 1.51e-08
                            10        20        30
                    ....*....|....*....|....*....|....
gi 2009632981   708 DIDECSEqPSVCGNHAVCNNHPGTFRCECMEGYR 741
Cdd:smart00179    1 DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGYT 33
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
388-423 4.74e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.90  E-value: 4.74e-08
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2009632981  388 CASNRHRCSVHAQCRDFATGFCCRCAAGYTGNGRQC 423
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
670-706 2.38e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.98  E-value: 2.38e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2009632981  670 CYIGSHGCDANAACRPGPGaQFTCECSIGFRGDGRAC 706
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
760-798 3.70e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.59  E-value: 3.70e-07
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2009632981  760 CETGLHDCDipQRAQCKFMGhGSYTCICLPGFSGDGRAC 798
Cdd:pfam12947    1 CSDNNGGCH--PNATCTNTG-GSFTCTCNDGYTGDGVTC 36
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
965-1006 1.23e-06

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 46.06  E-value: 1.23e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2009632981   965 KTLLHAPDKVIIGLAFDCVDKMIYWTDISQPSIGRASLHGGE 1006
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1096-1132 1.40e-05

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 43.36  E-value: 1.40e-05
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 2009632981  1096 RILVQDDLGLPNGLTFDAYSSQLCWVDAGTHRAECLN 1132
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVAN 37
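
The Ldl_recept_b description above notes that YWTD repeats occur as multiple tandem repeats. One quick way to check that against this listing is to sort the four LY (smart00135) intervals reported above and measure the residues left between neighbouring hits. The sketch below does that; `gaps_between` is a hypothetical helper, and a gap of 0 means the next hit starts on the residue right after the previous one ends.

```python
def gaps_between(intervals):
    """Sort (start, end) intervals and return them with the gaps between neighbours."""
    ordered = sorted(intervals)
    gaps = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        gaps.append(next_start - prev_end - 1)
    return ordered, gaps


# Intervals of the four LY hits, in the order they appear in the listing.
ly_hits = [(1009, 1050), (1051, 1095), (965, 1006), (1096, 1132)]
ordered, gaps = gaps_between(ly_hits)
print(ordered)  # [(965, 1006), (1009, 1050), (1051, 1095), (1096, 1132)]
print(gaps)     # [2, 0, 0]
```

With at most two residues between neighbours, the four hits form a contiguous run from 965 to 1132, which is consistent with a tandem arrangement.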
 
Name Accession Description Interval E-value
nidG2 cd00255
Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an ...
427-658 8.40e-116

Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an extracellular sheet-like matrix. Nidogen is a multifunctional protein that interacts with many other basement membrane proteins, such as collagen, perlecan, and laminin, and has a potential role in the assembly and connection of networks. Nidogen consists of 3 globular domains (G1-G3); G3 is the laminin-binding domain, while G2 binds collagen IV and perlecan. Also found in hemicentin, a protein that functions at various cell-cell and cell-matrix junctions and might assist in refining broad regions of cell contact into oriented, line-shaped junctions. Nidogen G2 consists of an N-terminal EGF-like domain (excluded from this alignment model) and an 11-stranded beta-barrel with a central helix, a topology that exhibits high structural similarity to the green fluorescent proteins of Cnidaria.


Pssm-ID: 238158  Cd Length: 224  Bit Score: 359.70  E-value: 8.40e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  427 GSPQRVNGKVKGRIFVGdsQVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPIGGIIGWMFAVEQDGFKNG 506
Cdd:cd00255      1 GIPQRVNGKVSGNINVG--QSPVEFGDADLHSYVVTSDGRAYTAISNIPESLGPSLRPLAPIGGTIGWLFALEQGGAKNG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  507 FSITGGEFTRQAEVTFVGHPDKLIIKQQFSGIDEHGHLTIDTELEGRVPQIAFGSSVHIEPYTELYHYSRQ-VITSSSTR 585
Cdd:cd00255     79 FSLTGGEFTRQAEVTFYTGGEKLRITQVARGLDSHGHLLLDTVISGRVPQVPAGATVHIEDYTELYHYTGPgVLTSSSTR 158
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2009632981  586 EYTVTEPeqhgtAPSHAHTYQWRQTITFQECVHDDsqPVLPSTQQLSVDSVFVLYNQEERILRYALSNSIGPV 658
Cdd:cd00255    159 EYTVDEG-----GESQTLSYQWNQTITYEECPHDD--EAAPDLQQLLVARIFALYNPEEEILRFAITNSIGPG 224
G2F smart00682
G2 nidogen domain and fibulin;
425-664 8.36e-109

G2 nidogen domain and fibulin;


Pssm-ID: 214774  Cd Length: 227  Bit Score: 340.96  E-value: 8.36e-109
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   425 AEGSPQRVNGKVKGRIFVGdsQVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPIGGIIGWMFAVEQDGFK 504
Cdd:smart00682    1 AEGGPQRVSGSVSGVINVG--EFPVAFENADLHSYVVSSEGRAYTAISNIPSPLGAALRPLVPIGGTIGWLFAKEQGGAV 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   505 NGFSITGGEFTRQAEVTFvGHPDKLIIKQQFSGIDEHGHLTIDTELEGRVPQIAFGSSVHIEPYTELYHYSRQ-VITSSS 583
Cdd:smart00682   79 NGFQLTGGVFTRETEVTF-AGGEILRIKQTFSGLDEHGYLKVKIEVSGRVPQVAAGAEVTIPDYTEEYTYTGPgVLTTSS 157
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   584 TREYTVtepeqhgtaPSHAHTYQWRQTITFQECVHDDSQPvlPSTQQLSVDSVFVLYNQEERILRYALSNSIGPVRDGSP 663
Cdd:smart00682  158 TREYTV---------DNQTHSYTVDQTITFEECQHRDAFP--PTTQQLHVSSVFVDYNDEERVLRFAAHNSVGPGDESNQ 226

                    .
gi 2009632981   664 D 664
Cdd:smart00682  227 C 227
G2F pfam07474
G2F domain; Nidogen, an invariant component of basement membranes, is a multifunctional ...
427-620 2.07e-87

G2F domain; Nidogen, an invariant component of basement membranes, is a multifunctional protein that interacts with most other major basement membrane proteins. The G2 fragment (or G2F domain) contains binding sites for collagen IV and perlecan. The structure is composed of an 11-stranded beta-barrel with a central helix. This domain is structurally related to that of green fluorescent protein (pfam01353). A large surface patch on the beta-barrel is conserved in all metazoan nidogens.


Pssm-ID: 462175  Cd Length: 184  Bit Score: 281.02  E-value: 2.07e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  427 GSPQRVNGKVKGRIfvgdsqVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPIGGIIGWMFAVEQDGFKNG 506
Cdd:pfam07474    1 GVPQRVNGKVSGTI------NGVEFGDADLHAYVVTNDGRAYTAISNIPPSLGPLLQLLSSIGGPIGWLFALEQGGAKNG 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  507 FSITGGEFTRQAEVTFVGHPDKLIIKQQFSGIDEHGHLTIDTELEGRVPQIAFGSSVHIEPYTELYHYSRQ-VITSSSTR 585
Cdd:pfam07474   75 FSLTGGVFNRTAEVTFPPTGERLTITQEFRGLDEDGHLVVDTVISGTVPQVPAGSTVIIEDYTELYQYTGPgELTSSSTR 154
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 2009632981  586 EYTVTEPEQHGTapshaHTYQWRQTITFQECVHDD 620
Cdd:pfam07474  155 TYTVDGEGNTRT-----ISYTVNQTITYQECRHAE 184
NIDO smart00539
Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins;
106-269 1.93e-44

Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins;


Pssm-ID: 214712  Cd Length: 152  Bit Score: 157.97  E-value: 1.93e-44
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   106 PFLADLDTtDGLGKVYYREDLSPSVTQLAAECVQRGFPE-VSFKPSSAVVVTWESVAPYQGPSKDPTlegkrNTFQCILA 184
Cdd:smart00539    1 PFWADADT-EGTGKVYYRETTDHAILDRATESVREGFTDmGGFRAKSVVIVTWENVAAYGSQSSDGT-----NTFQAVLA 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   185 SSDSSTYAIFLYPEDGLQFYTTFSKkeENQVPAVVAFSQGLvgliwkSDGAYNIFANDRESIGNLAKSSNSGLQGIWVFE 264
Cdd:smart00539   75 TDGSRTYAIFLYPSLGWTSDTTAGG--DDGVRARAGFNGGD------GTFSYTLPASGEENIKNLAEGSNVGIPGRWMFR 146

                    ....*
gi 2009632981   265 IGSPA 269
Cdd:smart00539  147 VDGAE 151
NIDO pfam06119
Nidogen-like; This is a nidogen-like domain (NIDO) domain and is an extracellular domain found ...
176-267 1.09e-27

Nidogen-like; This is a nidogen-like domain (NIDO) domain and is an extracellular domain found in nidogen and hypothetical proteins of unknown function.


Pssm-ID: 461833  Cd Length: 90  Bit Score: 107.76  E-value: 1.09e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  176 RNTFQCILASSDSSTYAIFLYPEDGLQFYTTFSKKEEN---QVPAVVAFSQGLvgliwKSDGAYNIFANDRESIGNLAKS 252
Cdd:pfam06119    1 TNTFQAVLATDGSGSFAIFNYPDGGIQWTTGKASGGTNglgGTPAQAGFSAGD-----GDGRYYELPGSGTDSIRNLTET 75
                           90
                   ....*....|....*
gi 2009632981  253 SNSGLQGIWVFEIGS 267
Cdd:pfam06119   76 SNVGVPGRWVFRIDS 90
Thyroglobulin_1 pfam00086
Thyroglobulin type-1 repeat; Thyroglobulin type 1 repeats are thought to be involved in the ...
847-914 1.19e-27

Thyroglobulin type-1 repeat; Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteine 1 pairs with 2, 3 with 4, and 5 with 6.


Pssm-ID: 459665  Cd Length: 66  Bit Score: 106.62  E-value: 1.19e-27
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2009632981  847 CQLERERIVGTADSPRPqPPGLFVPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGSGMrPPC 914
Cdd:pfam00086    1 CERERARALEQAASGRP-ASGLYIPNCDEDGFYKPVQCHGSTGYCWCVDPEGQEIPGTRTRGGD-PDC 66
TY cd00191
Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 ...
847-914 1.65e-23

Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 repeats. TY repeats are proposed to be inhibitors of cysteine proteases.


Pssm-ID: 238114  Cd Length: 66  Bit Score: 94.84  E-value: 1.65e-23
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2009632981  847 CQLERERIvgTADSPRPQPPGLFVPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGSGmRPPC 914
Cdd:cd00191      2 CERERASA--LESLAGPKLSGLYVPQCDEDGNYEPVQCHGSTGYCWCVDPDGEEIPGTRTRGG-PPNC 66
TY smart00211
Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 ...
870-916 1.10e-15

Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 repeats. TY repeats are proposed to be inhibitors of cysteine proteases and binding partners of heparin.


Pssm-ID: 214561  Cd Length: 46  Bit Score: 72.03  E-value: 1.10e-15
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 2009632981   870 VPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGsGMRPPCLS 916
Cdd:smart00211    1 IPQCDEDGNYEPVQCDGSSGQCWCVDATGREIPGTRTE-GGDPDCPS 46
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1009-1050 2.10e-13

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 65.32  E-value: 2.10e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2009632981  1009 TIIRQDLGSPEGIALDHLGRNIFWTDSQLDRIEVAKLDGTQR 1050
Cdd:smart00135    2 TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
809-837 2.97e-11

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 59.15  E-value: 2.97e-11
                           10        20
                   ....*....|....*....|....*....
gi 2009632981  809 CHPDAFCYNTPGSFVCRCKYGYQGDGFHC 837
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1051-1095 3.34e-11

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 59.15  E-value: 3.34e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 2009632981  1051 RVLFETDLVNPRGIVTDSMRGNLYWTDWNRDnpKIETSYMDGTNR 1095
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLD--VIEVANLDGTNR 43
Ldl_recept_b pfam00058
Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif ...
987-1025 2.19e-09

Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.


Pssm-ID: 459654  Cd Length: 42  Bit Score: 54.09  E-value: 2.19e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2009632981  987 IYWTDISQ-PSIGRASLHGGEPTTIIRQDLGSPEGIALDH 1025
Cdd:pfam00058    3 LYWTDSSLrASISSADLNGSDRKTLFTDDLQHPNAIAVDP 42
FXa_inhibition pfam14670
Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is ...
1207-1231 2.69e-09

Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is found to be the target for a potent inhibitor of coagulation, TAK-442.


Pssm-ID: 464251 [Multi-domain]  Cd Length: 36  Bit Score: 53.40  E-value: 2.69e-09
                           10        20
                   ....*....|....*....|....*
gi 2009632981 1207 CSVNNGGCTHLCLATPGSRTCRCPD 1231
Cdd:pfam14670    1 CSVNNGGCSHLCLNTPGGYTCSCPE 25
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
944-1172 3.10e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 59.26  E-value: 3.10e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  944 AQTGKIERLPLEGstmtkseaktllhaPDKVIIGLAFDcVDKMIYWTDISQPSIGRASLHGGEPTTI-IRQDLGSPEGIA 1022
Cdd:COG4257     87 PKTGEITTFALPG--------------GGSNPHGIAFD-PDGNLWFTDQGGNRIGRLDPATGEVTEFpLPTGGAGPYGIA 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1023 LDHLGrNIFWTDSQLDRIevAKLD---GTQRRVLFETDLVNPRGIVTDSmRGNLYWTDWNRD-----NPKietsymDGTN 1094
Cdd:COG4257    152 VDPDG-NLWVTDFGANAI--GRIDpdtGTLTEYALPTPGAGPRGLAVDP-DGNLWVADTGSGrigrfDPK------TGTV 221
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2009632981 1095 RRILVQDDLGLPNGLTFDAYssqlcwvdagthraeclnpgqpnrrkvleglqypfavtsfgKNLYYTDWKTNSVVAVD 1172
Cdd:COG4257    222 TEYPLPGGGARPYGVAVDGD-----------------------------------------GRVWFAESGANRIVRFD 258
EGF_CA smart00179
Calcium-binding EGF-like domain;
708-741 1.51e-08

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 51.48  E-value: 1.51e-08
                            10        20        30
                    ....*....|....*....|....*....|....
gi 2009632981   708 DIDECSEqPSVCGNHAVCNNHPGTFRCECMEGYR 741
Cdd:smart00179    1 DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGYT 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
708-741 4.49e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 4.49e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2009632981  708 DIDECSEqPSVCGNHAVCNNHPGTFRCECMEGYR 741
Cdd:cd00054      1 DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPGYT 33
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
388-423 4.74e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.90  E-value: 4.74e-08
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2009632981  388 CASNRHRCSVHAQCRDFATGFCCRCAAGYTGNGRQC 423
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
Ldl_recept_b pfam00058
Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif ...
1028-1067 9.31e-08

Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.


Pssm-ID: 459654  Cd Length: 42  Bit Score: 49.47  E-value: 9.31e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2009632981 1028 RNIFWTDSQLD-RIEVAKLDGTQRRVLFETDLVNPRGIVTD 1067
Cdd:pfam00058    1 GRLYWTDSSLRaSISSADLNGSDRKTLFTDDLQHPNAIAVD 41
Ldl_recept_b pfam00058
Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif ...
1071-1112 9.49e-08

Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.


Pssm-ID: 459654  Cd Length: 42  Bit Score: 49.47  E-value: 9.49e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 2009632981 1071 GNLYWTDWNRDnPKIETSYMDGTNRRILVQDDLGLPNGLTFD 1112
Cdd:pfam00058    1 GRLYWTDSSLR-ASISSADLNGSDRKTLFTDDLQHPNAIAVD 41
EGF_CA smart00179
Calcium-binding EGF-like domain;
800-838 1.49e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 48.78  E-value: 1.49e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2009632981   800 DVNECQ-PSRCHPDAFCYNTPGSFVCRCKYGYQgDGFHCV 838
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYT-DGRNCE 39
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
670-706 2.38e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.98  E-value: 2.38e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2009632981  670 CYIGSHGCDANAACRPGPGaQFTCECSIGFRGDGRAC 706
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
760-798 3.70e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.59  E-value: 3.70e-07
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2009632981  760 CETGLHDCDipQRAQCKFMGhGSYTCICLPGFSGDGRAC 798
Cdd:pfam12947    1 CSDNNGGCH--PNATCTNTG-GSFTCTCNDGYTGDGVTC 36
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
965-1006 1.23e-06

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 46.06  E-value: 1.23e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2009632981   965 KTLLHAPDKVIIGLAFDCVDKMIYWTDISQPSIGRASLHGGE 1006
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
800-838 1.33e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.09  E-value: 1.33e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2009632981  800 DVNECQ-PSRCHPDAFCYNTPGSFVCRCKYGYQGDgfHCV 838
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGR--NCE 38
EGF_CA pfam07645
Calcium-binding EGF domain;
708-739 3.95e-06

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 44.54  E-value: 3.95e-06
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2009632981  708 DIDECSEQPSVCGNHAVCNNHPGTFRCECMEG 739
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFECRCPDG 32
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1096-1132 1.40e-05

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 43.36  E-value: 1.40e-05
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 2009632981  1096 RILVQDDLGLPNGLTFDAYSSQLCWVDAGTHRAECLN 1132
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVAN 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
388-424 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.46  E-value: 2.86e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2009632981  388 CASNrHRCSVHAQCRDFATGFCCRCAAGYTgnGRQCV 424
Cdd:cd00054      5 CASG-NPCQNGGTCVNTVGSYRCSCPPGYT--GRNCE 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
757-799 6.99e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 35.69  E-value: 6.99e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 2009632981   757 INHCETGlHDCDipQRAQCkFMGHGSYTCICLPGFSgDGRACQ 799
Cdd:smart00179    2 IDECASG-NPCQ--NGGTC-VNTVGSYRCECPPGYT-DGRNCE 39
 
Name Accession Description Interval E-value
nidG2 cd00255
Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an ...
427-658 8.40e-116

Nidogen, G2 domain; Nidogen is an important component of the basement membrane, an extracellular sheet-like matrix. Nidogen is a multifunctional protein that interacts with many other basement membrane proteins, such as collagen, perlecan, and laminin, and has a potential role in the assembly and connection of networks. Nidogen consists of 3 globular domains (G1-G3); G3 is the laminin-binding domain, while G2 binds collagen IV and perlecan. The G2 domain is also found in hemicentin, a protein which functions at various cell-cell and cell-matrix junctions and might assist in refining broad regions of cell contact into oriented, line-shaped junctions. Nidogen G2 consists of an N-terminal EGF-like domain (excluded from this alignment model) and an 11-stranded beta-barrel with a central helix, a topology that exhibits high structural similarity to the green fluorescent proteins of Cnidaria.


Pssm-ID: 238158  Cd Length: 224  Bit Score: 359.70  E-value: 8.40e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  427 GSPQRVNGKVKGRIFVGdsQVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPIGGIIGWMFAVEQDGFKNG 506
Cdd:cd00255      1 GIPQRVNGKVSGNINVG--QSPVEFGDADLHSYVVTSDGRAYTAISNIPESLGPSLRPLAPIGGTIGWLFALEQGGAKNG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  507 FSITGGEFTRQAEVTFVGHPDKLIIKQQFSGIDEHGHLTIDTELEGRVPQIAFGSSVHIEPYTELYHYSRQ-VITSSSTR 585
Cdd:cd00255     79 FSLTGGEFTRQAEVTFYTGGEKLRITQVARGLDSHGHLLLDTVISGRVPQVPAGATVHIEDYTELYHYTGPgVLTSSSTR 158
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2009632981  586 EYTVTEPeqhgtAPSHAHTYQWRQTITFQECVHDDsqPVLPSTQQLSVDSVFVLYNQEERILRYALSNSIGPV 658
Cdd:cd00255    159 EYTVDEG-----GESQTLSYQWNQTITYEECPHDD--EAAPDLQQLLVARIFALYNPEEEILRFAITNSIGPG 224
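Each hit on this page carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) and an interval on the query. As a minimal sketch, assuming the line layout shown above stays as it appears here, the following Python reads those fields and computes the query span; the names `parse_stats` and `interval_length` are hypothetical helpers, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears on this page.
# The "[Multi-domain]" tag is optional and only present on some hits.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_stats(line):
    """Return the fields of one statistics line as a dict (hypothetical helper)."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError("not a statistics line: %r" % line)
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def interval_length(interval):
    """Number of query residues covered by an interval such as '427-658'."""
    start, end = (int(x) for x in interval.split("-"))
    return end - start + 1


stats = parse_stats(
    "Pssm-ID: 238158  Cd Length: 224  Bit Score: 359.70  E-value: 8.40e-116"
)
print(stats, interval_length("427-658"))
```

For the nidG2 hit the query interval covers more residues than the model length, which is consistent with the insertions (lowercase residues) visible in the query row of the alignment.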
G2F smart00682
G2 nidogen domain and fibulin;
425-664 8.36e-109

G2 nidogen domain and fibulin;


Pssm-ID: 214774  Cd Length: 227  Bit Score: 340.96  E-value: 8.36e-109
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   425 AEGSPQRVNGKVKGRIFVGdsQVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPIGGIIGWMFAVEQDGFK 504
Cdd:smart00682    1 AEGGPQRVSGSVSGVINVG--EFPVAFENADLHSYVVSSEGRAYTAISNIPSPLGAALRPLVPIGGTIGWLFAKEQGGAV 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   505 NGFSITGGEFTRQAEVTFvGHPDKLIIKQQFSGIDEHGHLTIDTELEGRVPQIAFGSSVHIEPYTELYHYSRQ-VITSSS 583
Cdd:smart00682   79 NGFQLTGGVFTRETEVTF-AGGEILRIKQTFSGLDEHGYLKVKIEVSGRVPQVAAGAEVTIPDYTEEYTYTGPgVLTTSS 157
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   584 TREYTVtepeqhgtaPSHAHTYQWRQTITFQECVHDDSQPvlPSTQQLSVDSVFVLYNQEERILRYALSNSIGPVRDGSP 663
Cdd:smart00682  158 TREYTV---------DNQTHSYTVDQTITFEECQHRDAFP--PTTQQLHVSSVFVDYNDEERVLRFAAHNSVGPGDESNQ 226

                    .
gi 2009632981   664 D 664
Cdd:smart00682  227 C 227
G2F pfam07474
G2F domain; Nidogen, an invariant component of basement membranes, is a multifunctional ...
427-620 2.07e-87

G2F domain; Nidogen, an invariant component of basement membranes, is a multifunctional protein that interacts with most other major basement membrane proteins. The G2 fragment (or G2F domain) contains binding sites for collagen IV and perlecan. The structure is composed of an 11-stranded beta-barrel with a central helix. This domain is structurally related to that of green fluorescent protein (pfam01353). A large surface patch on the beta-barrel is conserved in all metazoan nidogens.


Pssm-ID: 462175  Cd Length: 184  Bit Score: 281.02  E-value: 2.07e-87
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  427 GSPQRVNGKVKGRIfvgdsqVPIVFENTDLHSYVVMNHGRSYTAISTIPETVGYSLLPLAPIGGIIGWMFAVEQDGFKNG 506
Cdd:pfam07474    1 GVPQRVNGKVSGTI------NGVEFGDADLHAYVVTNDGRAYTAISNIPPSLGPLLQLLSSIGGPIGWLFALEQGGAKNG 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  507 FSITGGEFTRQAEVTFVGHPDKLIIKQQFSGIDEHGHLTIDTELEGRVPQIAFGSSVHIEPYTELYHYSRQ-VITSSSTR 585
Cdd:pfam07474   75 FSLTGGVFNRTAEVTFPPTGERLTITQEFRGLDEDGHLVVDTVISGTVPQVPAGSTVIIEDYTELYQYTGPgELTSSSTR 154
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 2009632981  586 EYTVTEPEQHGTapshaHTYQWRQTITFQECVHDD 620
Cdd:pfam07474  155 TYTVDGEGNTRT-----ISYTVNQTITYQECRHAE 184
NIDO smart00539
Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins;
106-269 1.93e-44

Extracellular domain of unknown function in nidogen (entactin) and hypothetical proteins;


Pssm-ID: 214712  Cd Length: 152  Bit Score: 157.97  E-value: 1.93e-44
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   106 PFLADLDTtDGLGKVYYREDLSPSVTQLAAECVQRGFPE-VSFKPSSAVVVTWESVAPYQGPSKDPTlegkrNTFQCILA 184
Cdd:smart00539    1 PFWADADT-EGTGKVYYRETTDHAILDRATESVREGFTDmGGFRAKSVVIVTWENVAAYGSQSSDGT-----NTFQAVLA 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981   185 SSDSSTYAIFLYPEDGLQFYTTFSKkeENQVPAVVAFSQGLvgliwkSDGAYNIFANDRESIGNLAKSSNSGLQGIWVFE 264
Cdd:smart00539   75 TDGSRTYAIFLYPSLGWTSDTTAGG--DDGVRARAGFNGGD------GTFSYTLPASGEENIKNLAEGSNVGIPGRWMFR 146

                    ....*
gi 2009632981   265 IGSPA 269
Cdd:smart00539  147 VDGAE 151
NIDO pfam06119
Nidogen-like; The nidogen-like (NIDO) domain is an extracellular domain found ...
176-267 1.09e-27

Nidogen-like; The nidogen-like (NIDO) domain is an extracellular domain found in nidogen and in hypothetical proteins of unknown function.


Pssm-ID: 461833  Cd Length: 90  Bit Score: 107.76  E-value: 1.09e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  176 RNTFQCILASSDSSTYAIFLYPEDGLQFYTTFSKKEEN---QVPAVVAFSQGLvgliwKSDGAYNIFANDRESIGNLAKS 252
Cdd:pfam06119    1 TNTFQAVLATDGSGSFAIFNYPDGGIQWTTGKASGGTNglgGTPAQAGFSAGD-----GDGRYYELPGSGTDSIRNLTET 75
                           90
                   ....*....|....*
gi 2009632981  253 SNSGLQGIWVFEIGS 267
Cdd:pfam06119   76 SNVGVPGRWVFRIDS 90
Thyroglobulin_1 pfam00086
Thyroglobulin type-1 repeat; Thyroglobulin type 1 repeats are thought to be involved in the ...
847-914 1.19e-27

Thyroglobulin type-1 repeat; Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines, which form three disulphide bridges: cysteine 1 pairs with 2, 3 with 4, and 5 with 6.


Pssm-ID: 459665  Cd Length: 66  Bit Score: 106.62  E-value: 1.19e-27
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2009632981  847 CQLERERIVGTADSPRPqPPGLFVPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGSGMrPPC 914
Cdd:pfam00086    1 CERERARALEQAASGRP-ASGLYIPNCDEDGFYKPVQCHGSTGYCWCVDPEGQEIPGTRTRGGD-PDC 66
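The pairing rule in the description above (cysteine 1 with 2, 3 with 4, 5 with 6) can be applied to the query segment of this hit. The sketch below is illustrative only: it locates the cysteines in residues 847-914 as printed in the alignment and pairs them in order. The function name `cysteine_pairs` is hypothetical, and the resulting pairs follow the family-level rule rather than any structure determined for this protein.

```python
def cysteine_pairs(segment, start):
    """Pair cysteines in sequence order (1-2, 3-4, 5-6).

    segment: query row of the alignment; lowercase letters are unaligned
             residues that still count, '-' marks a gap that does not.
    start:   residue number of the first residue in the segment.
    """
    residues = segment.replace("-", "")
    positions = [start + i for i, aa in enumerate(residues) if aa in "Cc"]
    # Consecutive cysteines are paired, as the domain description states.
    return list(zip(positions[0::2], positions[1::2]))


# Query row of the pfam00086 hit, residues 847-914, copied from the alignment.
query = "CQLERERIVGTADSPRPqPPGLFVPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGSGMrPPC"
pairs = cysteine_pairs(query, 847)
print(pairs)
```

The segment contains six cysteines, matching the "six conserved cysteines" the family description expects.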
TY cd00191
Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 ...
847-914 1.65e-23

Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 repeats. TY repeats are proposed to be inhibitors of cysteine proteases.


Pssm-ID: 238114  Cd Length: 66  Bit Score: 94.84  E-value: 1.65e-23
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2009632981  847 CQLERERIvgTADSPRPQPPGLFVPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGSGmRPPC 914
Cdd:cd00191      2 CERERASA--LESLAGPKLSGLYVPQCDEDGNYEPVQCHGSTGYCWCVDPDGEEIPGTRTRGG-PPNC 66
TY smart00211
Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 ...
870-916 1.10e-15

Thyroglobulin type I repeats; The N-terminal region of human thyroglobulin contains 11 type-1 repeats. TY repeats are proposed to be inhibitors of cysteine proteases and binding partners of heparin.


Pssm-ID: 214561  Cd Length: 46  Bit Score: 72.03  E-value: 1.10e-15
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 2009632981   870 VPECDEQGHYMPTQCHSSTGYCWCVDRDGQEVEGTRTGsGMRPPCLS 916
Cdd:smart00211    1 IPQCDEDGNYEPVQCDGSSGQCWCVDATGREIPGTRTE-GGDPDCPS 46
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1009-1050 2.10e-13

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 65.32  E-value: 2.10e-13
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2009632981  1009 TIIRQDLGSPEGIALDHLGRNIFWTDSQLDRIEVAKLDGTQR 1050
Cdd:smart00135    2 TLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
809-837 2.97e-11

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 59.15  E-value: 2.97e-11
                           10        20
                   ....*....|....*....|....*....
gi 2009632981  809 CHPDAFCYNTPGSFVCRCKYGYQGDGFHC 837
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1051-1095 3.34e-11

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 59.15  E-value: 3.34e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*
gi 2009632981  1051 RVLFETDLVNPRGIVTDSMRGNLYWTDWNRDnpKIETSYMDGTNR 1095
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLD--VIEVANLDGTNR 43
Ldl_recept_b pfam00058
Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif ...
987-1025 2.19e-09

Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.


Pssm-ID: 459654  Cd Length: 42  Bit Score: 54.09  E-value: 2.19e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2009632981  987 IYWTDISQ-PSIGRASLHGGEPTTIIRQDLGSPEGIALDH 1025
Cdd:pfam00058    3 LYWTDSSLrASISSADLNGSDRKTLFTDDLQHPNAIAVDP 42
FXa_inhibition pfam14670
Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is ...
1207-1231 2.69e-09

Coagulation Factor Xa inhibitory site; This short domain on coagulation enzyme factor Xa is found to be the target for a potent inhibitor of coagulation, TAK-442.


Pssm-ID: 464251 [Multi-domain]  Cd Length: 36  Bit Score: 53.40  E-value: 2.69e-09
                           10        20
                   ....*....|....*....|....*
gi 2009632981 1207 CSVNNGGCTHLCLATPGSRTCRCPD 1231
Cdd:pfam14670    1 CSVNNGGCSHLCLNTPGGYTCSCPE 25
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
944-1172 3.10e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 59.26  E-value: 3.10e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  944 AQTGKIERLPLEGstmtkseaktllhaPDKVIIGLAFDcVDKMIYWTDISQPSIGRASLHGGEPTTI-IRQDLGSPEGIA 1022
Cdd:COG4257     87 PKTGEITTFALPG--------------GGSNPHGIAFD-PDGNLWFTDQGGNRIGRLDPATGEVTEFpLPTGGAGPYGIA 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1023 LDHLGrNIFWTDSQLDRIevAKLD---GTQRRVLFETDLVNPRGIVTDSmRGNLYWTDWNRD-----NPKietsymDGTN 1094
Cdd:COG4257    152 VDPDG-NLWVTDFGANAI--GRIDpdtGTLTEYALPTPGAGPRGLAVDP-DGNLWVADTGSGrigrfDPK------TGTV 221
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2009632981 1095 RRILVQDDLGLPNGLTFDAYssqlcwvdagthraeclnpgqpnrrkvleglqypfavtsfgKNLYYTDWKTNSVVAVD 1172
Cdd:COG4257    222 TEYPLPGGGARPYGVAVDGD-----------------------------------------GRVWFAESGANRIVRFD 258
EGF_CA smart00179
Calcium-binding EGF-like domain;
708-741 1.51e-08

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 51.48  E-value: 1.51e-08
                            10        20        30
                    ....*....|....*....|....*....|....
gi 2009632981   708 DIDECSEqPSVCGNHAVCNNHPGTFRCECMEGYR 741
Cdd:smart00179    1 DIDECAS-GNPCQNGGTCVNTVGSYRCECPPGYT 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
708-741 4.49e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 4.49e-08
                           10        20        30
                   ....*....|....*....|....*....|....
gi 2009632981  708 DIDECSEqPSVCGNHAVCNNHPGTFRCECMEGYR 741
Cdd:cd00054      1 DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPGYT 33
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
388-423 4.74e-08

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.90  E-value: 4.74e-08
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 2009632981  388 CASNRHRCSVHAQCRDFATGFCCRCAAGYTGNGRQC 423
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
909-1128 7.45e-08

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 54.70  E-value: 7.45e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  909 GMRPPCLSTVAPPVHFGPPVLTAVIPPPPGTHLLFAQTGKIERLPLEGSTMTKSEAKTLLHAPDKVIIGLAFDCVDKMIY 988
Cdd:COG3391      4 ASSLLVAVLLAVLALAALAVAVAALGLGGGGPLLAAASGGVVGAAVGGGGVALLAGLGLGAAAVADADGADAGADGRRLY 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  989 WTDISQPSIGRASLHGGEPTTIIRQDlGSPEGIALDHLGRNIFWTDSQLDRieVAKLDGTQRRVLFETDL-VNPRGIVTD 1067
Cdd:COG3391     84 VANSGSGRVSVIDLATGKVVATIPVG-GGPRGLAVDPDGGRLYVADSGNGR--VSVIDTATGKVVATIPVgAGPHGIAVD 160
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2009632981 1068 SMRGNLYWTDWnrDNPKIET--SYMDGTNRRILVQDDLG-LPNGLTFDAYSSQLCWVDAGTHRA 1128
Cdd:COG3391    161 PDGKRLYVANS--GSNTVSVivSVIDTATGKVVATIPVGgGPVGVAVSPDGRRLYVANRGSNTS 222
Ldl_recept_b pfam00058
Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif ...
1028-1067 9.31e-08

Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.


Pssm-ID: 459654  Cd Length: 42  Bit Score: 49.47  E-value: 9.31e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 2009632981 1028 RNIFWTDSQLD-RIEVAKLDGTQRRVLFETDLVNPRGIVTD 1067
Cdd:pfam00058    1 GRLYWTDSSLRaSISSADLNGSDRKTLFTDDLQHPNAIAVD 41
Ldl_recept_b pfam00058
Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif ...
1071-1112 9.49e-08

Low-density lipoprotein receptor repeat class B; This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.


Pssm-ID: 459654  Cd Length: 42  Bit Score: 49.47  E-value: 9.49e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 2009632981 1071 GNLYWTDWNRDnPKIETSYMDGTNRRILVQDDLGLPNGLTFD 1112
Cdd:pfam00058    1 GRLYWTDSSLR-ASISSADLNGSDRKTLFTDDLQHPNAIAVD 41
EGF_CA smart00179
Calcium-binding EGF-like domain;
800-838 1.49e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 48.78  E-value: 1.49e-07
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 2009632981   800 DVNECQ-PSRCHPDAFCYNTPGSFVCRCKYGYQgDGFHCV 838
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYT-DGRNCE 39
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in ...
1014-1169 1.64e-07

NHL repeat unit of beta-propeller proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 54.25  E-value: 1.64e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1014 DLGSPEGIALDHLGrNIFWTDSQLDRIEVAKLDGTQRRVL-----FETDLVNPRGIVTDSmRGNLYWTDWNRD-----NP 1083
Cdd:cd05819    100 EFNGPRGIAVDSSG-NIYVADTGNHRIQKFDPDGEFLTTFgsggsGPGQFNGPTGVAVDS-DGNIYVADTGNHriqvfDP 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1084 kiETSYMDGTNRRILVQDDLGLPNGLTFDaySSQLCWV-DAGTHRAECLNPGQP------NRRKVLEGLQYPF--AVTSF 1154
Cdd:cd05819    178 --DGNFLTTFGSTGTGPGQFNYPTGIAVD--SDGNIYVaDSGNNRVQVFDPDGAgfggngNFLGSDGQFNRPSglAVDSD 253
                          170
                   ....*....|....*
gi 2009632981 1155 GkNLYYTDWKTNSVV 1169
Cdd:cd05819    254 G-NLYVADTGNNRIQ 267
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
670-706 2.38e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.98  E-value: 2.38e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2009632981  670 CYIGSHGCDANAACRPGPGaQFTCECSIGFRGDGRAC 706
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
760-798 3.70e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.59  E-value: 3.70e-07
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 2009632981  760 CETGLHDCDipQRAQCKFMGhGSYTCICLPGFSGDGRAC 798
Cdd:pfam12947    1 CSDNNGGCH--PNATCTNTG-GSFTCTCNDGYTGDGVTC 36
YvrE COG3386
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ...
984-1172 6.89e-07

Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the Pathway/BioSystem: Non-phosphorylated Entner-Doudoroff pathway


Pssm-ID: 442613 [Multi-domain]  Cd Length: 266  Bit Score: 52.20  E-value: 6.89e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  984 DKMIYWTDISQPSIGRASLHGGEpTTIIRQDLGSPEGIALDHLGRniFWTDSQLDRIEVAKLDGTQRRVL---FETDLVN 1060
Cdd:COG3386     18 DGRLYWVDIPGGRIHRYDPDGGA-VEVFAEPSGRPNGLAFDPDGR--LLVADHGRGLVRFDPADGEVTVLadeYGKPLNR 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1061 PRGIVTDSmRGNLYWTD--WNRDNPKIetSYMDGTNRRILVQDDLGLPNGLTFDAYSSQLCWVDAGTH---RAECLNPGQ 1135
Cdd:COG3386     95 PNDGVVDP-DGRLYFTDmgEYLPTGAL--YRVDPDGSLRVLADGLTFPNGIAFSPDGRTLYVADTGAGriyRFDLDADGT 171
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2009632981 1136 PNRRKVL----EGLQYP--FAVTSFGkNLYYTDWKTNSVVAVD 1172
Cdd:COG3386    172 LGNRRVFadlpDGPGGPdgLAVDADG-NLWVALWGGGGVVRFD 213
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
965-1006 1.23e-06

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 46.06  E-value: 1.23e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|..
gi 2009632981   965 KTLLHAPDKVIIGLAFDCVDKMIYWTDISQPSIGRASLHGGE 1006
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTN 42
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
800-838 1.33e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.09  E-value: 1.33e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 2009632981  800 DVNECQ-PSRCHPDAFCYNTPGSFVCRCKYGYQGDgfHCV 838
Cdd:cd00054      1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGR--NCE 38
EGF_CA pfam07645
Calcium-binding EGF domain;
708-739 3.95e-06

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 44.54  E-value: 3.95e-06
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2009632981  708 DIDECSEQPSVCGNHAVCNNHPGTFRCECMEG 739
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFECRCPDG 32
LY smart00135
Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) ...
1096-1132 1.40e-05

Low-density lipoprotein-receptor YWTD domain; Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.


Pssm-ID: 214531 [Multi-domain]  Cd Length: 43  Bit Score: 43.36  E-value: 1.40e-05
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 2009632981  1096 RILVQDDLGLPNGLTFDAYSSQLCWVDAGTHRAECLN 1132
Cdd:smart00135    1 RTLLSSGLGHPNGLAVDWIEGRLYWTDWGLDVIEVAN 37
YncE COG3391
DNA-binding beta-propeller fold protein YncE [General function prediction only];
987-1177 1.59e-05

DNA-binding beta-propeller fold protein YncE [General function prediction only];


Pssm-ID: 442618 [Multi-domain]  Cd Length: 237  Bit Score: 47.77  E-value: 1.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  987 IYWTDISQPSIGRASLHGGEPTTIIRQDLGSPEGIALDHLGRNIFWTDSQLDRIEVakLDGTQRRVLFETDL-VNPRGIV 1065
Cdd:COG3391     39 AASGGVVGAAVGGGGVALLAGLGLGAAAVADADGADAGADGRRLYVANSGSGRVSV--IDLATGKVVATIPVgGGPRGLA 116
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1066 TDSMRGNLYWTDWNRDNpkieTSYMDGTNRRILVQDDLGL-PNGLTFDAYSSQLCWVDAGTHRA----ECLNPGQPNRRK 1140
Cdd:COG3391    117 VDPDGGRLYVADSGNGR----VSVIDTATGKVVATIPVGAgPHGIAVDPDGKRLYVANSGSNTVsvivSVIDTATGKVVA 192
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2009632981 1141 VLEGLQYPF--AVTSFGKNLYYTDWKTNSVVAVDLAVSK 1177
Cdd:COG3391    193 TIPVGGGPVgvAVSPDGRRLYVANRGSNTSNGGSNTVSV 231
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
984-1172 2.18e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 47.71  E-value: 2.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  984 DKMIYWTDISQPSIGRASLHGGEPTTIIRQDLGSPEGIALDHLGrNIFWTDSQLDRIevAKLD---GTQRRVLFETDLVN 1060
Cdd:COG4257     27 DGAVWFTDQGGGRIGRLDPATGEFTEYPLGGGSGPHGIAVDPDG-NLWFTDNGNNRI--GRIDpktGEITTFALPGGGSN 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1061 PRGIVTDSmRGNLYWTDWNRDnpKIetSYMDGTNRRILVQD---DLGLPNGLTFDAySSQLCWVDAGTHRAECLNP--GQ 1135
Cdd:COG4257    104 PHGIAFDP-DGNLWFTDQGGN--RI--GRLDPATGEVTEFPlptGGAGPYGIAVDP-DGNLWVTDFGANAIGRIDPdtGT 177
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2009632981 1136 PNRRKVLEGLQYPFAVTsFGK--NLYYTDWKTNSVVAVD 1172
Cdd:COG4257    178 LTEYALPTPGAGPRGLA-VDPdgNLWVADTGSGRIGRFD 215
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
803-838 2.50e-05

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines, of which at least one is present in most EGF-like domains; a subset of these domains bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 42.46  E-value: 2.50e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2009632981  803 ECQ-PSRCHPDAFCYNTPGSFVCRCKYGYQGDgFHCV 838
Cdd:cd00053      1 ECAaSNPCSNGGTCVNTPGSYRCVCPPGYTGD-RSCE 36
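The description above says the EGF domain includes six cysteine residues involved in disulfide bonds. As a rough check, the two aligned rows of this hit can be scanned for cysteines. This is only a sketch: the function name is hypothetical, and it assumes that "-" marks a gap and that lowercase letters mark unaligned residues, which are still counted here as real residues.

```python
# Illustrative sketch: count cysteines in the two rows of the cd00053 hit at 803-838.
# Assumption: "-" is a gap; lowercase letters are unaligned residues but still residues.

query_row = "ECQ-PSRCHPDAFCYNTPGSFVCRCKYGYQGDgFHCV"   # gi 2009632981, residues 803-838
model_row = "ECAaSNPCSNGGTCVNTPGSYRCVCPPGYTGD-RSCE"   # Cdd:cd00053, positions 1-36


def count_cysteines(aligned_row):
    """Return the number of cysteine residues in one alignment row."""
    return sum(1 for ch in aligned_row if ch.upper() == "C")


if __name__ == "__main__":
    print("query cysteines:", count_cysteines(query_row))
    print("model cysteines:", count_cysteines(model_row))
```

Both rows contain six cysteines, matching the six-cysteine pattern in the domain description.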
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1014-1112 3.79e-05

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 46.90  E-value: 3.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1014 DLGSPEGIALDHLGrNIFWTDSQLDRIEVAKLDGT-----QRRVLFETDLVNPRGIVTDSmRGNLYWTDwNRDNpKIETS 1088
Cdd:cd14963    146 ELSYPNGIAVDEDG-NIYVADSGNGRIQVFDKNGKfikelNGSPDGKSGFVNPRGIAVDP-DGNLYVVD-NLSH-RVYVF 221
                           90       100
                   ....*....|....*....|....*....
gi 2009632981 1089 YMDGTNRRILVQ--DDLG---LPNGLTFD 1112
Cdd:cd14963    222 DEQGKELFTFGGrgKDDGqfnLPNGLFID 250
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
711-741 4.58e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines, of which at least one is present in most EGF-like domains; a subset of these domains bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 38.61  E-value: 4.58e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 2009632981  711 ECSeQPSVCGNHAVCNNHPGTFRCECMEGYR 741
Cdd:cd00053      1 ECA-ASNPCSNGGTCVNTPGSYRCVCPPGYT 30
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1014-1169 1.11e-03

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 42.19  E-value: 1.11e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1014 DLGSPEGIALDHlGRNIFWTDSQLDRieVAKLDG---TQRRVLFeTDLVNPRGIVTDSMrGNLYWTDwnRDNPKIETSYM 1090
Cdd:cd14952     92 GLNDPTGVAVDA-AGNVYVADTGNNR--VLKLAAgsnTQTVLPF-TGLSNPDGVAVDGA-GNVYVTD--TGNNRVLKLAA 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1091 DGTNRRILVQDDLGLPNGLTFDAySSQLCWVDAGTHRAECLNPGQPNRRKV-LEGLQYP--FAVTSFGkNLYYTDWKTNS 1167
Cdd:cd14952    165 GSTTQTVLPFTGLNSPSGVAVDT-AGNVYVTDHGNNRVLKLAAGSTTPTVLpFTGLNGPlgVAVDAAG-NVYVADRGNDR 242

                   ..
gi 2009632981 1168 VV 1169
Cdd:cd14952    243 VV 244
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1007-1172 1.37e-03

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 41.81  E-value: 1.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1007 PTTIIRQDLGSPEGIALDHLGrNIFWTDSQLDRieVAKL--DGTQRRVLFETDLVNPRGIVTDSmRGNLYWTDWnrdnpk 1084
Cdd:cd14952     43 QTVLPFTGLYQPQGVAVDAAG-TVYVTDFGNNR--VLKLaaGSTTQTVLPFTGLNDPTGVAVDA-AGNVYVADT------ 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981 1085 ietsymdgTNRRILVQ------------DDLGLPNGLTFDAysSQLCWV-DAGTHRAECLNPGQpNRRKVL--EGLQYPF 1149
Cdd:cd14952    113 --------GNNRVLKLaagsntqtvlpfTGLSNPDGVAVDG--AGNVYVtDTGNNRVLKLAAGS-TTQTVLpfTGLNSPS 181
                          170       180
                   ....*....|....*....|....*
gi 2009632981 1150 AVT--SFGkNLYYTDWKTNSVVAVD 1172
Cdd:cd14952    182 GVAvdTAG-NVYVTDHGNNRVLKLA 205
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
712-741 2.38e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.81  E-value: 2.38e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 2009632981  712 CSEQPSVCGNHAVCNNHPGTFRCECMEGYR 741
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYT 30
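The page reports only a bit score and an E-value for each hit. A simple identity count over the two rows can give a feel for how closely the query matches the model consensus. The helper below is a hypothetical sketch, not part of CD-Search. It assumes both rows have equal length, that "-" marks a gap, and that lowercase residues are unaligned; gap and lowercase columns are skipped.

```python
# Illustrative sketch: count identical aligned columns in the pfam12947 hit at 712-741.
# Assumption: columns with a gap ("-") or a lowercase (unaligned) residue are ignored.

query_row = "CSEQPSVCGNHAVCNNHPGTFRCECMEGYR"   # gi 2009632981, residues 712-741
model_row = "CSDNNGGCHPNATCTNTGGSFTCTCNDGYT"   # Cdd:pfam12947, positions 1-30


def identity_counts(query, model):
    """Return (identical_columns, aligned_columns) for two equal-length rows."""
    if len(query) != len(model):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query, model):
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue  # skip gaps and unaligned residues
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    same, total = identity_counts(query_row, model_row)
    print(f"{same}/{total} identical columns ({100.0 * same / total:.0f}%)")
```

For this hit, 12 of the 30 aligned columns are identical. Five of those 12 are the conserved cysteines.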
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
968-1079 2.63e-03

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 41.04  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  968 LHAPDkviiGLAFDCVDKmIYWTDISQPSIgrASLHGGE--PTTIIRQDLGSPEGIALDHLGrNIFWTDSQLDRieVAKL 1045
Cdd:cd14952     93 LNDPT----GVAVDAAGN-VYVADTGNNRV--LKLAAGSntQTVLPFTGLSNPDGVAVDGAG-NVYVTDTGNNR--VLKL 162
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 2009632981 1046 DG--TQRRVLFETDLVNPRGIVTDSmRGNLYWTDWN 1079
Cdd:cd14952    163 AAgsTTQTVLPFTGLNSPSGVAVDT-AGNVYVTDHG 197
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
388-424 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function, and calcium-binding sites have been found at the N-terminus of particular EGF-like domains; calcium binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges, as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 36.46  E-value: 2.86e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 2009632981  388 CASNrHRCSVHAQCRDFATGFCCRCAAGYTgnGRQCV 424
Cdd:cd00054      5 CASG-NPCQNGGTCVNTVGSYRCSCPPGYT--GRNCE 38
EGF_CA pfam07645
Calcium-binding EGF domain;
800-829 4.14e-03

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 36.06  E-value: 4.14e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2009632981  800 DVNECQ--PSRCHPDAFCYNTPGSFVCRCKYG 829
Cdd:pfam07645    1 DVDECAtgTHNCPANTVCVNTIGSFECRCPDG 32
NHL_like_6 cd14962
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
961-1077 4.90e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271332 [Multi-domain]  Cd Length: 271  Bit Score: 40.26  E-value: 4.90e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2009632981  961 KSEAKTLLHAPdkviIGLAFDCvDKMIYWTDISQPSI-------GRASLHG-GEPTTIIrqdlgSPEGIALDHLGrNIFW 1032
Cdd:cd14962      4 EERPKEALTRP----YGVAADG-RGRIYVADTGRGAVfvfdlpnGKVFVIGnAGPNRFV-----SPIGVAIDANG-NLYV 72
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2009632981 1033 TDSQLDRIEVAKLDGTQRRVLFETDLVN-PRGIVTDSMRGNLYWTD 1077
Cdd:cd14962     73 SDAELGKVFVFDRDGKFLRAIGAGALFKrPTGIAVDPAGKRLYVVD 118
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
388-419 6.56e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain lacks the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between the Swiss-Prot domain start/end and Pfam). The family is hard to model because there are many similar but distinct subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 35.44  E-value: 6.56e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2009632981  388 CASNRhrCSVHAQCRDFATGFCCRCAAGYTGN 419
Cdd:pfam00008    1 CAPNP--CSNGGTCVDTPGGYTCICPEGYTGK 30
EGF_CA smart00179
Calcium-binding EGF-like domain;
757-799 6.99e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 35.69  E-value: 6.99e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|...
gi 2009632981   757 INHCETGlHDCDipQRAQCkFMGHGSYTCICLPGFSgDGRACQ 799
Cdd:smart00179    2 IDECASG-NPCQ--NGGTC-VNTVGSYRCECPPGYT-DGRNCE 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
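Each hit on this page carries a one-line statistics summary (Pssm-ID, optional [Multi-domain] flag, Cd Length, Bit Score, E-value), and the search used an E-value threshold of 0.01. The sketch below shows one way such a line could be parsed and checked against that threshold. The regular expression and function names are assumptions based on the layout of this page, not an official NCBI format definition.

```python
import re

# Illustrative sketch: parse a "Pssm-ID: ..." statistics line as it appears on this page.
# The pattern is inferred from the page layout and is an assumption, not an NCBI spec.
STATS_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

E_VALUE_THRESHOLD = 0.01  # taken from the search parameters listed above


def parse_stats(line):
    """Return a dict of the statistics in one summary line, or None if it does not match."""
    match = STATS_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def passes_threshold(stats, threshold=E_VALUE_THRESHOLD):
    """True when the hit's E-value is at or below the threshold."""
    return stats["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 47.71  E-value: 2.18e-05"
    stats = parse_stats(line)
    print(stats, passes_threshold(stats))
```

Every hit listed on this page has an E-value below 0.01, so each one passes this check.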
