Conserved domains on  [gi|1622846374|ref|XP_028685648|]

stabilin-2 isoform X1 [Macaca mulatta]

Protein Classification

fasciclin domain-containing protein (domain architecture ID 10585349)

Fasciclin domain-containing protein that may be involved in cell adhesion; similar to fasciclin-like arabinogalactan proteins, a subclass of arabinogalactan proteins (AGPs).


List of domain hits

Name Accession Description Interval E-value
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2243-2335 1.73e-51

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major scavenging receptor for HA and related glycosaminoglycans in the liver and lymph nodes. It is a scavenger receptor with a broad range of ligands, including advanced glycation end products (AGEs), acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium, associating with clathrin/AP-2. Stabilin-1, but not stabilin-2, is expressed in macrophages, where it is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leucocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and the stabilins each contain a single link module, which supports high-affinity binding to HA.



Pssm-ID: 239592  Cd Length: 93  Bit Score: 176.50  E-value: 1.73e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGP 2322
Cdd:cd03515      1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGFGHVGIVDYGP 80
                           90
                   ....*....|...
gi 1622846374 2323 RPNKSEMWDVFCY 2335
Cdd:cd03515     81 RLNLSERWDAYCY 93
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1662-1771 2.03e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.88  E-value: 2.03e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1662 QEHFVKDLVGP-GPFTVFAPLSAAFDE--EARVKDW-DKQGLMPQVLRYHVVAChQLLLENLILISNATSLQGEPIVISV 1737
Cdd:pfam02469   12 AAGLVDTLNGSqGPFTVFAPTNEAFAKlpAGTLNFLlKDKEQLKNLLKYHVVPG-RLTSSDLKNGGTLATLQGSKLRVNV 90
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1622846374 1738 SQGTVYINNkAKIISSDIISTNGIVHIIDKLLSP 1771
Cdd:pfam02469   91 TGGSVTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1797-1927 3.17e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.49  E-value: 3.17e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1797 GYIKFSNLIQDSGLLSVITDPiHTPVTLFWPTDRALQALPAEQQDFLFNqdNKDKLKEYLKFHVIRDaKVLAVDLPTSTA 1876
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNGS-QGPFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPG-RLTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622846374 1877 WKTLQGSELSVKCGAGSdigdLFLNGqtCRIVQRELLFDLGVAYGIDCLLI 1927
Cdd:pfam02469   78 LATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1054-1174 8.79e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 107.34  E-value: 8.79e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1054 IFNQWINNASLQLTLSAT-SNLTVLVPSQQATKDMDQDEKSFWLSQSN-IPALIKYHILLGTYRVADLQTLSSsdmlATS 1131
Cdd:pfam02469    5 TFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKDKEqLKNLLKYHVVPGRLTSSDLKNGGT----LAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1622846374 1132 LQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1174
Cdd:pfam02469   81 LQGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1193-1310 1.05e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 98.48  E-value: 1.05e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1193 PDYSIFRGYIIQYNLANAIEAADA-YTVFAPNNDAIANYIRE-----KKVPSLKEDVLRYHVVlEEKLLKNDLHNGMHRK 1266
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGtlnflLKDKEQLKNLLKYHVV-PGRLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1622846374 1267 TMLGFSyfLGFFRHNNQLYVNEAPINYTNVATDKGVIHGLGKVL 1310
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
525-653 5.89e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 5.89e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  525 PRYSKFRSLLEKTNVGHALDEDGvgGPYTIFVPSNEALNNMKDGTLDYLLSPegSRKLLELVRYHIVPfTQLEVATLIST 604
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQ--GPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1622846374  605 PHIRSMANQLIQFTTTdNGQILANDVAMEEIEITAKNGRIYTLTGVLIP 653
Cdd:pfam02469   76 GTLATLQGSKLRVNVT-GGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
381-502 7.80e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.



Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.78  E-value: 7.80e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  381 GRLTSFISLLDKA-YAWPL-SKLGPFTVLLPTDEG---LKEFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDTFYT 455
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLnGSQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1622846374  456 LTGKSGEIFNSDkdNQIKLklyggkNKVKIIQGDIIASNGLLHILDR 502
Cdd:pfam02469   81 LQGSKLRVNVTG--GSVTV------NGARVVQADIEATNGVIHVIDK 119
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2401-2494 4.42e-15

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;



Pssm-ID: 214719  Cd Length: 97  Bit Score: 72.78  E-value: 4.42e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  2401 TLFVPQNSGLgeNETLSGRD----------IEHHLANvSMFFYNDLVNGTTLQTRLGSKLLITASHDPlqptETRFVDGR 2470
Cdd:smart00554    1 TVFAPTDEAF--QKLPPDLNslladklknlLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGGS----GTVTVNGA 73
                            90       100
                    ....*....|....*....|....
gi 1622846374  2471 AILQWDIFASNGIIHIISRPLKAP 2494
Cdd:smart00554   74 RIVEADIAATNGVVHVIDRVLLPP 97
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1520-1556 2.29e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.13  E-value: 2.29e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1520 CEISNGGCSAKADCKRTtPGRRVCTCKAGYMGDGIVC 1556
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1604-1640 2.53e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.75  E-value: 2.53e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1604 CLTKNGGCSEFAICNHTGqGERTCTCKPNYVGNGFTC 1640
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2172-2209 3.89e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 3.89e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1622846374 2172 CADGlNGGCHEHATCKMTgPGKHKCECKSHYVGDGLNC 2209
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
836-864 6.48e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.90  E-value: 6.48e-06
                           10        20
                   ....*....|....*....|....*....
gi 1622846374  836 CHIHATCEYSSGTASCVCKAGYEGDGTVC 864
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1562-1598 2.17e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.36  E-value: 2.17e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1562 CLENHGGCDKNAECTQTgPNQAACNCLPAYTGDGKVC 1598
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2131-2166 8.23e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.82  E-value: 8.23e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1622846374 2131 CKQDNGGCAKVARCSQKGTKVSCSCQKGYKGDGLSC 2166
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
325-360 2.16e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 40.66  E-value: 2.16e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374  325 CKTNN-LCHRNANCTTVgPGQTQCMCRKGYVGDGLTC 360
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
245-274 4.78e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.78e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 1622846374  245 CHPRAHCTYLgPNRHSCTCQEGYRGDGRVC 274
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
875-907 8.79e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 8.79e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622846374  875 PGGCSRNAECIKTGtGTHTCVCQQGWTGNGRDC 907
Cdd:pfam12947    5 NGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1002-1038 1.28e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.



Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.35  E-value: 1.28e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1002 CLEQTRKCHPLANCQSTSSGVwSCVCQEGYEGDGFLC 1038
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1514 6.48e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 6.48e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622846374 1482 NGMCHTSANCLpNSDGTASCKCAAGFQGNGTIC 1514
Cdd:pfam12947    5 NGGCHPNATCT-NTGGSFTCTCNDGYTGDGVTC 36
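
The hit records above follow a regular layout: a summary line (Pssm-ID, Cd Length, Bit Score, E-value) followed by paired query/subject alignment rows. As a rough illustration only, the Python sketch below parses one summary line and computes percent identity for one aligned pair, using the last hit in the list as input. The names `SUMMARY_RE`, `parse_summary`, and `percent_identity` are hypothetical helpers and not part of any NCBI tool, and treating lowercase letters as unaligned residues is an assumption about this display format.

```python
import re

# Hypothetical pattern for the "Pssm-ID: ..." summary line shown for each hit.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"      # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def percent_identity(query, subject):
    """Return (identical, aligned, percent) for two alignment rows.

    Assumption: '-' marks a gap and lowercase letters mark unaligned
    residues, so columns containing either are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


summary = parse_summary(
    "Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 6.48e-03"
)
identity = percent_identity(
    "NGMCHTSANCLpNSDGTASCKCAAGFQGNGTIC",  # gi 1622846374, residues 1482-1514
    "NGGCHPNATCT-NTGGSFTCTCNDGYTGDGVTC",  # pfam12947, positions 5-36
)
print(summary)
print(identity)
```

Rows that wrap over several display blocks (for example the 10-80 and 90+ column blocks of the longer hits) would need to be concatenated per sequence before calling `percent_identity`.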
 
Name Accession Description Interval E-value
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2243-2335 1.73e-51

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major scavenging receptor for HA and related glycosaminoglycans in the liver and lymph nodes. It is a scavenger receptor with a broad range of ligands, including advanced glycation end products (AGEs), acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium, associating with clathrin/AP-2. Stabilin-1, but not stabilin-2, is expressed in macrophages, where it is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leucocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and the stabilins each contain a single link module, which supports high-affinity binding to HA.


Pssm-ID: 239592  Cd Length: 93  Bit Score: 176.50  E-value: 1.73e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGP 2322
Cdd:cd03515      1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGFGHVGIVDYGP 80
                           90
                   ....*....|...
gi 1622846374 2323 RPNKSEMWDVFCY 2335
Cdd:cd03515     81 RLNLSERWDAYCY 93
Xlink pfam00193
Extracellular link domain;
2243-2335 1.43e-36

Extracellular link domain;


Pssm-ID: 459706  Cd Length: 92  Bit Score: 133.85  E-value: 1.43e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPlGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGP 2322
Cdd:pfam00193    1 GVFHLESP-GRYKLTFQEAQAACAALGATLATPEQLYAAWKAGLDTCDAGWLADGTVRYPITTPRPNCGGNMPGVRQYGF 79
                           90
                   ....*....|...
gi 1622846374 2323 RPNKSEMWDVFCY 2335
Cdd:pfam00193   80 RDPLSERYDAYCY 92
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1662-1771 2.03e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.88  E-value: 2.03e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1662 QEHFVKDLVGP-GPFTVFAPLSAAFDE--EARVKDW-DKQGLMPQVLRYHVVAChQLLLENLILISNATSLQGEPIVISV 1737
Cdd:pfam02469   12 AAGLVDTLNGSqGPFTVFAPTNEAFAKlpAGTLNFLlKDKEQLKNLLKYHVVPG-RLTSSDLKNGGTLATLQGSKLRVNV 90
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1622846374 1738 SQGTVYINNkAKIISSDIISTNGIVHIIDKLLSP 1771
Cdd:pfam02469   91 TGGSVTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1797-1927 3.17e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.49  E-value: 3.17e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1797 GYIKFSNLIQDSGLLSVITDPiHTPVTLFWPTDRALQALPAEQQDFLFNqdNKDKLKEYLKFHVIRDaKVLAVDLPTSTA 1876
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNGS-QGPFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPG-RLTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622846374 1877 WKTLQGSELSVKCGAGSdigdLFLNGqtCRIVQRELLFDLGVAYGIDCLLI 1927
Cdd:pfam02469   78 LATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1054-1174 8.79e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 107.34  E-value: 8.79e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1054 IFNQWINNASLQLTLSAT-SNLTVLVPSQQATKDMDQDEKSFWLSQSN-IPALIKYHILLGTYRVADLQTLSSsdmlATS 1131
Cdd:pfam02469    5 TFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKDKEqLKNLLKYHVVPGRLTSSDLKNGGT----LAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1622846374 1132 LQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1174
Cdd:pfam02469   81 LQGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
LINK smart00445
Link (Hyaluronan-binding);
2243-2336 2.25e-26

Link (Hyaluronan-binding);


Pssm-ID: 214667  Cd Length: 94  Bit Score: 105.12  E-value: 2.25e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  2243 GVFHLRsPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGp 2322
Cdd:smart00445    3 GVFHVE-KNGRYKLTFAEAREACRAQGATLATVGQLYAAWQDGFDTCDAGWLADGSVRYPIITPRPRCGGNLPGVRQYG- 80
                            90
                    ....*....|....
gi 1622846374  2323 RPNKSEMWDVFCYR 2336
Cdd:smart00445   81 FPDPTSRYDAYCFN 94
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1666-1771 3.42e-24

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 101.14  E-value: 3.42e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1666 VKDLVGPGPFTVFAPLSAAFDE--EARVKDW----DKQGLMpQVLRYHVVAcHQLLLENLILISNATSLQGEPIVISVSQ 1739
Cdd:COG2335     56 VDTLSGEGPFTVFAPTDAAFAAlpAGTLDALlkpeNKATLT-KILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG 133
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1622846374 1740 GTVYINNkAKIISSDIISTNGIVHIIDKLLSP 1771
Cdd:COG2335    134 GGVTVNG-ANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1193-1310 1.05e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 98.48  E-value: 1.05e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1193 PDYSIFRGYIIQYNLANAIEAADA-YTVFAPNNDAIANYIRE-----KKVPSLKEDVLRYHVVlEEKLLKNDLHNGMHRK 1266
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGtlnflLKDKEQLKNLLKYHVV-PGRLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1622846374 1267 TMLGFSyfLGFFRHNNQLYVNEAPINYTNVATDKGVIHGLGKVL 1310
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
525-653 5.89e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 5.89e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  525 PRYSKFRSLLEKTNVGHALDEDGvgGPYTIFVPSNEALNNMKDGTLDYLLSPegSRKLLELVRYHIVPfTQLEVATLIST 604
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQ--GPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1622846374  605 PHIRSMANQLIQFTTTdNGQILANDVAMEEIEITAKNGRIYTLTGVLIP 653
Cdd:pfam02469   76 GTLATLQGSKLRVNVT-GGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
381-502 7.80e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.78  E-value: 7.80e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  381 GRLTSFISLLDKA-YAWPL-SKLGPFTVLLPTDEG---LKEFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDTFYT 455
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLnGSQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1622846374  456 LTGKSGEIFNSDkdNQIKLklyggkNKVKIIQGDIIASNGLLHILDR 502
Cdd:pfam02469   81 LQGSKLRVNVTG--GSVTV------NGARVVQADIEATNGVIHVIDK 119
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1676-1772 2.09e-22

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 93.58  E-value: 2.09e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1676 TVFAPLSAAFDE-EARVKDWDKQgLMPQVLRYHVVAcHQLLLENLILISNATSLQGEPIVISVSQ--GTVYINNkAKIIS 1752
Cdd:smart00554    1 TVFAPTDEAFQKlPPDLNSLLAD-KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgsGTVTVNG-ARIVE 77
                            90       100
                    ....*....|....*....|
gi 1622846374  1753 SDIISTNGIVHIIDKLLSPK 1772
Cdd:smart00554   78 ADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1045-1174 1.81e-20

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 90.74  E-value: 1.81e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1045 ELSFLSEAaifnqwINNASLQLTLSATSNLTVLVPSQQATKDMDQDEKSFWLSQSNIPAL---IKYHILLGTYRVADLQT 1121
Cdd:COG2335     42 DFSTLVAA------LKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLtkiLTYHVVPGKVTAADLKD 115
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1622846374 1122 LSSsdmlATSLQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1174
Cdd:COG2335    116 GKT----LTTLQGQTLTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
511-653 1.98e-20

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 90.35  E-value: 1.98e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  511 FESNNEQTIMTMLQ--PRYSKFRSLLEKTNVGHALDEDGvggPYTIFVPSNEALNNMKDGTLDYLLSPEGSRKLLELVRY 588
Cdd:COG2335     25 AAMAPTKNIVETAAnnPDFSTLVAALKAAGLVDTLSGEG---PFTVFAPTDAAFAALPAGTLDALLKPENKATLTKILTY 101
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622846374  589 HIVPfTQLEVATLISTPHIRSMANQLIQFTTTDnGQILANDVAMEEIEITAKNGRIYTLTGVLIP 653
Cdd:COG2335    102 HVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG-GGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1823-1929 2.09e-18

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 82.41  E-value: 2.09e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1823 TLFWPTDRALQALPAEQQDFLfnqdnKDKLKEYLKFHVIRDaKVLAVDLPTSTAWKTLQGSELSVKCGAGSdiGDLFLNG 1902
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLL-----ADKLKNLLLYHVVPG-RLSSADLLNGGTLPTLAGSKLRITRSGGS--GTVTVNG 72
                            90       100
                    ....*....|....*....|....*..
gi 1622846374  1903 QtcRIVQRELLFDLGVAYGIDCLLIDP 1929
Cdd:smart00554   73 A--RIVEADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1787-1927 5.30e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 77.64  E-value: 5.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1787 QNLTTLATKNGyiKFSNL---IQDSGLLSVITDPihTPVTLFWPTDRALQALPAEQQDFLFNQDNKDKLKEYLKFHVIrD 1863
Cdd:COG2335     31 KNIVETAANNP--DFSTLvaaLKAAGLVDTLSGE--GPFTVFAPTDAAFAALPAGTLDALLKPENKATLTKILTYHVV-P 105
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622846374 1864 AKVLAVDLPTSTAWKTLQGSELSVKcgaGSDiGDLFLNGQTcrIVQRELLFDLGVAYGIDCLLI 1927
Cdd:COG2335    106 GKVTAADLKDGKTLTTLQGQTLTVT---VSG-GGVTVNGAN--VITADIEASNGVIHVIDKVLL 163
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1075-1174 5.78e-16

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 75.48  E-value: 5.78e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1075 TVLVPSQQATKDMDQDEKSFWLSQsnIPALIKYHILLGTYRVADLQtlssSDMLATSLQGN--FLHLAKVDGNITIEGAS 1152
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADK--LKNLLLYHVVPGRLSSADLL----NGGTLPTLAGSklRITRSGGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|..
gi 1622846374  1153 IVDGDNAATNGVIHIINKVLVP 1174
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1183-1310 7.94e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 77.25  E-value: 7.94e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1183 PNLLMRLEQMPDYSIFRGYIIQYNLANAIEAADAYTVFAPNNDAIANYIREKKVPSLKE-------DVLRYHVVlEEKLL 1255
Cdd:COG2335     31 KNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkatltKILTYHVV-PGKVT 109
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1622846374 1256 KNDLHNGMHRKTMLGFSyfLGFFRHNNQLYVNEAPINYTNVATDKGVIHGLGKVL 1310
Cdd:COG2335    110 AADLKDGKTLTTLQGQT--LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVL 162
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2401-2494 4.42e-15

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 72.78  E-value: 4.42e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  2401 TLFVPQNSGLgeNETLSGRD----------IEHHLANvSMFFYNDLVNGTTLQTRLGSKLLITASHDPlqptETRFVDGR 2470
Cdd:smart00554    1 TVFAPTDEAF--QKLPPDLNslladklknlLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGGS----GTVTVNGA 73
                            90       100
                    ....*....|....*....|....
gi 1622846374  2471 AILQWDIFASNGIIHIISRPLKAP 2494
Cdd:smart00554   74 RIVEADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
381-502 1.53e-14

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 73.40  E-value: 1.53e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  381 GRLTSFISLLDKA-YAWPLSKLGPFTVLLPTDEG---LKEFNVNELLVD-NKAA-QYFVKLHIIAGQMNIEYMNNTDTFY 454
Cdd:COG2335     41 PDFSTLVAALKAAgLVDTLSGEGPFTVFAPTDAAfaaLPAGTLDALLKPeNKATlTKILTYHVVPGKVTAADLKDGKTLT 120
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1622846374  455 TLTGKSGEIfnSDKDNQIKLklyggkNKVKIIQGDIIASNGLLHILDR 502
Cdd:COG2335    121 TLQGQTLTV--TVSGGGVTV------NGANVITADIEASNGVIHVIDK 160
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
405-502 1.74e-12

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 65.46  E-value: 1.74e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374   405 TVLLPTDEGLKEFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDTFYTLTGKSGEIFNSDKDNQIKLklyggkNKVK 484
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTV------NGAR 74
                            90
                    ....*....|....*...
gi 1622846374   485 IIQGDIIASNGLLHILDR 502
Cdd:smart00554   75 IVEADIAATNGVVHVIDR 92
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1218-1310 2.31e-12

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 65.08  E-value: 2.31e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1218 TVFAPNNDAIANYIREKK--VPSLKEDVLRYHVVlEEKLLKNDLHNGMHRKTMLGFSYFLGFFRHNNQLYVNEAPINYTN 1295
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNslLADKLKNLLLYHVV-PGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTVNGARIVEAD 79
                            90
                    ....*....|....*
gi 1622846374  1296 VATDKGVIHGLGKVL 1310
Cdd:smart00554   80 IAATNGVVHVIDRVL 94
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
553-654 8.96e-12

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 63.54  E-value: 8.96e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374   553 TIFVPSNEALNNMKDGtLDYLLSPegsrKLLELVRYHIVPfTQLEVATLISTPHIRSMANQLIQFTTT-DNGQILANDVA 631
Cdd:smart00554    1 TVFAPTDEAFQKLPPD-LNSLLAD----KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSgGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 1622846374   632 MEEIEITAKNGRIYTLTGVLIPP 654
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1520-1556 2.29e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.13  E-value: 2.29e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1520 CEISNGGCSAKADCKRTtPGRRVCTCKAGYMGDGIVC 1556
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
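An alignment pair such as the EGF_3 hit at 1520-1556 above can be summarised as a count of identical residues. The following is a minimal sketch, not part of the CD-Search output. It assumes the two aligned strings have been copied out without the `gi`/`Cdd:` prefixes and coordinates, and that lowercase letters mark unaligned residues and `-` marks gaps. The function name `alignment_identity` is hypothetical.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for one alignment pair.

    Assumption: uppercase letters are aligned residues; lowercase letters
    and '-' are unaligned residues or gaps and are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("aligned strings must have the same length")

    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Count a column only when both sequences have an aligned residue.
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Strings copied from the EGF_3 hit covering residues 1520-1556.
query_row = "CEISNGGCSAKADCKRTtPGRRVCTCKAGYMGDGIVC"
model_row = "CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC"
identical, aligned = alignment_identity(query_row, model_row)
print(f"{identical}/{aligned} aligned columns identical")
```

The fraction is taken over aligned columns only, so the gap column does not lower it. A different denominator (for example, the full model length) would give a different value.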
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1604-1640 2.53e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.75  E-value: 2.53e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1604 CLTKNGGCSEFAICNHTGqGERTCTCKPNYVGNGFTC 1640
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2172-2209 3.89e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 3.89e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1622846374 2172 CADGlNGGCHEHATCKMTgPGKHKCECKSHYVGDGLNC 2209
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
2401-2491 4.21e-07

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 51.10  E-value: 4.21e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2401 TLFVPQNSGLGE------NETLSGRD-----IEHHLANvSMFFYNDLVNGTTLQTRLGSKLLITASHDPLqptetrFVDG 2469
Cdd:pfam02469   27 TVFAPTNEAFAKlpagtlNFLLKDKEqlknlLKYHVVP-GRLTSSDLKNGGTLATLQGSKLRVNVTGGSV------TVNG 99
                           90       100
                   ....*....|....*....|..
gi 1622846374 2470 RAILQWDIFASNGIIHIISRPL 2491
Cdd:pfam02469  100 ARVVQADIEATNGVIHVIDKVL 121
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
836-864 6.48e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.90  E-value: 6.48e-06
                           10        20
                   ....*....|....*....|....*....
gi 1622846374  836 CHIHATCEYSSGTASCVCKAGYEGDGTVC 864
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
2340-2491 8.14e-06

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 48.36  E-value: 8.14e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2340 VNCSCKVGYVGDGFSCSGNLLQVLMSFPSLTNfLTEVLAYSNssargraflehLTD-LSIRG--TLFVPQNS-----GLG 2411
Cdd:COG2335     14 AACASSAAAEGAAMAPTKNIVETAANNPDFST-LVAALKAAG-----------LVDtLSGEGpfTVFAPTDAafaalPAG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2412 ENETLSG-------RDI-EHHLANvSMFFYNDLVNGTTLQTRLGSKLLITASHDPLQptetrfVDGRAILQWDIFASNGI 2483
Cdd:COG2335     82 TLDALLKpenkatlTKIlTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSGGGVT------VNGANVITADIEASNGV 154

                   ....*...
gi 1622846374 2484 IHIISRPL 2491
Cdd:COG2335    155 IHVIDKVL 162
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1562-1598 2.17e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.36  E-value: 2.17e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1562 CLENHGGCDKNAECTQTgPNQAACNCLPAYTGDGKVC 1598
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2131-2166 8.23e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.82  E-value: 8.23e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1622846374 2131 CKQDNGGCAKVARCSQKGTKVSCSCQKGYKGDGLSC 2166
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
325-360 2.16e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 40.66  E-value: 2.16e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374  325 CKTNN-LCHRNANCTTVgPGQTQCMCRKGYVGDGLTC 360
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
245-274 4.78e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.78e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 1622846374  245 CHPRAHCTYLgPNRHSCTCQEGYRGDGRVC 274
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
875-907 8.79e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 8.79e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622846374  875 PGGCSRNAECIKTGtGTHTCVCQQGWTGNGRDC 907
Cdd:pfam12947    5 NGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1002-1038 1.28e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.35  E-value: 1.28e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1002 CLEQTRKCHPLANCQSTSSGVwSCVCQEGYEGDGFLC 1038
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1514 6.48e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 6.48e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622846374 1482 NGMCHTSANCLpNSDGTASCKCAAGFQGNGTIC 1514
Cdd:pfam12947    5 NGGCHPNATCT-NTGGSFTCTCNDGYTGDGVTC 36
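
Every hit above carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. The following is a minimal sketch for pulling those fields out of a text copy of this listing. The names `HitStats` and `parse_hit_stats` are hypothetical, and the pattern is inferred only from the lines shown here, so other CD-Search output layouts may need adjustment.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    pssm_id: int
    multi_domain: bool  # True when the line carries the "[Multi-domain]" tag
    cd_length: int
    bit_score: float
    e_value: float


# Pattern inferred from the summary lines in this listing (an assumption).
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([0-9.]+)\s+"
    r"E-value:\s*([0-9.eE+-]+)"
)


def parse_hit_stats(line: str) -> Optional[HitStats]:
    """Parse one summary line; return None if the line does not match."""
    match = _STATS_RE.search(line)
    if match is None:
        return None
    return HitStats(
        pssm_id=int(match.group(1)),
        multi_domain=match.group(2) is not None,
        cd_length=int(match.group(3)),
        bit_score=float(match.group(4)),
        e_value=float(match.group(5)),
    )


line = "Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.13  E-value: 2.29e-07"
print(parse_hit_stats(line))
```

Applied to every line of the listing, the parser returns `None` for alignment and description lines, so the hits can be collected with a simple filter.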
 
Name Accession Description Interval E-value
Link_domain_TSG_6_like cd03515
This is the extracellular link domain of the type found in human TSG-6. The link domain is a ...
2243-2335 1.73e-51

This is the extracellular link domain of the type found in human TSG-6. The link domain is a hyaluronan (HA)-binding domain. TSG-6 is the protein product of tumor necrosis factor-stimulated gene-6. TSG-6 is up-regulated in inflammatory lesions and in the ovary during ovulation. It has a strong anti-inflammatory and chondroprotective effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. Also included in this group are the stabilins: stabilin-1 (FEEL-1, CLEVER-1) and stabilin-2 (FEEL-2). Stabilin-2 functions as the major liver and lymph node-scavenging receptor for HA and related glycosaminoglycans. Stabilin-2 is a scavenger receptor with a broad range of ligands including advanced glycation end (AGE) products, acetylated low density lipoprotein and procollagen peptides. In contrast, stabilin-1 does not bind HA, but binds acetylated low density lipoprotein and AGEs with lower affinity. As AGEs accumulate in vascular tissues during aging and diabetes, these receptors may be implicated in the pathologies of these states. Both stabilins are present in the early endocytic pathway in hepatic sinusoidal epithelium associating with clathrin/AP-2. Stabilin-1 is expressed in macrophages, whereas stabilin-2 is absent from them. In macrophages, stabilin-1 is involved in trafficking between early/sorting endosomes and the trans-Golgi network. Stabilin-1 has also been implicated in angiogenesis and possibly leucocyte trafficking. Both stabilins bind gram-positive and gram-negative bacteria. TSG-6 and stabilins contain a single link module which supports high affinity binding to HA.


Pssm-ID: 239592  Cd Length: 93  Bit Score: 176.50  E-value: 1.73e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGP 2322
Cdd:cd03515      1 GVFHLRSRSGKYKLTYTEAKAACEAEGAHLATYSQLSAAQQLGFHLCAAGWLAKGRVGYPIVFPSANCGFGHVGIVDYGP 80
                           90
                   ....*....|...
gi 1622846374 2323 RPNKSEMWDVFCY 2335
Cdd:cd03515     81 RLNLSERWDAYCY 93
Xlink pfam00193
Extracellular link domain;
2243-2335 1.43e-36

Extracellular link domain;


Pssm-ID: 459706  Cd Length: 92  Bit Score: 133.85  E-value: 1.43e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPlGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGP 2322
Cdd:pfam00193    1 GVFHLESP-GRYKLTFQEAQAACAALGATLATPEQLYAAWKAGLDTCDAGWLADGTVRYPITTPRPNCGGNMPGVRQYGF 79
                           90
                   ....*....|...
gi 1622846374 2323 RPNKSEMWDVFCY 2335
Cdd:pfam00193   80 RDPLSERYDAYCY 92
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1662-1771 2.03e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.88  E-value: 2.03e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1662 QEHFVKDLVGP-GPFTVFAPLSAAFDE--EARVKDW-DKQGLMPQVLRYHVVAChQLLLENLILISNATSLQGEPIVISV 1737
Cdd:pfam02469   12 AAGLVDTLNGSqGPFTVFAPTNEAFAKlpAGTLNFLlKDKEQLKNLLKYHVVPG-RLTSSDLKNGGTLATLQGSKLRVNV 90
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1622846374 1738 SQGTVYINNkAKIISSDIISTNGIVHIIDKLLSP 1771
Cdd:pfam02469   91 TGGSVTVNG-ARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1797-1927 3.17e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 108.49  E-value: 3.17e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1797 GYIKFSNLIQDSGLLSVITDPiHTPVTLFWPTDRALQALPAEQQDFLFNqdNKDKLKEYLKFHVIRDaKVLAVDLPTSTA 1876
Cdd:pfam02469    2 GFSTFVALLKAAGLVDTLNGS-QGPFTVFAPTNEAFAKLPAGTLNFLLK--DKEQLKNLLKYHVVPG-RLTSSDLKNGGT 77
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1622846374 1877 WKTLQGSELSVKCGAGSdigdLFLNGqtCRIVQRELLFDLGVAYGIDCLLI 1927
Cdd:pfam02469   78 LATLQGSKLRVNVTGGS----VTVNG--ARVVQADIEATNGVIHVIDKVLL 122
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1054-1174 8.79e-27

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 107.34  E-value: 8.79e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1054 IFNQWINNASLQLTLSAT-SNLTVLVPSQQATKDMDQDEKSFWLSQSN-IPALIKYHILLGTYRVADLQTLSSsdmlATS 1131
Cdd:pfam02469    5 TFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLLKDKEqLKNLLKYHVVPGRLTSSDLKNGGT----LAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1622846374 1132 LQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1174
Cdd:pfam02469   81 LQGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
LINK smart00445
Link (Hyaluronan-binding);
2243-2336 2.25e-26

Link (Hyaluronan-binding);


Pssm-ID: 214667  Cd Length: 94  Bit Score: 105.12  E-value: 2.25e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  2243 GVFHLRsPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGp 2322
Cdd:smart00445    3 GVFHVE-KNGRYKLTFAEAREACRAQGATLATVGQLYAAWQDGFDTCDAGWLADGSVRYPIITPRPRCGGNLPGVRQYG- 80
                            90
                    ....*....|....
gi 1622846374  2323 RPNKSEMWDVFCYR 2336
Cdd:smart00445   81 FPDPTSRYDAYCFN 94
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1666-1771 3.42e-24

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 101.14  E-value: 3.42e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1666 VKDLVGPGPFTVFAPLSAAFDE--EARVKDW----DKQGLMpQVLRYHVVAcHQLLLENLILISNATSLQGEPIVISVSQ 1739
Cdd:COG2335     56 VDTLSGEGPFTVFAPTDAAFAAlpAGTLDALlkpeNKATLT-KILTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG 133
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1622846374 1740 GTVYINNkAKIISSDIISTNGIVHIIDKLLSP 1771
Cdd:COG2335    134 GGVTVNG-ANVITADIEASNGVIHVIDKVLLP 164
Link_Domain cd01102
The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive ...
2243-2335 5.81e-24

The link domain is a hyaluronan (HA)-binding domain. It functions to mediate adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It is found in the CD44 receptor and in human TSG-6. TSG-6 is the protein product of the tumor necrosis factor-stimulated gene-6. TSG-6 has a strong anti-inflammatory effect in models of acute inflammation and autoimmune arthritis and plays an essential role in female fertility. This group also contains the link domains of the chondroitin sulfate proteoglycan core proteins (CSPG) including aggrecan, versican, neurocan, and brevican and the link domains of the vertebrate HAPLN (HA and proteoglycan binding link) protein family. In cartilage, aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates in which other CSPGs substitute for aggrecan might contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes. TSG-6 contains a single link module which supports high affinity binding with HA. The functional HA-binding domain of CD44 is an extended domain composed of a link module flanked by N- and C-terminal extensions. These extensions are essential for folding and functional activity. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of the CSPG aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) which contains link modules 3 and 4 which lack HA-binding activity. HAPLNs contain two contiguous link modules.


Pssm-ID: 238534  Cd Length: 92  Bit Score: 97.87  E-value: 5.81e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYGP 2322
Cdd:cd01102      1 VVFHLESQNGRYKLTFAEAALACKARGAHLATPGQLEAAWQDGFDVCTAGWLADGSVRYPIVTSRPNCGGRNPGVRSYGN 80
                           90
                   ....*....|...
gi 1622846374 2323 rPNKSEMWDVFCY 2335
Cdd:cd01102     81 -PAPSGRYDAYCF 92
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1193-1310 1.05e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 98.48  E-value: 1.05e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1193 PDYSIFRGYIIQYNLANAIEAADA-YTVFAPNNDAIANYIRE-----KKVPSLKEDVLRYHVVlEEKLLKNDLHNGMHRK 1266
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQGpFTVFAPTNEAFAKLPAGtlnflLKDKEQLKNLLKYHVV-PGRLTSSDLKNGGTLA 79
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1622846374 1267 TMLGFSyfLGFFRHNNQLYVNEAPINYTNVATDKGVIHGLGKVL 1310
Cdd:pfam02469   80 TLQGSK--LRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVL 121
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
525-653 5.89e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 96.17  E-value: 5.89e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  525 PRYSKFRSLLEKTNVGHALDEDGvgGPYTIFVPSNEALNNMKDGTLDYLLSPegSRKLLELVRYHIVPfTQLEVATLIST 604
Cdd:pfam02469    1 PGFSTFVALLKAAGLVDTLNGSQ--GPFTVFAPTNEAFAKLPAGTLNFLLKD--KEQLKNLLKYHVVP-GRLTSSDLKNG 75
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 1622846374  605 PHIRSMANQLIQFTTTdNGQILANDVAMEEIEITAKNGRIYTLTGVLIP 653
Cdd:pfam02469   76 GTLATLQGSKLRVNVT-GGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
381-502 7.80e-23

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 95.78  E-value: 7.80e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  381 GRLTSFISLLDKA-YAWPL-SKLGPFTVLLPTDEG---LKEFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDTFYT 455
Cdd:pfam02469    1 PGFSTFVALLKAAgLVDTLnGSQGPFTVFAPTNEAfakLPAGTLNFLLKDKEQLKNLLKYHVVPGRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1622846374  456 LTGKSGEIFNSDkdNQIKLklyggkNKVKIIQGDIIASNGLLHILDR 502
Cdd:pfam02469   81 LQGSKLRVNVTG--GSVTV------NGARVVQADIEATNGVIHVIDK 119
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1676-1772 2.09e-22

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 93.58  E-value: 2.09e-22
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1676 TVFAPLSAAFDE-EARVKDWDKQgLMPQVLRYHVVAcHQLLLENLILISNATSLQGEPIVISVSQ--GTVYINNkAKIIS 1752
Cdd:smart00554    1 TVFAPTDEAFQKlPPDLNSLLAD-KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGgsGTVTVNG-ARIVE 77
                            90       100
                    ....*....|....*....|
gi 1622846374  1753 SDIISTNGIVHIIDKLLSPK 1772
Cdd:smart00554   78 ADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1045-1174 1.81e-20

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 90.74  E-value: 1.81e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1045 ELSFLSEAaifnqwINNASLQLTLSATSNLTVLVPSQQATKDMDQDEKSFWLSQSNIPAL---IKYHILLGTYRVADLQT 1121
Cdd:COG2335     42 DFSTLVAA------LKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLtkiLTYHVVPGKVTAADLKD 115
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1622846374 1122 LSSsdmlATSLQGNFLHLAKVDGNITIEGASIVDGDNAATNGVIHIINKVLVP 1174
Cdd:COG2335    116 GKT----LTTLQGQTLTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
511-653 1.98e-20

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 90.35  E-value: 1.98e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  511 FESNNEQTIMTMLQ--PRYSKFRSLLEKTNVGHALDEDGvggPYTIFVPSNEALNNMKDGTLDYLLSPEGSRKLLELVRY 588
Cdd:COG2335     25 AAMAPTKNIVETAAnnPDFSTLVAALKAAGLVDTLSGEG---PFTVFAPTDAAFAALPAGTLDALLKPENKATLTKILTY 101
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622846374  589 HIVPfTQLEVATLISTPHIRSMANQLIQFTTTDnGQILANDVAMEEIEITAKNGRIYTLTGVLIP 653
Cdd:COG2335    102 HVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSG-GGVTVNGANVITADIEASNGVIHVIDKVLLP 164
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1823-1929 2.09e-18

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 82.41  E-value: 2.09e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1823 TLFWPTDRALQALPAEQQDFLfnqdnKDKLKEYLKFHVIRDaKVLAVDLPTSTAWKTLQGSELSVKCGAGSdiGDLFLNG 1902
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLL-----ADKLKNLLLYHVVPG-RLSSADLLNGGTLPTLAGSKLRITRSGGS--GTVTVNG 72
                            90       100
                    ....*....|....*....|....*..
gi 1622846374  1903 QtcRIVQRELLFDLGVAYGIDCLLIDP 1929
Cdd:smart00554   73 A--RIVEADIAATNGVVHVIDRVLLPP 97
Link_domain_CSPGs_modules_1_3 cd03517
Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third ...
2244-2335 3.33e-17

Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239594  Cd Length: 95  Bit Score: 78.99  E-value: 3.33e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2244 VFHLRSPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNC---GSGVVGIVDY 2320
Cdd:cd03517      2 VFHYRDATARYALTFPRAQRACLDISAQIATPEQLLAAYEDGFEQCDAGWLADQTVRYPIQTPREGCygdMDGFPGVRNY 81
                           90
                   ....*....|....*
gi 1622846374 2321 GPRpNKSEMWDVFCY 2335
Cdd:cd03517     82 GVR-DPDELYDVYCY 95
Link_domain_HAPLN_module_1 cd03518
Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins ...
2244-2335 4.36e-17

Link_domain_HAPLN_module_1; this link domain is found in the first link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family, which includes cartilage link protein. The link domain is an HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates with other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239595  Cd Length: 95  Bit Score: 78.62  E-value: 4.36e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2244 VFHLRSPLGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCG--SGVVGIVDYG 2321
Cdd:cd03518      2 VFPYQPRLGRYNLNFHEAQQACEEQDATLASFEQLYQAWTEGLDWCNAGWLSDGTVQYPITKPREPCGgkRTVPGLRSYG 81
                           90
                   ....*....|....
gi 1622846374 2322 PRPNKSEMWDVFCY 2335
Cdd:cd03518     82 ERDKMLSRYDAFCF 95
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1787-1927 5.30e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 77.64  E-value: 5.30e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1787 QNLTTLATKNGyiKFSNL---IQDSGLLSVITDPihTPVTLFWPTDRALQALPAEQQDFLFNQDNKDKLKEYLKFHVIrD 1863
Cdd:COG2335     31 KNIVETAANNP--DFSTLvaaLKAAGLVDTLSGE--GPFTVFAPTDAAFAALPAGTLDALLKPENKATLTKILTYHVV-P 105
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622846374 1864 AKVLAVDLPTSTAWKTLQGSELSVKcgaGSDiGDLFLNGQTcrIVQRELLFDLGVAYGIDCLLI 1927
Cdd:COG2335    106 GKVTAADLKDGKTLTTLQGQTLTVT---VSG-GGVTVNGAN--VITADIEASNGVIHVIDKVLL 163
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1075-1174 5.78e-16

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 75.48  E-value: 5.78e-16
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1075 TVLVPSQQATKDMDQDEKSFWLSQsnIPALIKYHILLGTYRVADLQtlssSDMLATSLQGN--FLHLAKVDGNITIEGAS 1152
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADK--LKNLLLYHVVPGRLSSADLL----NGGTLPTLAGSklRITRSGGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|..
gi 1622846374  1153 IVDGDNAATNGVIHIINKVLVP 1174
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLP 96
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
1183-1310 7.94e-16

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 77.25  E-value: 7.94e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 1183 PNLLMRLEQMPDYSIFRGYIIQYNLANAIEAADAYTVFAPNNDAIANYIREKKVPSLKE-------DVLRYHVVlEEKLL 1255
Cdd:COG2335     31 KNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPenkatltKILTYHVV-PGKVT 109
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1622846374 1256 KNDLHNGMHRKTMLGFSyfLGFFRHNNQLYVNEAPINYTNVATDKGVIHGLGKVL 1310
Cdd:COG2335    110 AADLKDGKTLTTLQGQT--LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVL 162
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
2401-2494 4.42e-15

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 72.78  E-value: 4.42e-15
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  2401 TLFVPQNSGLgeNETLSGRD----------IEHHLANvSMFFYNDLVNGTTLQTRLGSKLLITASHDPlqptETRFVDGR 2470
Cdd:smart00554    1 TVFAPTDEAF--QKLPPDLNslladklknlLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSGGS----GTVTVNGA 73
                            90       100
                    ....*....|....*....|....
gi 1622846374  2471 AILQWDIFASNGIIHIISRPLKAP 2494
Cdd:smart00554   74 RIVEADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
381-502 1.53e-14

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 73.40  E-value: 1.53e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  381 GRLTSFISLLDKA-YAWPLSKLGPFTVLLPTDEG---LKEFNVNELLVD-NKAA-QYFVKLHIIAGQMNIEYMNNTDTFY 454
Cdd:COG2335     41 PDFSTLVAALKAAgLVDTLSGEGPFTVFAPTDAAfaaLPAGTLDALLKPeNKATlTKILTYHVVPGKVTAADLKDGKTLT 120
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1622846374  455 TLTGKSGEIfnSDKDNQIKLklyggkNKVKIIQGDIIASNGLLHILDR 502
Cdd:COG2335    121 TLQGQTLTV--TVSGGGVTV------NGANVITADIEASNGVIHVIDK 160
Link_domain_CSPGs_modules_2_4 cd03520
Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules ...
2244-2335 5.05e-13

Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) with link modules 3 and 4, which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239597  Cd Length: 96  Bit Score: 66.95  E-value: 5.05e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2244 VFHLRSPlgqYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVG---IVDY 2320
Cdd:cd03520      2 VFYATAP---EKFTFQEARAECRSLGAVLATTGQLYAAWRQGLDQCDPGWLADGSVRYPISTPRPQCGGGLPGvrtLYRF 78
                           90
                   ....*....|....*...
gi 1622846374 2321 GPR---PNKSEMWDVFCY 2335
Cdd:cd03520     79 PNQtgfPDPHSRFDAYCF 96
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
405-502 1.74e-12

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 65.46  E-value: 1.74e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374   405 TVLLPTDEGLKEFNVNELLVDNKAAQYFVKLHIIAGQMNIEYMNNTDTFYTLTGKSGEIFNSDKDNQIKLklyggkNKVK 484
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLADKLKNLLLYHVVPGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTV------NGAR 74
                            90
                    ....*....|....*...
gi 1622846374   485 IIQGDIIASNGLLHILDR 502
Cdd:smart00554   75 IVEADIAATNGVVHVIDR 92
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1218-1310 2.31e-12

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 65.08  E-value: 2.31e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374  1218 TVFAPNNDAIANYIREKK--VPSLKEDVLRYHVVlEEKLLKNDLHNGMHRKTMLGFSYFLGFFRHNNQLYVNEAPINYTN 1295
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNslLADKLKNLLLYHVV-PGRLSSADLLNGGTLPTLAGSKLRITRSGGSGTVTVNGARIVEAD 79
                            90
                    ....*....|....*
gi 1622846374  1296 VATDKGVIHGLGKVL 1310
Cdd:smart00554   80 IAATNGVVHVIDRVL 94
Link_domain_HAPLN_module_2 cd03519
Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins ...
2243-2335 8.92e-12

Link_domain_HAPLN_module_2; this link domain is found in the second link module of proteins similar to the vertebrate HAPLN (hyaluronan/HA and proteoglycan binding link) protein family, which includes cartilage link protein. The link domain is an HA-binding domain. HAPLNs contain two contiguous link modules. Both link modules of cartilage link protein are involved in interaction with HA. In cartilage, the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan forms cartilage link protein-stabilized aggregates with HA. These aggregates contribute to the tissue's load-bearing properties. Aggregates with other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HAPLN gene family are physically linked adjacent to CSPG genes.


Pssm-ID: 239596  Cd Length: 91  Bit Score: 63.21  E-value: 8.92e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPLgqyKLTFDKAKEACANETATMATYSQLSYAQK-AKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYG 2321
Cdd:cd03519      1 GVFYLLHPG---KLTFSEAVAACQRDGAQIAKVGQLFAAWKfHGLDRCDAGWLADGSVRYPISRPRPRCGPLEPGVRSFG 77
                           90
                   ....*....|....
gi 1622846374 2322 PRPNKSEMWDVFCY 2335
Cdd:cd03519     78 FPDKKHKLYGVYCY 91
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
553-654 8.96e-12

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 63.54  E-value: 8.96e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374   553 TIFVPSNEALNNMKDGtLDYLLSPegsrKLLELVRYHIVPfTQLEVATLISTPHIRSMANQLIQFTTT-DNGQILANDVA 631
Cdd:smart00554    1 TVFAPTDEAFQKLPPD-LNSLLAD----KLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSgGSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 1622846374   632 MEEIEITAKNGRIYTLTGVLIPP 654
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
Link_domain_CD44_like cd03516
This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates ...
2243-2339 2.18e-10

This domain is a hyaluronan (HA)-binding domain. It is found in the CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain composed of a single link module flanked by N- and C-terminal extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS-containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marker for the detection and characterization of lymphatic vessels in tumors.


Pssm-ID: 239593  Cd Length: 144  Bit Score: 60.94  E-value: 2.18e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2243 GVFHLRSPlGQYKLTFDKAKEACANETATMATYSQLSYAQKAKYHLCSAGWLESGRVAYPTAFASQNCGSGVVGIVDYgp 2322
Cdd:cd03516      7 GVFLVEKN-GRYSLNFTEAKEACRALGLTLASKAQVETALKFGFETCRYGWVEDGFVVIPRIDPNPLCGKNGTGVYIL-- 83
                           90
                   ....*....|....*..
gi 1622846374 2323 RPNKSEMWDVFCYRMKD 2339
Cdd:cd03516     84 NSNLSSRYDAYCYNSSD 100
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1520-1556 2.29e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 49.13  E-value: 2.29e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1520 CEISNGGCSAKADCKRTtPGRRVCTCKAGYMGDGIVC 1556
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1604-1640 2.53e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.75  E-value: 2.53e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1604 CLTKNGGCSEFAICNHTGqGERTCTCKPNYVGNGFTC 1640
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2172-2209 3.89e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 48.36  E-value: 3.89e-07
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1622846374 2172 CADGlNGGCHEHATCKMTgPGKHKCECKSHYVGDGLNC 2209
Cdd:pfam12947    1 CSDN-NGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
2401-2491 4.21e-07

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 51.10  E-value: 4.21e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2401 TLFVPQNSGLGE------NETLSGRD-----IEHHLANvSMFFYNDLVNGTTLQTRLGSKLLITASHDPLqptetrFVDG 2469
Cdd:pfam02469   27 TVFAPTNEAFAKlpagtlNFLLKDKEqlknlLKYHVVP-GRLTSSDLKNGGTLATLQGSKLRVNVTGGSV------TVNG 99
                           90       100
                   ....*....|....*....|..
gi 1622846374 2470 RAILQWDIFASNGIIHIISRPL 2491
Cdd:pfam02469  100 ARVVQADIEATNGVIHVIDKVL 121
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
836-864 6.48e-06

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 44.90  E-value: 6.48e-06
                           10        20
                   ....*....|....*....|....*....
gi 1622846374  836 CHIHATCEYSSGTASCVCKAGYEGDGTVC 864
Cdd:pfam12947    8 CHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
FAS1 COG2335
Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function ...
2340-2491 8.14e-06

Uncharacterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 48.36  E-value: 8.14e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2340 VNCSCKVGYVGDGFSCSGNLLQVLMSFPSLTNfLTEVLAYSNssargraflehLTD-LSIRG--TLFVPQNS-----GLG 2411
Cdd:COG2335     14 AACASSAAAEGAAMAPTKNIVETAANNPDFST-LVAALKAAG-----------LVDtLSGEGpfTVFAPTDAafaalPAG 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622846374 2412 ENETLSG-------RDI-EHHLANvSMFFYNDLVNGTTLQTRLGSKLLITASHDPLQptetrfVDGRAILQWDIFASNGI 2483
Cdd:COG2335     82 TLDALLKpenkatlTKIlTYHVVP-GKVTAADLKDGKTLTTLQGQTLTVTVSGGGVT------VNGANVITADIEASNGV 154

                   ....*...
gi 1622846374 2484 IHIISRPL 2491
Cdd:COG2335    155 IHVIDKVL 162
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1562-1598 2.17e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 43.36  E-value: 2.17e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1562 CLENHGGCDKNAECTQTgPNQAACNCLPAYTGDGKVC 1598
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
2131-2166 8.23e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 41.82  E-value: 8.23e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1622846374 2131 CKQDNGGCAKVARCSQKGTKVSCSCQKGYKGDGLSC 2166
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
325-360 2.16e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 40.66  E-value: 2.16e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374  325 CKTNN-LCHRNANCTTVgPGQTQCMCRKGYVGDGLTC 360
Cdd:pfam12947    1 CSDNNgGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
245-274 4.78e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.50  E-value: 4.78e-04
                           10        20        30
                   ....*....|....*....|....*....|
gi 1622846374  245 CHPRAHCTYLgPNRHSCTCQEGYRGDGRVC 274
Cdd:pfam12947    8 CHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
875-907 8.79e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.73  E-value: 8.79e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622846374  875 PGGCSRNAECIKTGtGTHTCVCQQGWTGNGRDC 907
Cdd:pfam12947    5 NGGCHPNATCTNTG-GSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1002-1038 1.28e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 38.35  E-value: 1.28e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1622846374 1002 CLEQTRKCHPLANCQSTSSGVwSCVCQEGYEGDGFLC 1038
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGGSF-TCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1482-1514 6.48e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 6.48e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622846374 1482 NGMCHTSANCLpNSDGTASCKCAAGFQGNGTIC 1514
Cdd:pfam12947    5 NGGCHPNATCT-NTGGSFTCTCNDGYTGDGVTC 36
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
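
Each domain hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", and the search parameters state an E-value threshold of 0.01. As a minimal sketch, assuming the plain-text layout shown on this page, the following Python reads such a summary line and checks the hit against that threshold. The names `parse_summary` and `passes_threshold` are hypothetical helpers introduced here for illustration and are not part of any NCBI tool; whether CD-Search applies the cutoff inclusively is also an assumption.

```python
import re

# Pattern for the per-hit summary line as it appears in this report.
# The "[Multi-domain]" flag is optional and only present on some hits.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Check a parsed hit against an E-value cutoff (inclusive by assumption)."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.42  E-value: 6.48e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

The weakest hit listed on this page (EGF_3, E-value 6.48e-03) still falls under the 0.01 cutoff, which is consistent with every reported hit having passed the preset filter.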
