NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907100034|ref|XP_036014416|]
View 

stabilin-1 isoform X3 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
499-644 3.78e-27

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


:

Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 109.23  E-value: 3.78e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  499 PPPLPGDSKKTVGQILASTEVFTRFETILENCGLPSILDGPGPFTVFAPSNEAVDSLRDGRLIYLFTAG-LSKLQELVRY 577
Cdd:COG2335     22 AEGAAMAPTKNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPEnKATLTKILTY 101
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907100034  578 HIYNhGQLTVEKLISKGRVLTMANQVLTVNISeEGRILLGpeGIPVRRVDVPAANGVIHMLEGILLP 644
Cdd:COG2335    102 HVVP-GKVTAADLKDGKTLTTLQGQTLTVTVS-GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1000-1121 1.44e-18

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


:

Pssm-ID: 396845  Cd Length: 123  Bit Score: 83.07  E-value: 1.44e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034 1000 AHFSAFSQWFKNSSI--TLP-ADSRVTALVPSESAIRRLSLEDQAFWLQ-PKMLPELARAHFLQGAFSEEELArlNGQQV 1075
Cdd:pfam02469    1 PGFSTFVALLKAAGLvdTLNgSQGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLK--NGGTL 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907100034 1076 ATLsATTRWQIHNISGKVWVQNATVDVPDLLATNGILHIVSQVLLP 1121
Cdd:pfam02469   79 ATL-QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1460-1496 5.39e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.21  E-value: 5.39e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907100034 1460 CASGHGGCSPYANCTKVaPGQRTCTCQDGYTGDGELC 1496
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1502-1539 2.33e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.59  E-value: 2.33e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1907100034 1502 CLVHNGGCHVHAECIPTGPqQVSCSCREGYSGDGIqTC 1539
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGV-TC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
952-988 4.08e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.20  E-value: 4.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907100034  952 CRVGNGGCHGLATCKAVGGgQRVCTCPPHFGGDGFSC 988
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1545-1582 4.44e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 4.44e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1907100034 1545 CSQNNGGCSPYAVCKSTgDGQRTCSCDATHTvGDGITC 1582
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYT-GDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
915-946 1.34e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 1.34e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1907100034  915 GGCHADALCSYVgPGQSRCTCKLGFAGNGYEC 946
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 super family cl48154
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
831-860 5.00e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


The actual alignment was detected with superfamily member pfam12947:

Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.04  E-value: 5.00e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1907100034  831 PCHSDAHCVIQEGVARCVCHDGFEGNGFSC 860
Cdd:pfam12947    7 GCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
 
Name Accession Description Interval E-value
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
499-644 3.78e-27

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 109.23  E-value: 3.78e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  499 PPPLPGDSKKTVGQILASTEVFTRFETILENCGLPSILDGPGPFTVFAPSNEAVDSLRDGRLIYLFTAG-LSKLQELVRY 577
Cdd:COG2335     22 AEGAAMAPTKNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPEnKATLTKILTY 101
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907100034  578 HIYNhGQLTVEKLISKGRVLTMANQVLTVNISeEGRILLGpeGIPVRRVDVPAANGVIHMLEGILLP 644
Cdd:COG2335    102 HVVP-GKVTAADLKDGKTLTTLQGQTLTVTVS-GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
520-644 4.21e-21

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 90.39  E-value: 4.21e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  520 FTRFETILENCGLPSILDGP-GPFTVFAPSNEAVDSLRDGRLIYLFtAGLSKLQELVRYHIYNhGQLTVEKLISKGRVLT 598
Cdd:pfam02469    3 FSTFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLL-KDKEQLKNLLKYHVVP-GRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907100034  599 MANQVLTVNIsEEGRILLgpEGIPVRRVDVPAANGVIHMLEGILLP 644
Cdd:pfam02469   81 LQGSKLRVNV-TGGSVTV--NGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
543-645 1.42e-19

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 85.11  E-value: 1.42e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034   543 TVFAPSNEAVDSLRDGRliYLFTAglSKLQELVRYHIYNhGQLTVEKLISKGRVLTMANQVLTVNISEeGRILLGPEGIP 622
Cdd:smart00554    1 TVFAPTDEAFQKLPPDL--NSLLA--DKLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSG-GSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 1907100034   623 VRRVDVPAANGVIHMLEGILLPP 645
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1000-1121 1.44e-18

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 83.07  E-value: 1.44e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034 1000 AHFSAFSQWFKNSSI--TLP-ADSRVTALVPSESAIRRLSLEDQAFWLQ-PKMLPELARAHFLQGAFSEEELArlNGQQV 1075
Cdd:pfam02469    1 PGFSTFVALLKAAGLvdTLNgSQGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLK--NGGTL 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907100034 1076 ATLsATTRWQIHNISGKVWVQNATVDVPDLLATNGILHIVSQVLLP 1121
Cdd:pfam02469   79 ATL-QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1023-1122 7.28e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 68.93  E-value: 7.28e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  1023 TALVPSESAIRRLSLEDQAFWLQpkMLPELARAHFLQGAFSEEELarLNGQQVATLS-ATTRWQIHNISGKVWVQNATVD 1101
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLAD--KLKNLLLYHVVPGRLSSADL--LNGGTLPTLAgSKLRITRSGGSGTVTVNGARIV 76
                            90       100
                    ....*....|....*....|.
gi 1907100034  1102 VPDLLATNGILHIVSQVLLPP 1122
Cdd:smart00554   77 EADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
991-1121 9.64e-12

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 64.93  E-value: 9.64e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  991 DIIQELEANAHFSAFSQWFKNSSI--TLPADSRVTALVPSESAIRRLSLEDQAFWLQPKMLPELAR---AHFLQGAFSEE 1065
Cdd:COG2335     32 NIVETAANNPDFSTLVAALKAAGLvdTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLTKiltYHVVPGKVTAA 111
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907100034 1066 ELArlNGQQVATLSATTrWQIHNISGKVWVQNATVDVPDLLATNGILHIVSQVLLP 1121
Cdd:COG2335    112 DLK--DGKTLTTLQGQT-LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1460-1496 5.39e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.21  E-value: 5.39e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907100034 1460 CASGHGGCSPYANCTKVaPGQRTCTCQDGYTGDGELC 1496
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1502-1539 2.33e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.59  E-value: 2.33e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1907100034 1502 CLVHNGGCHVHAECIPTGPqQVSCSCREGYSGDGIqTC 1539
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGV-TC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
952-988 4.08e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.20  E-value: 4.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907100034  952 CRVGNGGCHGLATCKAVGGgQRVCTCPPHFGGDGFSC 988
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1545-1582 4.44e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 4.44e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1907100034 1545 CSQNNGGCSPYAVCKSTgDGQRTCSCDATHTvGDGITC 1582
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYT-GDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
915-946 1.34e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 1.34e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1907100034  915 GGCHADALCSYVgPGQSRCTCKLGFAGNGYEC 946
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
831-860 5.00e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.04  E-value: 5.00e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1907100034  831 PCHSDAHCVIQEGVARCVCHDGFEGNGFSC 860
Cdd:pfam12947    7 GCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
 
Name Accession Description Interval E-value
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
499-644 3.78e-27

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 109.23  E-value: 3.78e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  499 PPPLPGDSKKTVGQILASTEVFTRFETILENCGLPSILDGPGPFTVFAPSNEAVDSLRDGRLIYLFTAG-LSKLQELVRY 577
Cdd:COG2335     22 AEGAAMAPTKNIVETAANNPDFSTLVAALKAAGLVDTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPEnKATLTKILTY 101
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907100034  578 HIYNhGQLTVEKLISKGRVLTMANQVLTVNISeEGRILLGpeGIPVRRVDVPAANGVIHMLEGILLP 644
Cdd:COG2335    102 HVVP-GKVTAADLKDGKTLTTLQGQTLTVTVS-GGGVTVN--GANVITADIEASNGVIHVIDKVLLP 164
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
520-644 4.21e-21

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 90.39  E-value: 4.21e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  520 FTRFETILENCGLPSILDGP-GPFTVFAPSNEAVDSLRDGRLIYLFtAGLSKLQELVRYHIYNhGQLTVEKLISKGRVLT 598
Cdd:pfam02469    3 FSTFVALLKAAGLVDTLNGSqGPFTVFAPTNEAFAKLPAGTLNFLL-KDKEQLKNLLKYHVVP-GRLTSSDLKNGGTLAT 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907100034  599 MANQVLTVNIsEEGRILLgpEGIPVRRVDVPAANGVIHMLEGILLP 644
Cdd:pfam02469   81 LQGSKLRVNV-TGGSVTV--NGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
543-645 1.42e-19

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 85.11  E-value: 1.42e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034   543 TVFAPSNEAVDSLRDGRliYLFTAglSKLQELVRYHIYNhGQLTVEKLISKGRVLTMANQVLTVNISEeGRILLGPEGIP 622
Cdd:smart00554    1 TVFAPTDEAFQKLPPDL--NSLLA--DKLKNLLLYHVVP-GRLSSADLLNGGTLPTLAGSKLRITRSG-GSGTVTVNGAR 74
                            90       100
                    ....*....|....*....|...
gi 1907100034   623 VRRVDVPAANGVIHMLEGILLPP 645
Cdd:smart00554   75 IVEADIAATNGVVHVIDRVLLPP 97
Fasciclin pfam02469
Fasciclin domain; This extracellular domain is found repeated four times in grasshopper ...
1000-1121 1.44e-18

Fasciclin domain; This extracellular domain is found repeated four times in grasshopper fasciclin I as well as in proteins from mammals, sea urchins, plants, yeast and bacteria.


Pssm-ID: 396845  Cd Length: 123  Bit Score: 83.07  E-value: 1.44e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034 1000 AHFSAFSQWFKNSSI--TLP-ADSRVTALVPSESAIRRLSLEDQAFWLQ-PKMLPELARAHFLQGAFSEEELArlNGQQV 1075
Cdd:pfam02469    1 PGFSTFVALLKAAGLvdTLNgSQGPFTVFAPTNEAFAKLPAGTLNFLLKdKEQLKNLLKYHVVPGRLTSSDLK--NGGTL 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1907100034 1076 ATLsATTRWQIHNISGKVWVQNATVDVPDLLATNGILHIVSQVLLP 1121
Cdd:pfam02469   79 ATL-QGSKLRVNVTGGSVTVNGARVVQADIEATNGVIHVIDKVLLP 123
FAS1 smart00554
Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;
1023-1122 7.28e-14

Four repeated domains in the Fasciclin I family of proteins, present in many other contexts;


Pssm-ID: 214719  Cd Length: 97  Bit Score: 68.93  E-value: 7.28e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  1023 TALVPSESAIRRLSLEDQAFWLQpkMLPELARAHFLQGAFSEEELarLNGQQVATLS-ATTRWQIHNISGKVWVQNATVD 1101
Cdd:smart00554    1 TVFAPTDEAFQKLPPDLNSLLAD--KLKNLLLYHVVPGRLSSADL--LNGGTLPTLAgSKLRITRSGGSGTVTVNGARIV 76
                            90       100
                    ....*....|....*....|.
gi 1907100034  1102 VPDLLATNGILHIVSQVLLPP 1122
Cdd:smart00554   77 EADIAATNGVVHVIDRVLLPP 97
FAS1 COG2335
Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function ...
991-1121 9.64e-12

Uncaracterized surface protein containing fasciclin (FAS1) repeats [General function prediction only];


Pssm-ID: 441906 [Multi-domain]  Cd Length: 164  Bit Score: 64.93  E-value: 9.64e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907100034  991 DIIQELEANAHFSAFSQWFKNSSI--TLPADSRVTALVPSESAIRRLSLEDQAFWLQPKMLPELAR---AHFLQGAFSEE 1065
Cdd:COG2335     32 NIVETAANNPDFSTLVAALKAAGLvdTLSGEGPFTVFAPTDAAFAALPAGTLDALLKPENKATLTKiltYHVVPGKVTAA 111
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907100034 1066 ELArlNGQQVATLSATTrWQIHNISGKVWVQNATVDVPDLLATNGILHIVSQVLLP 1121
Cdd:COG2335    112 DLK--DGKTLTTLQGQT-LTVTVSGGGVTVNGANVITADIEASNGVIHVIDKVLLP 164
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1460-1496 5.39e-07

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 47.21  E-value: 5.39e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907100034 1460 CASGHGGCSPYANCTKVaPGQRTCTCQDGYTGDGELC 1496
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1502-1539 2.33e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.59  E-value: 2.33e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1907100034 1502 CLVHNGGCHVHAECIPTGPqQVSCSCREGYSGDGIqTC 1539
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGV-TC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
952-988 4.08e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.20  E-value: 4.08e-05
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1907100034  952 CRVGNGGCHGLATCKAVGGgQRVCTCPPHFGGDGFSC 988
Cdd:pfam12947    1 CSDNNGGCHPNATCTNTGG-SFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
1545-1582 4.44e-04

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 39.12  E-value: 4.44e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1907100034 1545 CSQNNGGCSPYAVCKSTgDGQRTCSCDATHTvGDGITC 1582
Cdd:pfam12947    1 CSDNNGGCHPNATCTNT-GGSFTCTCNDGYT-GDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
915-946 1.34e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 37.58  E-value: 1.34e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1907100034  915 GGCHADALCSYVgPGQSRCTCKLGFAGNGYEC 946
Cdd:pfam12947    6 GGCHPNATCTNT-GGSFTCTCNDGYTGDGVTC 36
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
831-860 5.00e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.04  E-value: 5.00e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1907100034  831 PCHSDAHCVIQEGVARCVCHDGFEGNGFSC 860
Cdd:pfam12947    7 GCHPNATCTNTGGSFTCTCNDGYTGDGVTC 36
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH