Conserved domains on  [gi|1907120180|ref|XP_036016071|]

protein HEG homolog 1 isoform X4 [Mus musculus]

Protein Classification

calcium-binding EGF-like domain-containing protein; wall-associated receptor kinase family protein (domain architecture ID 10043351)

Calcium-binding epidermal growth factor (EGF)-like domain-containing protein; may play a crucial role in numerous protein-protein interactions.

Wall-associated receptor kinase (WAK) family protein containing the calcium-binding EGF and serine/threonine kinase domains but lacking the WAK galacturonan-binding domain; catalyzes the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates, and may function as a signaling receptor for extracellular matrix components.


List of domain hits

Name Accession Description Interval E-value
EGF_CA smart00179 Calcium-binding EGF-like domain; 857-888 1.95e-10

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 56.87  E-value: 1.95e-10
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1907120180   857 DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQ 888
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYT 33
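
As a rough aid to reading an alignment block like the one above, the following Python sketch computes percent identity between the query row and the CDD row. It is an illustration only: the function name `percent_identity` is hypothetical, the two strings are copied from the smart00179 alignment, and treating lowercase letters as ordinary residues is an assumption about the display convention, not something this page states.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of gap-free alignment columns in which both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # column with a gap in either row is not counted
        aligned += 1
        # Assumption: case is ignored, so lowercase residues count like uppercase ones.
        if q.upper() == s.upper():
            identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


# Rows copied from the smart00179 hit (query residues 857-888).
query_row = "DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQ"
cdd_row = "DIDECASgNPCQNGGTCVNTVGSYRCECPPGYT"
print(f"identity: {percent_identity(query_row, cdd_row):.1f}%")
```

Percent identity is only a coarse summary; the bit score and E-value reported with each hit remain the figures to rely on when ranking hits.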
PHA03247 superfamily cl33720 large tegument protein UL36; Provisional 450-817 4.28e-10

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 4.28e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  450 PSQAQPKQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVNMPNTLVLDTGTKPVEDPSDSRVPSTQPSPSqPQPFSSAL 529
Cdd:PHA03247  2556 PPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDP-PPPSPSPA 2634
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  530 PSTRSPGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTlvPHRPREPRVTSvqmstaisaiALIPSNQTANPk 609
Cdd:PHA03247  2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSP--PQRPRRRAARP----------TVGSLTSLADP- 2701
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  610 nqstPQQEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPALTGFSTGPALPATSTSLAqmSPALTSAMPQTT--HSP 687
Cdd:PHA03247  2702 ----PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPA--RPPTTAGPPAPAppAAP 2775
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  688 VTSPSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAITTEGNREHTDPTTQPIPLTTSTTSAGERTTELGR 767
Cdd:PHA03247  2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGS 2855
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|
gi 1907120180  768 AEESSPSHFLTPSSPQTTDVSTAEMLTSRYITFAAQSTSQSPTALPPLTP 817
Cdd:PHA03247  2856 VAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
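
Each hit on this page carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch for pulling those fields out with a regular expression, assuming the layout seen on this page. The names `SUMMARY_RE` and `parse_summary` are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears on this page (assumed layout).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Return the numeric fields of one summary line as a dictionary."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


line = "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 64.57  E-value: 4.28e-10"
print(parse_summary(line))
```

Parsed this way, hits can be sorted or filtered by E-value without retyping the numbers by hand.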
EGF_CA cd00054 Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... 818-854 2.28e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.

Pssm-ID: 238011  Cd Length: 38  Bit Score: 45.32  E-value: 2.28e-06
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1907120180  818 VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQGENC 854
Cdd:cd00054      2 IDECaSGNPCQNGGTCV-NTVG-SYRCSCPPGYTGRNC 37
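
The cd00054 description mentions six conserved core cysteines. As a quick check against the query row of the 818-854 alignment, the Python sketch below lists the residue numbers of the cysteines in that segment. The helper name `cysteine_positions` is hypothetical, and the sketch only counts residues; it does not infer which cysteines pair into disulfide bridges.

```python
from typing import List


def cysteine_positions(aligned_segment: str, start: int) -> List[int]:
    """Residue numbers of cysteines in an aligned query row that begins at `start`."""
    positions = []
    residue_number = start
    for ch in aligned_segment:
        if ch == "-":
            continue  # a gap consumes no query residue
        if ch.upper() == "C":
            positions.append(residue_number)
        residue_number += 1
    return positions


# Query row copied from the cd00054 hit covering residues 818-854.
segment = "VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQGENC"
cys = cysteine_positions(segment, start=818)
print(len(cys), cys)  # 6 [821, 826, 832, 843, 845, 854]
```

The last cysteine falls on residue 854, the end of the reported interval, which is a useful sanity check that the gap handling keeps the numbering in step with the alignment.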
 
EGF_CA cd00054 Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... 857-889 1.79e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non-calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 54.18  E-value: 1.79e-09
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1907120180  857 DVNECLS-SPCPPLATCNNTQGSFTCRCPVGYQL 889
Cdd:cd00054      1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTG 34
EGF_CA pfam07645 Calcium-binding EGF domain; 857-886 2.01e-08


Pssm-ID: 429571  Cd Length: 32  Bit Score: 51.08  E-value: 2.01e-08
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1907120180  857 DVNECLSSP--CPPLATCNNTQGSFTCRCPVG 886
Cdd:pfam07645    1 DVDECATGThnCPANTVCVNTIGSFECRCPDG 32
EGF cd00053 Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ... 860-891 2.34e-06

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 45.16  E-value: 2.34e-06
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1907120180  860 EC-LSSPCPPLATCNNTQGSFTCRCPVGYQLEK 891
Cdd:cd00053      1 ECaASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
DUF5585 pfam17823 Family of unknown function (DUF5585); This is a family of unknown function found in Chordata. 456-792 6.04e-06


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 50.34  E-value: 6.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  456 KQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVnmPNTLVLDTGTKPVEDPSDSRVPSTQPSPSQPQPFSSALPSTRSP 535
Cdd:pfam17823   89 EHTPHGTDLSEPATREGAADGAASRALAAAASSS--PSSAAQSLPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAAS 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  536 GSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTLVPHRPREPRVTSVQMSTAISAIALIPSNQTANPKNQSTPQ 615
Cdd:pfam17823  167 APHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTVTAAVG 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  616 QEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPA-------LTGFSTGPALPATSTSLAQMS---PALTSAMPQTTH 685
Cdd:pfam17823  247 TVTPAALATLAAAAGTVASAAGTINMGDPHARRLSPAkhmpsdtMARNPAAPMGAQAQGPIIQVStdqPVHNTAGEPTPS 326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  686 SPVTSPSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAIttegnrEHTDPTTQPIPLTTSTTSAGE---RT 762
Cdd:pfam17823  327 PSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEV------EATSPTTQPSPLLPTQGAAGPgilLA 400
                          330       340       350
                   ....*....|....*....|....*....|
gi 1907120180  763 TELGRAEESSPSHFLTPSSPQTTDVSTAEM 792
Cdd:pfam17823  401 PEQVATEATAGTASAGPTPRSSGDPKTLAM 430
PHA03247 PHA03247 large tegument protein UL36; Provisional 463-795 7.32e-06


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 7.32e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  463 DDDEPAQSSTESP--VLHTSNLPTYTSTVNMPNTLVLDTGTKPVEDPSDSRVPSTQPSPSqPQPFSSALP------STRS 534
Cdd:PHA03247  2652 PRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPA-PHALVSATPlppgpaAARQ 2730
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  535 PGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTF--PHPSSTLVPHRPREPRVTSVQMSTAISAIALIPSNQTANPKNQS 612
Cdd:PHA03247  2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAgpPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA 2810
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  613 TPQQEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPALTGFSTGPALPATSTSLAQMSPAL--TSAMPQTTHSPVTS 690
Cdd:PHA03247  2811 VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKpaAPARPPVRRLARPA 2890
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  691 PSTLSHVEALTSGAVVVHTTPKKPHLPTNPEILVPHISTEGAITTEGnREHTDPTTQPIPLTTSTTSAGERTTELG---- 766
Cdd:PHA03247  2891 VSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPP-RPQPPLAPTTDPAGAGEPSGAVPQPWLGalvp 2969
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 1907120180  767 --------RAEESSPSHFLTPSSPQTTDVSTAEMLTS 795
Cdd:PHA03247  2970 grvavprfRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
EGF pfam00008 EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ... 821-853 1.08e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model misses the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot and Pfam domain start/end positions). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.14  E-value: 1.08e-05
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1907120180  821 CTVNPCLHDGKCIVdlTGRGYRCVCPPAWQGEN 853
Cdd:pfam00008    1 CAPNPCSNGGTCVD--TPGGYTCICPEGYTGKR 31
EGF_3 pfam12947 EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ... 865-888 1.49e-05

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 42.97  E-value: 1.49e-05
                           10        20
                   ....*....|....*....|....
gi 1907120180  865 PCPPLATCNNTQGSFTCRCPVGYQ 888
Cdd:pfam12947    7 GCHPNATCTNTGGSFTCTCNDGYT 30
Herpes_BLLF1 pfam05109 Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... 561-811 1.67e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 49.14  E-value: 1.67e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  561 YSVSQTTFPHPSSTLVPHRPREPRVTSVQMSTA-ISAIALIPSNQTANPKNQStPQQEKPITEAKSPSLVSPptdsTKAV 639
Cdd:pfam05109  439 FAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdVTSPTPAGTTSGASPVTPS-PSPRDNGTESKAPDMTSP----TSAV 513
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  640 TVSLPPGAPWSPALTG---FSTGPALPATSTSLA---------QMSPALTSAMPQTTHSPVTSPSTLSHVEALTSGAVVV 707
Cdd:pfam05109  514 TTPTPNATSPTPAVTTptpNATSPTLGKTSPTSAvttptpnatSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSP 593
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  708 HTTPKKPHLPTNPEILVPHISTEGAITTEGNREHTDPTTQPIPLTTSTTSAGERttelgraeessPSHFLTPSSPQTTDV 787
Cdd:pfam05109  594 TVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLR-----------PSSISETLSPSTSDN 662
                          250       260
                   ....*....|....*....|....*.
gi 1907120180  788 STAEM--LTSRYITFAAQSTSQSPTA 811
Cdd:pfam05109  663 STSHMplLTSAHPTGGENITQVTPAS 688
EGF_CA smart00179 Calcium-binding EGF-like domain; 818-854 1.79e-04


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 39.92  E-value: 1.79e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1907120180   818 VNSC-TVNPCLHDGKCIvDLTGrGYRCVCPPAWQ-GENC 854
Cdd:smart00179    2 IDECaSGNPCQNGGTCV-NTVG-SYRCECPPGYTdGRNC 38
EGF smart00181 Epidermal growth factor-like domain; 860-891 1.98e-04


Pssm-ID: 214544  Cd Length: 35  Bit Score: 39.81  E-value: 1.98e-04
                            10        20        30
                    ....*....|....*....|....*....|...
gi 1907120180   860 ECLS-SPCPPlATCNNTQGSFTCRCPVGYQLEK 891
Cdd:smart00181    1 ECASgGPCSN-GTCINTPGSYTCSCPPGYTGDK 32
EGF cd00053 Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ... 824-854 5.78e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 38.61  E-value: 5.78e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1907120180  824 NPCLHDGKCIVdlTGRGYRCVCPPAWQGE-NC 854
Cdd:cd00053      6 NPCSNGGTCVN--TPGSYRCVCPPGYTGDrSC 35
EB pfam01683 EB module; This domain has no known function. It is found in several C. elegans proteins. The ... 821-894 5.84e-04

EB module; This domain has no known function. It is found in several C. elegans proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges. This domain is found associated with Kunitz domains (pfam00014).


Pssm-ID: 460294  Cd Length: 52  Bit Score: 38.95  E-value: 5.84e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907120180  821 CTVNPCLHDGKCIvdltgrgyrcvcPPAWQGENCSVDVNeclsspCPPLATCNNTqgsfTCRCPVGYQLEKGIC 894
Cdd:pfam01683    1 CPPGQVLVNGQCV------------PKVAPGESCEADEQ------CPGGSVCVNG----VCQCPPGFTPVNGRC 52
EGF pfam00008 EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ... 861-887 6.80e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain model misses the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot and Pfam domain start/end positions). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 38.13  E-value: 6.80e-04
                           10        20
                   ....*....|....*....|....*..
gi 1907120180  861 CLSSPCPPLATCNNTQGSFTCRCPVGY 887
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGY 27
PLN03209 PLN03209 translocon at the inner envelope of chloroplast subunit 62; Provisional 510-755 1.94e-03


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 42.22  E-value: 1.94e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  510 SRVPSTQPSPSQPQPFS--SALPSTRSPGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTLVPHRPREPRVTS 587
Cdd:PLN03209   321 AKIPSQRVPPKESDAADgpKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLKPPTSPIPTPPSSSPASSKS 400
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  588 VQMSTAISAIALIPSNQTANPKNQSTP-----QQEKPIT------EAKSPSLVSPPTDSTKAVTVSLPPGAPWSPaltgf 656
Cdd:PLN03209   401 VDAVAKPAEPDVVPSPGSASNVPEVEPaqveaKKTRPLSpyaryeDLKPPTSPSPTAPTGVSPSVSSTSSVPAVP----- 475
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  657 STGPALPATSTSL---AQMSPaLTSAMPQTTHSPVTSPSTLShvealtsgavvvhTTPKKPHLPTNPEILVPHISTEGAI 733
Cdd:PLN03209   476 DTAPATAATDAAApppANMRP-LSPYAVYDDLKPPTSPSPAA-------------PVGKVAPSSTNEVVKVGNSAPPTAL 541
                          250       260
                   ....*....|....*....|..
gi 1907120180  734 TTEGNreHTDPttQPIPLTTST 755
Cdd:PLN03209   542 ADEQH--HAQP--KPRPLSPYT 559
PHA03247 PHA03247
large tegument protein UL36; Provisional
306-812 4.52e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 4.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  306 APLREHSPPEDGAMLSDSSDLADSTSGARTPH------TSAMSTRSGERTLRSLDLSSAATRPARPTPRGNVTEHAGLLS 379
Cdd:PHA03247  2678 SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEpaphalVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARP 2757
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  380 GAPTLGVTGLSYTREHGSDAGqrtssdhtdhgyvpstftkgertllsitdntsyseasesstssvkisdsPSQAQPKQSS 459
Cdd:PHA03247  2758 ARPPTTAGPPAPAPPAAPAAG-------------------------------------------------PPRRLTRPAV 2788
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  460 MSSDDDEPAQSSTESPVLHTSNLPTYTSTVNMPNTlvldtgTKPVEDPSDSRVPSTQPSPSQPQPFSSAL-----PSTRS 534
Cdd:PHA03247  2789 ASLSESRESLPSPWDPADPPAAVLAPAAALPPAAS------PAGPLPPPTSAQPTAPPPPPGPPPPSLPLggsvaPGGDV 2862
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  535 PGSTSETTTSSPSPSPISLLVSTLAPYSVSQTTFPHPSSTLVPHRPREPrvtsvqmstaisaiALIPSNQTANPKNQSTP 614
Cdd:PHA03247  2863 RRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQP--------------QAPPPPQPQPQPPPPPQ 2928
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  615 QQEKPITEAKSPSLVSPPTDSTKAVTVSLPPGAPWSPALtgfsTGPALPATSTSLAQMSPaltsamPQTTHSPVTSPSTL 694
Cdd:PHA03247  2929 PQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGAL----VPGRVAVPRFRVPQPAP------SREAPASSTPPLTG 2998
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  695 SHVEALTSGA--VVVHTTPKKP--------HLPTNPE--------ILVPHISTEGAITTEGNREHTDPTTQPIPlttSTT 756
Cdd:PHA03247  2999 HSLSRVSSWAssLALHEETDPPpvslkqtlWPPDDTEdsdadslfDSDSERSDLEALDPLPPEPHDPFAHEPDP---ATP 3075
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1907120180  757 SAGERttelgraeESSPSHFLTPSspqttdVSTAEMLTSRYItfaaQSTSQSPTAL 812
Cdd:PHA03247  3076 EAGAR--------ESPSSQFGPPP------LSANAALSRRYV----RSTGRSALAV 3113
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
449-813 5.42e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.06  E-value: 5.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  449 SPSQAQPKQSSMSSDDDEPAQSSTESPVLHTSNLPTYTSTVNMPNTLVLDTGTKPVEDPSDSrvpSTQPSPSQPQPFSSA 528
Cdd:pfam05109  460 APASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNAT---SPTPAVTTPTPNATS 536
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  529 LPSTRSPGSTSetttsspspspisllVSTLAPYSVSQT---TFPHPSSTlvphrpreprVTSVQMSTAISAIALIPSNQT 605
Cdd:pfam05109  537 PTLGKTSPTSA---------------VTTPTPNATSPTpavTTPTPNAT----------IPTLGKTSPTSAVTTPTPNAT 591
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  606 ANPKNQSTPQQEKPI----TEAKSPSLVSPPTDSTKAVT-------------VSLPPGA---PWSPALTGFSTG------ 659
Cdd:pfam05109  592 SPTVGETSPQANTTNhtlgGTSSTPVVTSPPKNATSAVTtgqhnitssstssMSLRPSSiseTLSPSTSDNSTShmpllt 671
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  660 PALPATSTSLAQMSPALTSAMPQTTHSPVTSPSTLSHVEALTSGAV--------VVHTTPKK----PHLPTNPEILVPHI 727
Cdd:pfam05109  672 SAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTstkpgevnVTKGTPPKnatsPQAPSGQKTAVPTV 751
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907120180  728 -STEGAITTEGNREHTDPTTQPIPLTTSTTSAGERTTELGRAEESSpshFLTPSSpqTTDVSTAEMLTSRYITFAAQSTS 806
Cdd:pfam05109  752 tSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGDSTTPRTRYNATT---YLPPST--SSKLRPRWTFTSPPVTTAQATVP 826

                   ....*..
gi 1907120180  807 QSPTALP 813
Cdd:pfam05109  827 VPPTSQP 833
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
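
Each hit on this page carries a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, and the search used an E-value threshold of 0.01. The Python sketch below shows one way to parse those summary lines from saved page text and check them against that threshold. The regular expression, the `parse_summary` helper, and the dictionary keys are assumptions made for this illustration; they do not correspond to an official NCBI parser or output format.

```python
import re

E_VALUE_THRESHOLD = 0.01  # value listed under "Preset Options"

# Pattern written for the summary lines as they appear in this page text.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s+"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s+"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line: str):
    """Parse one hit summary line; return None if the line does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm": int(m.group("pssm")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("length")),
        "bit_score": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }


# Summary lines copied from the hits listed above.
lines = [
    "Pssm-ID: 394967  Cd Length: 31  Bit Score: 38.13  E-value: 6.80e-04",
    "Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 42.22  E-value: 1.94e-03",
    "Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.46  E-value: 4.52e-03",
    "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.06  E-value: 5.42e-03",
]

hits = [h for h in (parse_summary(line) for line in lines) if h is not None]
kept = [h for h in hits if h["evalue"] <= E_VALUE_THRESHOLD]

for h in kept:
    print(h["pssm"], h["bit_score"], h["evalue"])
```

All four hits shown in this part of the page fall under the 0.01 threshold, which is consistent with their being reported at all.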

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.