Conserved domains on  [gi|1720386233|ref|XP_030104769|]

synaptojanin-1 isoform X32 [Mus musculus]

Protein Classification

RNA-binding protein (domain architecture ID 13429226)

RNA-binding protein containing an RNA recognition motif (RRM)


List of domain hits

Name Accession Description Interval E-value
INPP5c_Synj1 cd09098
680-1015 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This subfamily contains the INPP5c domains of human synaptojanin 1 (Synj1) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily, which contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj1 occurs as two main isoforms: a brain-enriched 145 kDa protein (Synj1-145) and a ubiquitously expressed 170 kDa protein (Synj1-170). Synj1-145 participates in clathrin-mediated endocytosis. The primary substrate of the Synj1-145 INPP5c domain is PI(4,5)P2, which it converts to PI4P. Synj1-145 may work with membrane curvature sensors/generators (such as endophilin) to remove PI(4,5)P2 from curved membranes. Recruitment of the INPP5c domain of Synj1-145 to endophilin-induced membranes leads to fragmentation and condensation of these structures. The PI(4,5)P2 to PI4P conversion may cooperate with dynamin to produce membrane fission. In addition to this INPP5c domain, these proteins contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro.

Pssm-ID: 197332  Cd Length: 336  Bit Score: 766.12  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 759
Cdd:cd09098      1 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKKAGIPEFQDVRSKPVDIFAIGFEEMVELNAGNIVSASTTNQKLWA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  760 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 839
Cdd:cd09098     81 AELQKTISRDQKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  840 GQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRG 919
Cdd:cd09098    161 GQSQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDIPNEEVKELIRQQNWDSLIAGDQLINQKNAGQVFRG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  920 FLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRA 999
Cdd:cd09098    241 FLEGKLDFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFPDNSKEQYTWSPGTLLHYGRA 320
                          330
                   ....*....|....*.
gi 1720386233 1000 ELKTSDHRPVVALIDI 1015
Cdd:cd09098    321 ELKTSDHRPVVALIDI 336
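
Each hit carries a summary line (Pssm-ID, Cd Length, Bit Score, E-value) with a regular layout, so it can be read into a record for downstream filtering. The following is a minimal Python sketch, assuming the line layout seen in this report; the function name parse_pssm_summary and the dictionary keys are illustrative and are not part of any NCBI tool.

```python
import re

# Matches summary lines such as:
#   "Pssm-ID: 197332  Cd Length: 336  Bit Score: 766.12  E-value: 0e+00"
# The optional "[Multi-domain]" tag is recorded as a boolean.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_pssm_summary(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),  # "0e+00" parses to 0.0
    }
```

An E-value printed as 0e+00 becomes 0.0, so a filter such as e_value <= 1e-5 keeps the strongest hits without special handling.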
COG5329 super family cl34984
Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];
204-627 6.52e-95

The actual alignment was detected with superfamily member COG5329:

Pssm-ID: 227637 [Multi-domain]  Cd Length: 570  Bit Score: 317.79  E-value: 6.52e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  204 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASDEDRIS---------EVRKVLNSGNFYF 274
Cdd:COG5329     61 YGVIGLIKLKGD----IYLIVITGASLVGVIPGHSIYKILDVDFISLNNNKWDDELEEdeanydklsELKKLLSNGTFYF 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  275 A--WSASGvSLDLSLNAHRSMQEHTTDNRFFWNQSL------HLHLKHYGVNCDD-WLLRLMCGGVEIRTIYAAHKQAKA 345
Cdd:COG5329    137 SydFDITN-SLQKNLSEGLEASVDRADLIFMWNSFLleefinHRSKLSSLEKQFDnFLTTVIRGFAETVDIKVGGNTISL 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  346 CLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDCVSSFIQIRGSVPLFWEQPGLQVGShRVRMSRGFEANAPA 425
Cdd:COG5329    216 TLISRRSSERAGTRYLSRGIDDDGNVSNFVETEQIVTDSQYIFSFTQVRGSIPLFWEQSNLLYGP-KIKVTRSSEAAQSA 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  426 FDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKASEHAsDIHMVSFDYHQMVKGGKAEKLHSILKPQVQKFLDY 505
Cdd:COG5329    295 FDKHFDKLREKYGDVYVVNLLKTKGYEAPLLELYEKHLDLSKKP-KIHYTEFDFHKETSQDGFDDVKKLLYLIEQDLLEF 373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  506 GFFYFDGSEVQRC--QSGTVRTNCLDCLDRTNSVQAFLGLEMLAKQLEALGLAEKpqlVTRFQEVFRSMWSVNGDSISKI 583
Cdd:COG5329    374 GYFAYDINEGKSIseQDGVFRTNCLDCLDRTNVIQSLISRVLLEQFRSEGVISDG---YSPFLQIHRELWADNGDAISRL 450
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720386233  584 YAGTGALEGKAK-------AGKLKDGARSVTRTIQNNFFDSSKQEAIDVLL 627
Cdd:COG5329    451 YTGTGALKSSFTrrgrrsfAGALNDFIKSFSRYYINNFTDGQRQDAIDLLL 501
DUF1866 pfam08952
1014-1155 2.22e-63

Domain of unknown function (DUF1866); This domain, found in Synaptojanin, has no known function.



Pssm-ID: 286093  Cd Length: 146  Bit Score: 211.98  E-value: 2.22e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1014 DIDIFEVEAEERQKIYKEVIAVQGPPDGTVLVSIKS-SAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEG 1092
Cdd:pfam08952    1 DVEIQEVDPEARRRVFKEVIRDQGPPDGTIVVSLCSgDLDEKNIFDENLMDELIQELTSFGEVTLVRFVEDTMWVTFRDG 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720386233 1093 SSALNALSLNGKELLNRTITITLKSPDWIKHLEEEM---SLEKISVTlpSSASSTLLGEDAEVAAD 1155
Cdd:pfam08952   81 HSALNALSKDGMKVCGRALKIRLKSKDWIKGLEEEIilcTDNTIPVS--PCANSTLLAEDFDFGSP 144
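
The paired query/model rows in these alignments can be compared column by column to estimate how closely the query follows the domain model. The sketch below computes percent identity over ungapped columns; the function name percent_identity is hypothetical. It also treats lowercase letters as ordinary residues, which is an assumption: the report may use lowercase to mark residues outside the aligned blocks, in which case those columns should be excluded first.

```python
def percent_identity(query, subject):
    """Percent identity over columns where neither row contains a gap ('-').

    Both rows must already be aligned, i.e. have the same length.
    Letter case is ignored (an assumption, see the note above the code).
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Keep only the columns where both rows carry a residue.
    columns = [
        (q, s)
        for q, s in zip(query.upper(), subject.upper())
        if q != "-" and s != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(1 for q, s in columns if q == s)
    return 100.0 * matches / len(columns)
```

To score a whole hit, concatenate the sequence portions of all its query rows and all its model rows before calling the function, so that every block contributes to one figure.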
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1201-1397 6.25e-08

The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 6.25e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1201 PTVPEYSAPSLPIRPSRAPSRTPgPPSSQGSPVDTQPAAQKDSS--QTLEPKRPPPPRPVAPPARPAPPQRPPPPSGARS 1278
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAP-HALVSATPLPPGPAAARQASpaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1279 PAPARKEFGGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKP 1358
Cdd:PHA03247  2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720386233 1359 QAAF--------------PQQPSLPTPAQKLQDPLVPIAAPTMPPSG---PQPNLE 1397
Cdd:PHA03247  2849 SLPLggsvapggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTesfALPPDQ 2904
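
The Interval column gives inclusive residue ranges on the query, so neighbouring hits can be checked for overlap with simple arithmetic. This is a small sketch under that assumption; the helper names parse_interval and overlap_length are illustrative only.

```python
def parse_interval(text):
    """Turn an interval string such as '680-1015' into a (start, end) tuple of ints."""
    start, end = text.split("-")
    return int(start), int(end)


def overlap_length(a, b):
    """Number of residues shared by two inclusive (start, end) intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    # A negative span means the intervals do not touch.
    return max(0, end - start + 1)


inpp5c = parse_interval("680-1015")
duf1866 = parse_interval("1014-1155")
shared = overlap_length(inpp5c, duf1866)
```

For the INPP5c_Synj1 (680-1015) and DUF1866 (1014-1155) hits listed above, shared is 2, meaning the two hits abut and share only residues 1014 and 1015.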
 
IPPc smart00128
678-1018 3.38e-128

Inositol polyphosphate phosphatase, catalytic domain homologues; Mg(2+)-dependent/Li(+)-sensitive enzymes.


Pssm-ID: 214525 [Multi-domain]  Cd Length: 306  Bit Score: 399.04  E-value: 3.38e-128
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   678 KKIRVCVGTWNVNGGKqfrsiaFKNQTLTDWLLdapklagiQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKL 757
Cdd:smart00128    1 RDIKVLIGTWNVGGLE------SPKVDVTSWLF--------QKIEVKQSEKPDIYVIGLQEVVGLAPGVILETIAGKERL 66
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   758 WAVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 837
Cdd:smart00128   67 WSDLLESSLNGDGQYNVLAKVYLVGILVLVFVKANHLVYIKDVETFTVKTGMGGLWGNKGAVAVRFKLSDTSFCFVNSHL 146
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   838 AAGQSQVKERNEDFVEIARKLSFPMGRML--FSHDYVFWCGDFNYRIDLP-NEEVKELIRQQNWDSLIAGDQLINQKNAG 914
Cdd:smart00128  147 AAGASNVEQRNQDYKTILRALSFPERALLsqFDHDVVFWFGDLNFRLDSPsYEEVRRKISKKEFDDLLEKDQLNRQREAG 226
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   915 QIFRGFLEGKVTFAPTYKYDLF-SEDYDTSEKCRTPAWTDRVLWRrrkwpfdRSAEDLDLLNAsfqdeskilytwtpgtl 993
Cdd:smart00128  227 KVFKGFQEGPITFPPTYKYDSVgTETYDTSEKKRVPAWCDRILYR-------SNGPELIQLSE----------------- 282
                           330       340
                    ....*....|....*....|....*
gi 1720386233   994 lHYGRAELKTSDHRPVVALIDIDIF 1018
Cdd:smart00128  283 -YHSGMEITTSDHKPVFATFRLKVT 306
Syja_N pfam02383
204-484 2.38e-86

SacI homology domain; This Pfam family represents a protein domain that shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminus of the inositol 5'-phosphatase synaptojanin.


Pssm-ID: 460545  Cd Length: 295  Bit Score: 283.69  E-value: 2.38e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  204 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASD----------EDRI-SEVRKVLNSGNF 272
Cdd:pfam02383    1 YGILGLIRLLSG----YYLIVITKREQVGQIGGHPIYKITDVEFIPLNSSLSDtqlakkehpdEERLlKLLKLFLSSGSF 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  273 YFAWSasgvsLDLSlnahRSMQEHTT----------DNRFFWNQSLHLHLKHYGVNCDDWLLRLMCGGVEIRTIYAAHKQ 342
Cdd:pfam02383   77 YFSYD-----YDLT----NSLQRNLTrsrspsfdslDDRFFWNRHLLKPLIDFQLDLDRWILPLIQGFVEQGKLSVFGRS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  343 AKACLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDC-----VSSFIQIRGSVPLFWEQPGLQVGSHRVRMSR 417
Cdd:pfam02383  148 VTLTLISRRSRKRAGTRYLRRGIDDDGNVANFVETEQIVSLNTSnsegkIFSFVQIRGSIPLFWSQDPNLKYKPKIQITR 227
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720386233  418 gFEANAPAFDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKAS--EHASDIHMVSFDYHQMVK 484
Cdd:pfam02383  228 -PEATQPAFKKHFDDLIERYGPVHIVNLVEKKGRESKLSEAYEEAVKYLnqFLPDKLRYTAFDFHHECK 295
COG5411 COG5411
Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms];
673-1041 6.90e-70

Pssm-ID: 227698 [Multi-domain]  Cd Length: 460  Bit Score: 242.77  E-value: 6.90e-70
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  673 KYSKPKKIRVCVGTWNVNGgkqfrsiafKNQT--LTDWLLdaPklagiqefQDKRSKPTDIFAIGFEEMVELNAGNIVNA 750
Cdd:COG5411     23 KYVIEKDVSIFVSTFNPPG---------KPPKasTKRWLF--P--------EIEATELADLYVVGLQEVVELTPGSILSA 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  751 STtNQKL--W---AVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLF 825
Cdd:COG5411     84 DP-YDRLriWeskVLDCLNGAQSDEKYSLLRSPQLGGILLRVFSLATNLPVVKPVSGTVKKTGFGGSSSNKGAVAIRFNY 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  826 HTTSLCFVCSHFAAGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNW--DSLIA 903
Cdd:COG5411    163 ERTSFCFVNSHLAAGVNNIEERIFDYRSIASNICFSRGLRIYDHDTIFWLGDLNYRVTSTNEEVRPEIASDDGrlDKLFE 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  904 GDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRrkwpfdrsaedldllnasfqdesk 983
Cdd:COG5411    243 YDQLLWEMEVGNVFPGFKEPVITFPPTYKFDYGTDEYDTSDKGRIPSWTDRILYKS------------------------ 298
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720386233  984 ilYTWTPGTllhYGRAE-LKTSDHRPVVALIDIDIFEVEAEERQKIYKEVIA--VQGPPDG 1041
Cdd:COG5411    299 --EQLTPHS---YSSIPhLMISDHRPVYATFRAKIKVVDPSKKEGLIEKLYAeyKTELGEA 354
PLN03191 PLN03191
Type I inositol-1,4,5-trisphosphate 5-phosphatase 2; Provisional
771-1030 5.71e-50

Pssm-ID: 215624 [Multi-domain]  Cd Length: 621  Bit Score: 188.58  E-value: 5.71e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  771 KYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKE--RN 848
Cdd:PLN03191   363 KYVRIVSKQMVGIYVSVWVRKRLRRHINNLKVSPVGVGLMGYMGNKGSVSISMSLFQSRLCFVCSHLTSGHKDGAEqrRN 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  849 EDFVEIARKLSFP------MGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRGFLE 922
Cdd:PLN03191   443 ADVYEIIRRTRFSsvldtdQPQTIPSHDQIFWFGDLNYRLNMLDTEVRKLVAQKRWDELINSDQLIKELRSGHVFDGWKE 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  923 GKVTFAPTYKYDLFSEDY-----DTSEKCRTPAWTDRVLWrrrkwpfdrsaedldlLNASFQDESkilytwtpgtllhYG 997
Cdd:PLN03191   523 GPIKFPPTYKYEINSDRYvgenpKEGEKKRSPAWCDRILW----------------LGKGIKQLC-------------YK 573
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1720386233  998 RAELKTSDHRPVVALIDIdifEVEAEERQKIYK 1030
Cdd:PLN03191   574 RSEIRLSDHRPVSSMFLV---EVEVFDHRKLQR 603
RRM_SYNJ1 cd12719
1040-1116 2.02e-39

RNA recognition motif (RRM) found in synaptojanin-1 and similar proteins; This subgroup corresponds to the RRM of synaptojanin-1, also termed synaptojanin, or synaptic inositol-1,4,5-trisphosphate 5-phosphatase 1, originally identified as one of the major Grb2-binding proteins that may participate in synaptic vesicle endocytosis. It also acts as a Src homology 3 (SH3) domain-binding brain-specific inositol 5-phosphatase with a putative role in clathrin-mediated endocytosis. Synaptojanin-1 contains an N-terminal domain homologous to the cytoplasmic portion of the yeast protein Sac1p, a central inositol 5-phosphatase domain followed by a putative RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal proline-rich region mediating the binding of synaptojanin-1 to various SH3 domain-containing proteins including amphiphysin, SH3p4, SH3p8, SH3p13, and Grb2. Synaptojanin-1 has two tissue-specific alternative splicing isoforms, synaptojanin-145 expressed in brain and synaptojanin-170 expressed in peripheral tissues. Synaptojanin-145 is very abundant in nerve terminals and may play an essential role in the clathrin-mediated endocytosis of synaptic vesicles. In contrast to synaptojanin-145, synaptojanin-170 contains three unique asparagine-proline-phenylalanine (NPF) motifs in the C-terminal region and may function as a binding partner for Eps15, a clathrin coat-associated protein acting as a major substrate for the tyrosine kinase activity of the epidermal growth factor receptor.


Pssm-ID: 410118  Cd Length: 77  Bit Score: 141.00  E-value: 2.02e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720386233 1040 DGTVLVSIKSSAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITITLK 1116
Cdd:cd12719      1 DGTVVVSVLSSSPEPNYFDDNLIDALLQQFSSFGEVILIRFVEDKMWVTFLEGSSALAALSLNGTEVLGRTIIISLK 77
Atrophin-1 pfam03154
1157-1394 1.23e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.23  E-value: 1.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1157 DMEGDVDDySAEVEELLPQHLQPSSSSGLGTSPSSSPRTSPCQSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSP---V 1233
Cdd:pfam03154  153 DNESDSDS-SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtlI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1234 DTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGARSPAPARKEFGGRNQPSPQAGLAGPGPAGYGAARPTI 1313
Cdd:pfam03154  232 QQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1314 PARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEP-LKPQAAFPqQPSLPTPAQKLQDPLVPIAAP-TMPPSG 1391
Cdd:pfam03154  312 GPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhIKPPPTTP-IPQLPNPQSHKHPPHLSGPSPfQMNSNL 390

                   ...
gi 1720386233 1392 PQP 1394
Cdd:pfam03154  391 PPP 393
PspC_subgroup_2 NF033839
1199-1394 6.16e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1199 QSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQrppppsgarS 1278
Cdd:NF033839   285 KEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEV---------K 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1279 PAPARkefggrnqPSPQaglAGPGPAGygaARPTIPARAGvisAPQSQARVCAGRPTPDSQSKPsETLKgPAVLPEPLKP 1358
Cdd:NF033839   356 PQPEK--------PKPE---VKPQPEK---PKPEVKPQPE---TPKPEVKPQPEKPKPEVKPQP-EKPK-PEVKPQPEKP 416
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720386233 1359 QAAFPQQPSLPTPAQKLQdPLVPIAAPTMPPSGPQP 1394
Cdd:NF033839   417 KPEVKPQPEKPKPEVKPQ-PEKPKPEVKPQPEKPKP 451
RRM smart00360
RNA recognition motif;
1058-1113 1.60e-03

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 38.34  E-value: 1.60e-03
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720386233  1058 DDALIDELLRQFAHFGEVILIRFVEDKMW--------VTFLEGSSALNALS-LNGKELLNRTITI 1113
Cdd:smart00360    9 PDTTEEELRELFSKFGKVESVRLVRDKETgkskgfafVEFESEEDAEKALEaLNGKELDGRPLKV 73
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1297-1397 2.70e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 42.06  E-value: 2.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1297 GLAGPGPAGYGAARPTIPaRAGVISAPQSQARVCAGRP-TPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPTPAQKL 1375
Cdd:NF033839   278 GLTQDTPKEPGNKKPSAP-KPGMQPSPQPEKKEVKPEPeTPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKP 356
                           90       100
                   ....*....|....*....|..
gi 1720386233 1376 QdPLVPiaAPTMPPSGPQPNLE 1397
Cdd:NF033839   357 Q-PEKP--KPEVKPQPEKPKPE 375
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1311-1396 7.90e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Mutations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 38.62  E-value: 7.90e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  1311 PTIPARAGVIsaPQSQARVCAGRP--TPDSQSKPSetLKGPAvlPEPLKPQAAFPQQPSLPTPAQKLQDPLVPI----AA 1384
Cdd:smart00818   59 PVLPAQQPVV--PQQPLMPVPGQHsmTPTQHHQPN--LPQPA--QQPFQPQPLQPPQPQQPMQPQPPVHPIPPLppqpPL 132
                            90
                    ....*....|..
gi 1720386233  1385 PTMPPSGPQPNL 1396
Cdd:smart00818  133 PPMFPMQPLPPL 144
 
Name Accession Description Interval E-value
INPP5c_Synj1 cd09098
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This ...
680-1015 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 1; This subfamily contains the INPP5c domains of human synaptojanin 1 (Synj1) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj1 occurs as two main isoforms: a brain enriched 145 KDa protein (Synj1-145) and a ubiquitously expressed 170KDa protein (Synj1-170). Synj1-145 participates in clathrin-mediated endocytosis. The primary substrate of the Synj1-145 INPP5c domain is PI(4,5)P2, which it converts to PI4P. Synj1-145 may work with membrane curvature sensors/generators (such as endophilin) to remove PI(4,5)P2 from curved membranes. The recruitment of the INPP5c domain of Synj1-145 to endophilin-induced membranes leads to a fragmentation and condensation of these structures. The PI(4,5)P2 to PI4P conversion may cooperate with dynamin to produce membrane fission. In addition to this INPP5c domain, these proteins contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro.


Pssm-ID: 197332  Cd Length: 336  Bit Score: 766.12  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 759
Cdd:cd09098      1 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKKAGIPEFQDVRSKPVDIFAIGFEEMVELNAGNIVSASTTNQKLWA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  760 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 839
Cdd:cd09098     81 AELQKTISRDQKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  840 GQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRG 919
Cdd:cd09098    161 GQSQVKERNEDFIEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDIPNEEVKELIRQQNWDSLIAGDQLINQKNAGQVFRG 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  920 FLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRA 999
Cdd:cd09098    241 FLEGKLDFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFPDNSKEQYTWSPGTLLHYGRA 320
                          330
                   ....*....|....*.
gi 1720386233 1000 ELKTSDHRPVVALIDI 1015
Cdd:cd09098    321 ELKTSDHRPVVALIDI 336
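
The paired rows above (query `gi` line over `Cdd:` line) can be compared column by column. The following is a minimal sketch of a percent-identity calculation over the aligned columns, assuming `-` marks a gap and that letter case should be ignored. The helper name `percent_identity` is hypothetical, and the value it returns is not one that the CD-Search page itself reports.

```python
def percent_identity(query_row, subject_row):
    """Percent of non-gap aligned columns where both rows carry the same residue."""
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-":
            continue  # skip columns where either row has a gap
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


# Final alignment block of the cd09098 hit (query residues 1000-1015).
last_block = percent_identity("ELKTSDHRPVVALIDI", "ELKTSDHRPVVALIDI")
```

To score a whole hit, the residue strings from each block would first be concatenated per row, after stripping the identifiers and coordinates from both ends.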
INPP5c_Synj cd09089
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanins; This ...
680-1015 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanins; This subfamily contains the INPP5c domains of two human synaptojanins, synaptojanin 1 (Synj1) and synaptojanin 2 (Synj2), and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs). They belong to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj1 occurs as two main isoforms: a brain enriched 145 KDa protein (Synj1-145) and a ubiquitously expressed 170KDa protein (Synj1-170). Synj1-145 participates in clathrin-mediated endocytosis. The primary substrate of the Synj1-145 INPP5c domain is PI(4,5)P2, which it converts to PI4P. Synj1-145 may work with membrane curvature sensors/generators (such as endophilin) to remove PI(4,5)P2 from curved membranes. The recruitment of the INPP5c domain of Synj1-145 to endophilin-induced membranes leads to a fragmentation and condensation of these structures. The PI(4,5)P2 to PI4P conversion may cooperate with dynamin to produce membrane fission. In addition to this INPP5c domain, Synjs contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro. Synj2 can hydrolyze phosphatidylinositol diphosphate (PIP2) to phosphatidylinositol phosphate (PIP). Synj2 occurs as multiple alternative splice variants in various tissues. These variants share the INPP5c domain and the Sac1 domain. Synj2A is recruited to the mitochondria via its interaction with OMP25 (a mitochondrial outer membrane protein). Synj2B is found at nerve terminals in the brain and at the spermatid manchette in testis. Synj2B undergoes further alternative splicing to give 2B1 and 2B2. 
In clathrin-mediated endocytosis, Synj2 participates in the formation of clathrin-coated pits, and perhaps also in vesicle decoating. Rac1 GTPase regulates the intracellular localization of Synj2 forms, but not Synj1. Synj2 may contribute to the role of Rac1 in cell migration and invasion, and is a potential target for therapeutic intervention in malignant tumors.


Pssm-ID: 197323 [Multi-domain]  Cd Length: 328  Bit Score: 685.66  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQ-EFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLW 758
Cdd:cd09089      1 LRVFVGTWNVNGGKHFRSIAFKHQSMTDWLLDNPKLAGQCsNDSEEDEKPVDIFAIGFEEMVDLNASNIVSASTTNQKEW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  759 AVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFA 838
Cdd:cd09089     81 GEELQKTISRDHKYVLLTSEQLVGVCLFVFVRPQHAPFIRDVAVDTVKTGLGGAAGNKGAVAIRFLLHSTSLCFVCSHFA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  839 AGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFR 918
Cdd:cd09089    161 AGQSQVKERNEDFAEIARKLSFPMGRTLDSHDYVFWCGDFNYRIDLPNDEVKELVRNGDWLKLLEFDQLTKQKAAGNVFK 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  919 GFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDllnasfqdeSKILYTWTPGTLLHYGR 998
Cdd:cd09089    241 GFLEGEINFAPTYKYDLFSDDYDTSEKCRTPAWTDRVLWRRRKWPSDKTEESLV---------ETNDPTWNPGTLLYYGR 311
                          330
                   ....*....|....*..
gi 1720386233  999 AELKTSDHRPVVALIDI 1015
Cdd:cd09089    312 AELKTSDHRPVVAIIDI 328
INPP5c_Synj2 cd09099
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 2; This ...
680-1015 0e+00

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of synaptojanin 2; This subfamily contains the INPP5c domains of human synaptojanin 2 (Synj2) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Synj2 can hydrolyze phosphatidylinositol diphosphate (PIP2) to phosphatidylinositol phosphate (PIP). In addition to this INPP5c domain, these proteins contain an N-terminal Sac1-like domain; the Sac1 domain can dephosphorylate a variety of phosphoinositides in vitro. Synj2 occurs as multiple alternative splice variants in various tissues. These variants share the INPP5c domain and the Sac1 domain. Synj2A is recruited to the mitochondria via its interaction with OMP25, a mitochondrial outer membrane protein. Synj2B is found at nerve terminals in the brain and at the spermatid manchette in testis. Synj2B undergoes further alternative splicing to give 2B1 and 2B2. In clathrin-mediated endocytosis, Synj2 participates in the formation of clathrin-coated pits, and perhaps also in vesicle decoating. Rac1 GTPase regulates the intracellular localization of Synj2 forms, but not Synj1. Synj2 may contribute to the role of Rac1 in cell migration and invasion, and is a potential target for therapeutic intervention in malignant tumors.


Pssm-ID: 197333  Cd Length: 336  Bit Score: 566.96  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKQFRSIAFKNQTLTDWLLDAPKLAGIQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 759
Cdd:cd09099      1 TRVAMGTWNVNGGKQFRSNILGTSELTDWLLDSPKLSGTPDFQDDESNPPDIFAVGFEEMVELSAGNIVNASTTNRKMWG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  760 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 839
Cdd:cd09099     81 EQLQKAISRSHRYILLTSAQLVGVCLFIFVRPYHVPFIRDVAIDTVKTGMGGKAGNKGAVAIRFQFYSTSFCFICSHLTA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  840 GQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRG 919
Cdd:cd09099    161 GQNQVKERNEDYKEITQKLSFPMGRNVFSHDYVFWCGDFNYRIDLTYEEVFYFIKRQDWKKLLEFDQLQLQKSSGKIFKD 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  920 FLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRKWPFDRSAEDLDLLNASFQDESKILYTWTPGTLLHYGRA 999
Cdd:cd09099    241 FHEGTINFGPTYKYDVGSEAYDTSDKCRTPAWTDRVLWWRKKWPFEKTAGEINLLDSDLDFDTKIRHTWTPGALMYYGRA 320
                          330
                   ....*....|....*.
gi 1720386233 1000 ELKTSDHRPVVALIDI 1015
Cdd:cd09099    321 ELQASDHRPVLAIVEV 336
IPPc smart00128
Inositol polyphosphate phosphatase, catalytic domain homologues; Mg(2+)-dependent/Li(+) ...
678-1018 3.38e-128

Inositol polyphosphate phosphatase, catalytic domain homologues; Mg(2+)-dependent/Li(+)-sensitive enzymes.


Pssm-ID: 214525 [Multi-domain]  Cd Length: 306  Bit Score: 399.04  E-value: 3.38e-128
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   678 KKIRVCVGTWNVNGGKqfrsiaFKNQTLTDWLLdapklagiQEFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKL 757
Cdd:smart00128    1 RDIKVLIGTWNVGGLE------SPKVDVTSWLF--------QKIEVKQSEKPDIYVIGLQEVVGLAPGVILETIAGKERL 66
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   758 WAVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 837
Cdd:smart00128   67 WSDLLESSLNGDGQYNVLAKVYLVGILVLVFVKANHLVYIKDVETFTVKTGMGGLWGNKGAVAVRFKLSDTSFCFVNSHL 146
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   838 AAGQSQVKERNEDFVEIARKLSFPMGRML--FSHDYVFWCGDFNYRIDLP-NEEVKELIRQQNWDSLIAGDQLINQKNAG 914
Cdd:smart00128  147 AAGASNVEQRNQDYKTILRALSFPERALLsqFDHDVVFWFGDLNFRLDSPsYEEVRRKISKKEFDDLLEKDQLNRQREAG 226
                           250       260       270       280       290       300       310       320
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233   915 QIFRGFLEGKVTFAPTYKYDLF-SEDYDTSEKCRTPAWTDRVLWRrrkwpfdRSAEDLDLLNAsfqdeskilytwtpgtl 993
Cdd:smart00128  227 KVFKGFQEGPITFPPTYKYDSVgTETYDTSEKKRVPAWCDRILYR-------SNGPELIQLSE----------------- 282
                           330       340
                    ....*....|....*....|....*
gi 1720386233   994 lHYGRAELKTSDHRPVVALIDIDIF 1018
Cdd:smart00128  283 -YHSGMEITTSDHKPVFATFRLKVT 306
INPP5c cd09074
Catalytic domain of inositol polyphosphate 5-phosphatases; Inositol polyphosphate ...
680-1015 3.56e-107

Catalytic domain of inositol polyphosphate 5-phosphatases; Inositol polyphosphate 5-phosphatases (5-phosphatases) are signal-modifying enzymes, which hydrolyze the 5-phosphate from the inositol ring of specific 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), such as PI(4,5)P2, PI(3,4,5)P3, PI(3,5)P2, I(1,4,5)P3, and I(1,3,4,5)P4. These enzymes are Mg2+-dependent, and belong to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. In addition to this INPP5c domain, 5-phosphatases often contain additional domains and motifs, such as the SH2 domain, the Sac-1 domain, the proline-rich domain (PRD), CAAX, RhoGAP (RhoGTPase-activating protein), and SKICH [SKIP (skeletal muscle- and kidney-enriched inositol phosphatase) carboxyl homology] domains, that are important for protein-protein interactions and/or for the subcellular localization of these enzymes. 5-phosphatases incorporate into large signaling complexes, and regulate diverse cellular processes including postsynaptic vesicular trafficking, insulin signaling, cell growth and survival, and endocytosis. Loss or gain of function of 5-phosphatases is implicated in certain human diseases. This family also contains a functionally unrelated nitric oxide transport protein, Cimex lectularius (bedbug) nitrophorin, which catalyzes a heme-assisted S-nitrosation of a proximal thiolate; the heme however binds at a site distinct from the active site of the 5-phosphatases.


Pssm-ID: 197308 [Multi-domain]  Cd Length: 299  Bit Score: 341.62  E-value: 3.56e-107
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKqfrsiaFKNQTLTDWLLDAPklagiqefqdkrSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 759
Cdd:cd09074      1 VKIFVVTWNVGGGI------SPPENLENWLSPKG------------TEAPDIYAVGVQEVDMSVQGFVGNDDSAKAREWV 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  760 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAV--DTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 837
Cdd:cd09074     63 DNIQEALNEKENYVLLGSAQLVGIFLFVFVKKEHLPQIKDLEVegVTVGTGGGGKLGNKGGVAIRFQINDTSFCFVNSHL 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  838 AAGQSQVKERNEDFVEIARKLSFPMG----RMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNA 913
Cdd:cd09074    143 AAGQEEVERRNQDYRDILSKLKFYRGdpaiDSIFDHDVVFWFGDLNYRIDSTDDEVRKLISQGDLDDLLEKDQLKKQKEK 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  914 GQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRrkwpfdrsaedldllnasfqdeskilYTWTPGTL 993
Cdd:cd09074    223 GKVFDGFQELPITFPPTYKFDPGTDEYDTSDKKRIPAWCDRILYKS--------------------------KAGSEIQP 276
                          330       340
                   ....*....|....*....|...
gi 1720386233  994 LHYGRAEL-KTSDHRPVVALIDI 1015
Cdd:cd09074    277 LSYTSVPLyKTSDHKPVRATFRV 299
INPP5c_ScInp51p-like cd09090
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Saccharomyces cerevisiae ...
680-1011 8.95e-105

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Saccharomyces cerevisiae Inp51p, Inp52p, and Inp53p, and related proteins; This subfamily contains the INPP5c domain of three Saccharomyces cerevisiae synaptojanin-like inositol polyphosphate 5-phosphatases (INP51, INP52, and INP53), Schizosaccharomyces pombe synaptojanin (SPsynaptojanin), and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. In addition to this INPP5c domain, these proteins have an N-terminal catalytic Sac1-like domain (found in other proteins including the phosphoinositide phosphatase Sac1p), and a C-terminal proline-rich domain (PRD). The Sac1 domain allows Inp52p and Inp53p to recognize and dephosphorylate a wider range of substrates including PI3P, PI4P, and PI(3,5)P2. The Sac1 domain of Inp51p is non-functional. Disruption of any two of INP51, INP52, and INP53, in S. cerevisiae leads to abnormal vacuolar and plasma membrane morphology. During hyperosmotic stress, Inp52p and Inp53p localize at actin patches, where they may facilitate the hydrolysis of PI(4,5)P2, and consequently promote actin rearrangement to regulate cell growth. SPsynaptojanin is also active against a range of soluble and lipid inositol phosphates, including I(1,4,5)P3, I(1,3,4,5)P4, I(1,4,5,6)P4, PI(4,5)P2, and PIP3. Transformation of S. cerevisiae with a plasmid expressing the SPsynaptojanin 5-phosphatase domain rescues inp51/inp52/inp53 triple-mutant strains.


Pssm-ID: 197324  Cd Length: 291  Bit Score: 334.69  E-value: 8.95e-105
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGgkqfrsiAFKNQTLTDWLldapklagiqeFQDKRSKPTDIFAIGFEEMVELNAGNIVNASTTNQKLWA 759
Cdd:cd09090      1 INIFVGTFNVNG-------KSYKDDLSSWL-----------FPEENDELPDIVVIGLQEVVELTAGQILNSDPSKSSFWE 62
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  760 VELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 837
Cdd:cd09090     63 KKIKTTLNGrgGEKYVLLRSEQLVGTALLFFVKESQLPKVKNVEGSTKKTGLGGMSGNKGAVAIRFDYGDTSFCFVTSHL 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  838 AAGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIF 917
Cdd:cd09090    143 AAGLTNYEERNNDYKTIARGLRFSRGRTIKDHDHVIWLGDFNYRISLTNEDVRRFILNGKLDKLLEYDQLNQQMNAGEVF 222
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  918 RGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRrrkwpfdrsAEDLDLLNasfqdeskilytwtpgtllhYG 997
Cdd:cd09090    223 PGFSEGPITFPPTYKYDKGTDNYDTSEKQRIPAWTDRILYR---------GENLRQLS--------------------YN 273
                          330
                   ....*....|....
gi 1720386233  998 RAELKTSDHRPVVA 1011
Cdd:cd09090    274 SAPLRFSDHRPVYA 287
INPP5c_INPP5B cd09093
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Type II inositol ...
680-1015 9.48e-100

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Type II inositol polyphosphate 5-phosphatase I, Oculocerebrorenal syndrome of Lowe 1, and related proteins; This subfamily contains the INPP5c domain of type II inositol polyphosphate 5-phosphatase I (INPP5B), Oculocerebrorenal syndrome of Lowe 1 (OCRL-1), and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. INPP5B and OCRL1 preferentially hydrolyze the 5-phosphate of phosphatidylinositol (4,5)-bisphosphate [PI(4,5)P2] and phosphatidylinositol (3,4,5)-trisphosphate [PI(3,4,5)P3]. INPP5B can also hydrolyze soluble inositol (1,4,5)-trisphosphate [I(1,4,5)P3] and inositol (1,3,4,5)-tetrakisphosphate [I(1,3,4,5)P4]. INPP5B participates in the endocytic pathway and in the early secretory pathway. In the latter, it may function in retrograde ERGIC (ER-to-Golgi intermediate compartment)-to-ER transport; it binds specific RAB proteins within the secretory pathway. In the endocytic pathway, it binds RAB5 and during endocytosis, may function in a RAB5-controlled cascade for converting PI(3,4,5)P3 to phosphatidylinositol 3-phosphate (PI3P). This cascade may link growth factor signaling and membrane dynamics. Mutation in OCRL1 is implicated in Lowe syndrome, an X-linked recessive multisystem disorder, which includes defects in eye, brain, and kidney function, and in Type 2 Dent's disease, a disorder with only the renal symptoms. OCRL-1 may have a role in membrane trafficking within the endocytic pathway and at the trans-Golgi network, and may participate in actin dynamics or signaling from endomembranes.
OCRL1 and INPP5B have overlapping functions: deletion of both 5-phosphatases in mice is embryonic lethal, deletion of OCRL1 alone has no phenotype, and deletion of Inpp5b alone has only a mild phenotype (male sterility). Several of the proteins that interact with OCRL1 also bind INPP5B, for example, inositol polyphosphate phosphatase interacting protein of 27 kDa (IPIP27) A and B (also known as Ses1 and Ses2), and endocytic signaling adaptor APPL1. OCRL1, but not INPP5B, binds clathrin heavy chain and the plasma membrane AP2 adaptor subunit alpha-adaptin. In addition to this INPP5c domain, most proteins in this subfamily have a C-terminal RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain.


Pssm-ID: 197327  Cd Length: 292  Bit Score: 320.80  E-value: 9.48e-100
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKqfrsiafKNQTLTDWLldapklagiqefqDKRSKPTDIFAIGFEEmVELNAGNIVNASTTNQKLWA 759
Cdd:cd09093      1 FRIFVGTWNVNGQS-------PDESLRPWL-------------SCDEEPPDIYAIGFQE-LDLSAEAFLFNDSSREQEWV 59
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  760 VELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAA 839
Cdd:cd09093     60 KAVERGLHPDAKYKKVKLIRLVGMMLLVFVKKEHRQHIKEVAAETVGTGIMGKMGNKGGVAVRFQFHNTTFCFVNSHLAA 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  840 GQSQVKERNEDFVEIARKLSFPMG----RMLFSHDYVFWCGDFNYRI-DLPNEEVKELIRQQNWDSLIAGDQLINQKNAG 914
Cdd:cd09093    140 HMEEVERRNQDYKDICARMKFEDPdgppLSISDHDVVFWLGDLNYRIqELPTEEVKELIEKNDLEELLKYDQLNIQRRAG 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  915 QIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRrrkwpfdrsaedldllnasfqdESKIlytwtpgTLL 994
Cdd:cd09093    220 KVFEGFTEGEINFIPTYKYDPGTDNWDSSEKCRAPAWCDRILWR----------------------GTNI-------VQL 270
                          330       340
                   ....*....|....*....|..
gi 1720386233  995 HYGR-AELKTSDHRPVVALIDI 1015
Cdd:cd09093    271 SYRShMELKTSDHKPVSALFDI 292
COG5329 COG5329
Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];
204-627 6.52e-95

Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms];


Pssm-ID: 227637 [Multi-domain]  Cd Length: 570  Bit Score: 317.79  E-value: 6.52e-95
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  204 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASDEDRIS---------EVRKVLNSGNFYF 274
Cdd:COG5329     61 YGVIGLIKLKGD----IYLIVITGASLVGVIPGHSIYKILDVDFISLNNNKWDDELEEdeanydklsELKKLLSNGTFYF 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  275 A--WSASGvSLDLSLNAHRSMQEHTTDNRFFWNQSL------HLHLKHYGVNCDD-WLLRLMCGGVEIRTIYAAHKQAKA 345
Cdd:COG5329    137 SydFDITN-SLQKNLSEGLEASVDRADLIFMWNSFLleefinHRSKLSSLEKQFDnFLTTVIRGFAETVDIKVGGNTISL 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  346 CLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDCVSSFIQIRGSVPLFWEQPGLQVGShRVRMSRGFEANAPA 425
Cdd:COG5329    216 TLISRRSSERAGTRYLSRGIDDDGNVSNFVETEQIVTDSQYIFSFTQVRGSIPLFWEQSNLLYGP-KIKVTRSSEAAQSA 294
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  426 FDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKASEHAsDIHMVSFDYHQMVKGGKAEKLHSILKPQVQKFLDY 505
Cdd:COG5329    295 FDKHFDKLREKYGDVYVVNLLKTKGYEAPLLELYEKHLDLSKKP-KIHYTEFDFHKETSQDGFDDVKKLLYLIEQDLLEF 373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  506 GFFYFDGSEVQRC--QSGTVRTNCLDCLDRTNSVQAFLGLEMLAKQLEALGLAEKpqlVTRFQEVFRSMWSVNGDSISKI 583
Cdd:COG5329    374 GYFAYDINEGKSIseQDGVFRTNCLDCLDRTNVIQSLISRVLLEQFRSEGVISDG---YSPFLQIHRELWADNGDAISRL 450
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1720386233  584 YAGTGALEGKAK-------AGKLKDGARSVTRTIQNNFFDSSKQEAIDVLL 627
Cdd:COG5329    451 YTGTGALKSSFTrrgrrsfAGALNDFIKSFSRYYINNFTDGQRQDAIDLLL 501
Syja_N pfam02383
SacI homology domain; This Pfam family represents a protein domain which shows homology to the ...
204-484 2.38e-86

SacI homology domain; This Pfam family represents a protein domain that shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminus of the inositol 5'-phosphatase synaptojanin.


Pssm-ID: 460545  Cd Length: 295  Bit Score: 283.69  E-value: 2.38e-86
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  204 YGLLGVLRLNLGdtmlHYLVLVTGCMSVGKIQESEVFRVTSTEFISLRVDASD----------EDRI-SEVRKVLNSGNF 272
Cdd:pfam02383    1 YGILGLIRLLSG----YYLIVITKREQVGQIGGHPIYKITDVEFIPLNSSLSDtqlakkehpdEERLlKLLKLFLSSGSF 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  273 YFAWSasgvsLDLSlnahRSMQEHTT----------DNRFFWNQSLHLHLKHYGVNCDDWLLRLMCGGVEIRTIYAAHKQ 342
Cdd:pfam02383   77 YFSYD-----YDLT----NSLQRNLTrsrspsfdslDDRFFWNRHLLKPLIDFQLDLDRWILPLIQGFVEQGKLSVFGRS 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  343 AKACLISRLSCERAGTRFNVRGTNDDGHVANFVETEQVIYLDDC-----VSSFIQIRGSVPLFWEQPGLQVGSHRVRMSR 417
Cdd:pfam02383  148 VTLTLISRRSRKRAGTRYLRRGIDDDGNVANFVETEQIVSLNTSnsegkIFSFVQIRGSIPLFWSQDPNLKYKPKIQITR 227
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720386233  418 gFEANAPAFDRHFRTLKDLYGKQIVVNLLGSKEGEHMLSKAFQSHLKAS--EHASDIHMVSFDYHQMVK 484
Cdd:pfam02383  228 -PEATQPAFKKHFDDLIERYGPVHIVNLVEKKGRESKLSEAYEEAVKYLnqFLPDKLRYTAFDFHHECK 295
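
Each hit on this page carries a one-line summary of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", optionally tagged "[Multi-domain]". The following is a minimal Python sketch for pulling those fields out of such a line. The layout is inferred only from the lines shown on this page, and the names SUMMARY_RE and parse_hit_summary are hypothetical helpers, not part of any NCBI tool.

```python
import re

# Assumed layout of the per-hit summary line, inferred from this page only.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),  # accepts forms like 2.38e-86 or 0e+00
    }


if __name__ == "__main__":
    print(parse_hit_summary(
        "Pssm-ID: 460545  Cd Length: 295  Bit Score: 283.69  E-value: 2.38e-86"
    ))
```

Returning None for non-matching lines lets the helper be applied to every line of a saved page without first locating the summary rows.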
INPP5c_INPP5J-like cd09094
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of inositol polyphosphate ...
681-1015 8.54e-72

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of inositol polyphosphate 5-phosphatase J and related proteins; INPP5c domain of Inositol polyphosphate-5-phosphatase J (INPP5J), also known as PIB5PA or PIPP, and related proteins. This subfamily belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. INPP5J hydrolyzes PI(4,5)P2, I(1,4,5)P3, and I(1,3,4,5)P4 at ruffling membranes. These proteins contain a C-terminal, SKIP carboxyl homology domain (SKICH), which may direct plasma membrane ruffle localization.


Pssm-ID: 197328  Cd Length: 300  Bit Score: 242.28  E-value: 8.54e-72
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  681 RVCVGTWNVnggkqfrSIAFKNQTLTdwlldapKLAGIQEFQDKrskpTDIFAIGFEEmvelnagniVNASTTNQKL--- 757
Cdd:cd09094      2 RVYVVTWNV-------ATAPPPIDVR-------SLLGLQSPEVA----PDIYIIGLQE---------VNSKPVQFVSdli 54
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  758 ----WAvELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFV 833
Cdd:cd09094     55 fddpWS-DLFMDILSPKGYVKVSSIRLQGLLLLVFVKIQHLPFIRDVQTNYTRTGLGGYWGNKGAVTVRFSLYGHMICFL 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  834 CSHFAAGQSQVKERNEDFVEIARKLSFPMGRM--LFSHDYVFWCGDFNYRI-DLPNEEVKELIRQQNWDSLIAGDQLINQ 910
Cdd:cd09094    134 NCHLPAHMEKWEQRIDDFETILSTQVFNECNTpsILDHDYVFWFGDLNFRIeDVSIEFVRELVNSKKYHLLLEKDQLNMA 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  911 KNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRrrkwpfdrsaedLDLLNASFQDESKIlytwtp 990
Cdd:cd09094    214 KRKEEAFQGFQEGPLNFAPTYKFDLGTDEYDTSGKKRKPAWTDRILWK------------VNPDASTEEKFLSI------ 275
                          330       340
                   ....*....|....*....|....*.
gi 1720386233  991 gTLLHY-GRAELKTSDHRPVVALIDI 1015
Cdd:cd09094    276 -TQTSYkSHMEYGISDHKPVTAQFRL 300
COG5411 COG5411
Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms];
673-1041 6.90e-70

Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms];


Pssm-ID: 227698 [Multi-domain]  Cd Length: 460  Bit Score: 242.77  E-value: 6.90e-70
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  673 KYSKPKKIRVCVGTWNVNGgkqfrsiafKNQT--LTDWLLdaPklagiqefQDKRSKPTDIFAIGFEEMVELNAGNIVNA 750
Cdd:COG5411     23 KYVIEKDVSIFVSTFNPPG---------KPPKasTKRWLF--P--------EIEATELADLYVVGLQEVVELTPGSILSA 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  751 STtNQKL--W---AVELQKTISRDNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLF 825
Cdd:COG5411     84 DP-YDRLriWeskVLDCLNGAQSDEKYSLLRSPQLGGILLRVFSLATNLPVVKPVSGTVKKTGFGGSSSNKGAVAIRFNY 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  826 HTTSLCFVCSHFAAGQSQVKERNEDFVEIARKLSFPMGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNW--DSLIA 903
Cdd:COG5411    163 ERTSFCFVNSHLAAGVNNIEERIFDYRSIASNICFSRGLRIYDHDTIFWLGDLNYRVTSTNEEVRPEIASDDGrlDKLFE 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  904 GDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRrkwpfdrsaedldllnasfqdesk 983
Cdd:COG5411    243 YDQLLWEMEVGNVFPGFKEPVITFPPTYKFDYGTDEYDTSDKGRIPSWTDRILYKS------------------------ 298
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720386233  984 ilYTWTPGTllhYGRAE-LKTSDHRPVVALIDIDIFEVEAEERQKIYKEVIA--VQGPPDG 1041
Cdd:COG5411    299 --EQLTPHS---YSSIPhLMISDHRPVYATFRAKIKVVDPSKKEGLIEKLYAeyKTELGEA 354
DUF1866 pfam08952
Domain of unknown function (DUF1866); This domain, found in Synaptojanin, has no known ...
1014-1155 2.22e-63

Domain of unknown function (DUF1866); This domain, found in Synaptojanin, has no known function.


Pssm-ID: 286093  Cd Length: 146  Bit Score: 211.98  E-value: 2.22e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1014 DIDIFEVEAEERQKIYKEVIAVQGPPDGTVLVSIKS-SAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEG 1092
Cdd:pfam08952    1 DVEIQEVDPEARRRVFKEVIRDQGPPDGTIVVSLCSgDLDEKNIFDENLMDELIQELTSFGEVTLVRFVEDTMWVTFRDG 80
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1720386233 1093 SSALNALSLNGKELLNRTITITLKSPDWIKHLEEEM---SLEKISVTlpSSASSTLLGEDAEVAAD 1155
Cdd:pfam08952   81 HSALNALSKDGMKVCGRALKIRLKSKDWIKGLEEEIilcTDNTIPVS--PCANSTLLAEDFDFGSP 144
INPP5c_INPP5E-like cd09095
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Inositol ...
678-1015 8.67e-57

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of Inositol polyphosphate-5-phosphatase E and related proteins; INPP5c domain of Inositol polyphosphate-5-phosphatase E (also called type IV or 72 kDa 5-phosphatase), rat pharbin, and related proteins. This subfamily belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. INPP5E hydrolyzes the 5-phosphate from PI(3,5)P2, PI(4,5)P2 and PI(3,4,5)P3, forming PI3P, PI4P, and PI(3,4)P2, respectively. It is a very potent PI(3,4,5)P3 5-phosphatase. Its intracellular localization is chiefly cytosolic, with pronounced perinuclear/Golgi localization. INPP5E also has an N-terminal proline rich domain (PRD) and a C-terminal CAAX motif. This protein is expressed in a variety of tissues, including the breast, brain, testis, and haemopoietic cells. It is differentially expressed in several cancers; for example, it is up-regulated in cervical cancer and down-regulated in stomach cancer. It is a candidate target for therapeutics of obesity and related disorders, as it is expressed in the hypothalamus and, following insulin stimulation, undergoes tyrosine phosphorylation, associates with insulin receptor substrate-1, -2, and PI3-kinase, and becomes active as a 5-phosphatase. INPP5E may play a role, along with other 5-phosphatases SHIP2 and SKIP, in regulating glucose homoeostasis and energy metabolism. Mice deficient in INPP5E develop a multi-organ disorder associated with structural defects of the primary cilium.


Pssm-ID: 197329  Cd Length: 298  Bit Score: 199.19  E-value: 8.67e-57
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  678 KKIRVCVGTWNVNGGKQFrsiafkNQTLTDWLLdapklAGIQEFQdkrskpTDIFAIGFEEmvelnagnivnaSTTNQKL 757
Cdd:cd09095      3 RNVGIFVATWNMQGQKEL------PENLDDFLL-----PTSADFA------QDIYVIGVQE------------GCSDRRE 53
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  758 WAVELQKTISrdNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 837
Cdd:cd09095     54 WEIRLQETLG--PSHVLLHSASHGVLHLAVFIRRDLIWFCSEVESATVTTRIVSQIKTKGALAISFTFFGTSFLFITSHF 131
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  838 AAGQSQVKERNEDFVEIARKLSFPmgRMLFSHDY-------------VFWCGDFNYRIDLPNEEVKELIRQ---QNWDSL 901
Cdd:cd09095    132 TSGDGKVKERVLDYNKIIQALNLP--RNVPTNPYksesgdvttrfdeVFWFGDFNFRLSGPRHLVDALINQgqeVDVSAL 209
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  902 IAGDQLINQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRTPAWTDRVLWRRRkwpfdrsaedldllnasfqde 981
Cdd:cd09095    210 LQHDQLTREMSKGSIFKGFQEAPIHFPPTYKFDIGSDVYDTSSKQRVPSYTDRILYRSR--------------------- 268
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 1720386233  982 skilytwTPGTL--LHYGRAE-LKTSDHRPVVALIDI 1015
Cdd:cd09095    269 -------QKGDVccLKYNSCPsIKTSDHRPVFALFRV 298
PLN03191 PLN03191
Type I inositol-1,4,5-trisphosphate 5-phosphatase 2; Provisional
771-1030 5.71e-50

Type I inositol-1,4,5-trisphosphate 5-phosphatase 2; Provisional


Pssm-ID: 215624 [Multi-domain]  Cd Length: 621  Bit Score: 188.58  E-value: 5.71e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  771 KYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKE--RN 848
Cdd:PLN03191   363 KYVRIVSKQMVGIYVSVWVRKRLRRHINNLKVSPVGVGLMGYMGNKGSVSISMSLFQSRLCFVCSHLTSGHKDGAEqrRN 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  849 EDFVEIARKLSFP------MGRMLFSHDYVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLINQKNAGQIFRGFLE 922
Cdd:PLN03191   443 ADVYEIIRRTRFSsvldtdQPQTIPSHDQIFWFGDLNYRLNMLDTEVRKLVAQKRWDELINSDQLIKELRSGHVFDGWKE 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  923 GKVTFAPTYKYDLFSEDY-----DTSEKCRTPAWTDRVLWrrrkwpfdrsaedldlLNASFQDESkilytwtpgtllhYG 997
Cdd:PLN03191   523 GPIKFPPTYKYEINSDRYvgenpKEGEKKRSPAWCDRILW----------------LGKGIKQLC-------------YK 573
                          250       260       270
                   ....*....|....*....|....*....|...
gi 1720386233  998 RAELKTSDHRPVVALIDIdifEVEAEERQKIYK 1030
Cdd:PLN03191   574 RSEIRLSDHRPVSSMFLV---EVEVFDHRKLQR 603
INPP5c_SHIP1-INPP5D cd09100
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing ...
680-958 5.44e-41

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing inositol polyphosphate 5-phosphatase-1 and related proteins; This subfamily contains the INPP5c domain of SHIP1 (SH2 domain containing inositol polyphosphate 5-phosphatase-1, also known as SHIP/INPP5D) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. SHIP1's enzymic activity is restricted to phosphatidylinositol 3,4,5-trisphosphate [PI(3,4,5)P3] and inositol-1,3,4,5-polyphosphate [I(1,3,4,5)P4]. It converts these two substrates to phosphatidylinositol 3,4-bisphosphate [PI(3,4)P2] and inositol-1,3,4-polyphosphate [I(1,3,4)P3], respectively. SHIP1 is a negative regulator of cell growth and plays a major part in mediating the inhibitory signaling in B cells; it is predominantly expressed in hematopoietic cells. In addition to this INPP5c domain, SHIP1 has an N-terminal SH2 domain, two NPXY motifs, and a C-terminal proline-rich region (PRD). SHIP1's phosphorylated NPXY motifs interact with proteins with phosphotyrosine binding (PTB) domains, and facilitate the translocation of SHIP1 to the plasma membrane to hydrolyze PI(3,4,5)P3. SHIP1 generally acts to oppose the activity of phosphatidylinositol 3-kinase (PI3K). It acts as a negative signaling molecule, reducing the levels of PI(3,4,5)P3, thereby removing the latter as a membrane-targeting signal for PH domain-containing effector molecules. SHIP1 may also, in certain contexts, amplify PI3K signals. SHIP1 and SHIP2 have little overlap in their in vivo functions.


Pssm-ID: 197334  Cd Length: 307  Bit Score: 153.99  E-value: 5.44e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKQFRSIafknqtlTDWLLDApklaGIQEFQDKRSK--PTDIFAIGFEEmvelnagnivnaSTTNQKL 757
Cdd:cd09100      1 ITIFIGTWNMGNAPPPKKI-------TSWFQCK----GQGKTRDDTADyiPHDIYVIGTQE------------DPLGEKE 57
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  758 WAVELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCS 835
Cdd:cd09100     58 WLDTLKHSLREitSISFKVIAIQTLWNIRIVVLAKPEHENRISHICTDSVKTGIANTLGNKGAVGVSFMFNGTSFGFVNS 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  836 HFAAGQSQVKERNEDFVEIARKLSF---PMGRMLFSH--DYVFWCGDFNYRIDLPNEEVKEL---IRQQNWDSLIAGDQL 907
Cdd:cd09100    138 HLTSGSEKKLRRNQNYFNILRFLVLgdkKLSPFNITHrfTHLFWLGDLNYRVELPNTEAENIiqkIKQQQYQELLPHDQL 217
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720386233  908 INQKNAGQIFRGFLEGKVTFAPTYKYD-------LFSEDYDTSEKCRTPAWTDRVLWR 958
Cdd:cd09100    218 LIERKESKVFLQFEEEEITFAPTYRFErgtreryAYTKQKATGMKYNLPSWCDRVLWK 275
INPP5c_SHIP cd09091
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing ...
680-1015 2.01e-40

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing inositol polyphosphate 5-phosphatase-1 and -2, and related proteins; This subfamily contains the INPP5c domain of SHIP1 (SH2 domain containing inositol polyphosphate 5-phosphatase-1, also known as SHIP/INPP5D), and SHIP2 (also known as INPPL1). It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. Both SHIP1 and -2 catalyze the dephosphorylation of the PI, phosphatidylinositol 3,4,5-trisphosphate [PI(3,4,5)P3], to phosphatidylinositol 3,4-bisphosphate [PI(3,4)P2]. SHIP1 also converts inositol-1,3,4,5-polyphosphate [I(1,3,4,5)P4] to inositol-1,3,4-polyphosphate [I(1,3,4)P3]. SHIP1 and SHIP2 have little overlap in their in vivo functions. SHIP1 is a negative regulator of cell growth and plays a major part in mediating the inhibitory signaling in B cells; it is predominantly expressed in hematopoietic cells. SHIP2 is an inhibitor of the insulin signaling pathway, and is implicated in actin structure remodeling, cell adhesion and cell spreading, receptor endocytosis and degradation, and in the JIP1-mediated JNK pathway. SHIP2 is widely expressed, most prominently in brain, heart, and skeletal muscle. In addition to this INPP5c domain, SHIP1 has an N-terminal SH2 domain, two NPXY motifs, and a C-terminal proline-rich region (PRD), while SHIP2 has an N-terminal SH2 domain, a C-terminal proline-rich domain (PRD), which includes a WW-domain binding motif (PPLP), an NPXY motif, and a sterile alpha motif (SAM) domain. The gene encoding SHIP2 is a candidate gene for conferring a predisposition for type 2 diabetes.


Pssm-ID: 197325  Cd Length: 307  Bit Score: 152.02  E-value: 2.01e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKQFRSIafknqtlTDWLLDAPKLAGIQEFQDkrSKPTDIFAIGFEEmvelnagnivnaSTTNQKLWA 759
Cdd:cd09091      1 ISIFIGTWNMGSAPPPKNI-------TSWFTSKGQGKTRDDVAD--YIPHDIYVIGTQE------------DPLGEKEWL 59
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  760 VELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCSHF 837
Cdd:cd09091     60 DLLRHSLKEltSLDYKPIAMQTLWNIRIVVLAKPEHENRISHVCTSSVKTGIANTLGNKGAVGVSFMFNGTSFGFVNSHL 139
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  838 AAGQSQVKERNEDFVEIARKLSF---PMGRMLFSH--DYVFWCGDFNYRIDLPNEEVKELI---RQQNWDSLIAGDQLIN 909
Cdd:cd09091    140 TSGSEKKLRRNQNYLNILRFLSLgdkKLSAFNITHrfTHLFWLGDLNYRLDLPIQEAENIIqkiEQQQFEPLLRHDQLNL 219
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  910 QKNAGQIFRGFLEGKVTFAPTYKYDLFSEDY-------DTSEKCRTPAWTDRVLWrrrkwpfdRSAEDLDLLNASFQDES 982
Cdd:cd09091    220 EREEHKVFLRFSEEEITFPPTYRYERGSRDTyaytkqkATGVKYNLPSWCDRILW--------KSYPETHIICQSYGCTD 291
                          330       340       350
                   ....*....|....*....|....*....|...
gi 1720386233  983 KILytwtpgtllhygraelkTSDHRPVVALIDI 1015
Cdd:cd09091    292 DIV-----------------TSDHSPVFGTFEV 307
RRM_SYNJ1 cd12719
RNA recognition motif (RRM) found in synaptojanin-1 and similar proteins; This subgroup ...
1040-1116 2.02e-39

RNA recognition motif (RRM) found in synaptojanin-1 and similar proteins; This subgroup corresponds to the RRM of synaptojanin-1, also termed synaptojanin, or synaptic inositol-1,4,5-trisphosphate 5-phosphatase 1, originally identified as one of the major Grb2-binding proteins that may participate in synaptic vesicle endocytosis. It also acts as a Src homology 3 (SH3) domain-binding brain-specific inositol 5-phosphatase with a putative role in clathrin-mediated endocytosis. Synaptojanin-1 contains an N-terminal domain homologous to the cytoplasmic portion of the yeast protein Sac1p, a central inositol 5-phosphatase domain followed by a putative RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal proline-rich region mediating the binding of synaptojanin-1 to various SH3 domain-containing proteins including amphiphysin, SH3p4, SH3p8, SH3p13, and Grb2. Synaptojanin-1 has two tissue-specific alternative splicing isoforms, synaptojanin-145 expressed in brain and synaptojanin-170 expressed in peripheral tissues. Synaptojanin-145 is very abundant in nerve terminals and may play an essential role in the clathrin-mediated endocytosis of synaptic vesicles. In contrast to synaptojanin-145, synaptojanin-170 contains three unique asparagine-proline-phenylalanine (NPF) motifs in the C-terminal region and may function as a binding partner for Eps15, a clathrin coat-associated protein acting as a major substrate for the tyrosine kinase activity of the epidermal growth factor receptor.


Pssm-ID: 410118  Cd Length: 77  Bit Score: 141.00  E-value: 2.02e-39
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720386233 1040 DGTVLVSIKSSAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITITLK 1116
Cdd:cd12719      1 DGTVVVSVLSSSPEPNYFDDNLIDALLQQFSSFGEVILIRFVEDKMWVTFLEGSSALAALSLNGTEVLGRTIIISLK 77
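
The alignment blocks on this page pair a query row ("gi 1720386233 <start> <sequence> <end>") with a domain-model row ("Cdd:<accession> <start> <sequence> <end>"). As a rough illustration, the Python sketch below splits such a row into its parts and computes percent identity over aligned columns. It assumes that lowercase letters mark residues not aligned to the domain model and that "-" marks a gap, which is how the rows on this page appear to be laid out; parse_alignment_row and percent_identity are hypothetical helper names.

```python
def parse_alignment_row(row):
    """Split one alignment row into (label, start, sequence, end).

    Assumes the last three whitespace-separated tokens are the start
    coordinate, the aligned sequence, and the end coordinate.
    """
    tokens = row.split()
    if len(tokens) < 4:
        raise ValueError("row does not look like an alignment row")
    start, seq, end = tokens[-3], tokens[-2], tokens[-1]
    label = " ".join(tokens[:-3])
    return label, int(start), seq, int(end)


def percent_identity(query_seq, subject_seq):
    """Percent identity over columns where both rows hold an aligned residue.

    Columns with a gap ('-') or a lowercase (assumed unaligned) residue in
    either row are skipped.
    """
    if len(query_seq) != len(subject_seq):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query_seq, subject_seq):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return 100.0 * identical / aligned if aligned else 0.0
```

For a multi-block alignment, the sequence pieces of each row would first be concatenated block by block before calling percent_identity.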
INPP5c_SHIP2-INPPL1 cd09101
Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing ...
680-958 3.90e-39

Catalytic inositol polyphosphate 5-phosphatase (INPP5c) domain of SH2 domain containing inositol 5-phosphatase-2 and related proteins; This subfamily contains the INPP5c domain of SHIP2 (SH2 domain containing inositol 5-phosphatase-2, also called INPPL1) and related proteins. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. SHIP2 catalyzes the dephosphorylation of the PI, phosphatidylinositol 3,4,5-trisphosphate [PI(3,4,5)P3], to phosphatidylinositol 3,4-bisphosphate [PI(3,4)P2]. SHIP2 is widely expressed, most prominently in brain, heart, and skeletal muscle. SHIP2 is an inhibitor of the insulin signaling pathway. It is implicated in actin structure remodeling, cell adhesion and cell spreading, receptor endocytosis and degradation, and in the JIP1-mediated JNK pathway. Its interacting partners include filamin/actin, p130Cas, Shc, Vinexin, Intersectin 1, and c-Jun NH2-terminal kinase (JNK)-interacting protein 1 (JIP1). A large variety of extracellular stimuli appear to lead to the tyrosine phosphorylation of SHIP2, including epidermal growth factor (EGF), platelet-derived growth factor (PDGF), insulin, macrophage colony-stimulating factor (M-CSF) and hepatocyte growth factor (HGF). SHIP2 is localized to the cytosol in quiescent cells; following growth factor stimulation and/or cell adhesion, it relocalizes to membrane ruffles. In addition to this INPP5c domain, SHIP2 has an N-terminal SH2 domain, a C-terminal proline-rich domain (PRD), which includes a WW-domain binding motif (PPLP), an NPXY motif and a sterile alpha motif (SAM) domain. The gene encoding SHIP2 is a candidate for conferring a predisposition for type 2 diabetes; it has been suggested that suppression of SHIP2 may be of benefit in the treatment of obesity and thereby prevent type 2 diabetes. SHIP2 and SHIP1 have little overlap in their in vivo functions.


Pssm-ID: 197335  Cd Length: 304  Bit Score: 148.20  E-value: 3.90e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  680 IRVCVGTWNVNGGKQFRSiafknqtLTDWLLDApklaGIQEFQDKRSK--PTDIFAIGFEEmvelnagnivnaSTTNQKL 757
Cdd:cd09101      1 ISIFIGTWNMGSVPPPKS-------LASWLTSR----GLGKTLDETTVtiPHDIYVFGTQE------------NSVGDRE 57
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  758 WAVELQKTISR--DNKYVLLASEQLVGVCLFVFIRPQHAPFIRDVAVDTVKTGMGGATGNKGAVAIRMLFHTTSLCFVCS 835
Cdd:cd09101     58 WVDFLRASLKEltDIDYQPIALQCLWNIKMVVLVKPEHENRISHVHTSSVKTGIANTLGNKGAVGVSFMFNGTSFGFVNC 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  836 HFAAGQSQVKERNEDFVEIARKLSFPMGR-------MLFSHdyVFWCGDFNYRIDLPNEEVKELIRQQNWDSLIAGDQLI 908
Cdd:cd09101    138 HLTSGNEKTHRRNQNYLDILRSLSLGDKQlnafdisLRFTH--LFWFGDLNYRLDMDIQEILNYITRKEFDPLLAVDQLN 215
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1720386233  909 NQKNAGQIFRGFLEGKVTFAPTYKYDLFSEDYDTSEKCRT-------PAWTDRVLWR 958
Cdd:cd09101    216 LEREKNKVFLRFREEEISFPPTYRYERGSRDTYMWQKQKTtgmrtnvPSWCDRILWK 272
RRM_SYNJ cd12440
RNA recognition motif (RRM) found in synaptojanin-1, synaptojanin-2 and similar proteins; This ...
1040-1116 2.22e-31

RNA recognition motif (RRM) found in synaptojanin-1, synaptojanin-2 and similar proteins; This subfamily corresponds to the RRM of two active phosphatidylinositol phosphate phosphatases, synaptojanin-1 and synaptojanin-2. They have different interaction partners and are likely to have different biological functions. Synaptojanin-1 was originally identified as one of the major Grb2-binding proteins that may participate in synaptic vesicle endocytosis. It also acts as a Src homology 3 (SH3) domain-binding brain-specific inositol 5-phosphatase with a putative role in clathrin-mediated endocytosis. Synaptojanin-2 is a ubiquitously expressed homolog of synaptojanin-1. It is a novel Rac1 effector regulating the early step of clathrin-mediated endocytosis. Synaptojanin-2 directly and specifically interacts with Rac1 in a GTP-dependent manner. It mediates the inhibitory effect of Rac1 on endocytosis and plays an important role in the Rac1-mediated control of cell growth. Both synaptojanin-1 and synaptojanin-2 have two tissue-specific alternative splicing isoforms, a shorter isoform expressed in brain and a longer isoform in peripheral tissues. Synaptojanin-1 contains an N-terminal domain homologous to the cytoplasmic portion of the yeast protein Sac1p, a central inositol 5-phosphatase domain followed by a putative RNA recognition motif (RRM), also termed RBD (RNA binding domain) or RNP (ribonucleoprotein domain), and a C-terminal proline-rich region mediating the binding of synaptojanin-1 to various SH3 domain-containing proteins including amphiphysin, SH3p4, SH3p8, SH3p13, and Grb2. Synaptojanin-2 shows high sequence homology to the N-terminal Sac1p homology domain, the central inositol 5-phosphatase domain, and the putative RNA recognition motif (RRM) of synaptojanin-1, but differs in the proline-rich region.


Pssm-ID: 409874 [Multi-domain]  Cd Length: 77  Bit Score: 117.91  E-value: 2.22e-31
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720386233 1040 DGTVLVSIKSSAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITITLK 1116
Cdd:cd12440      1 DATVVVSLDSKSEEWNEFEDALIGELLRVLASYGDVVLVRFAHEGMLVTFRDGRSALAALALNGKQILGRTLKIRLK 77
RRM_SYNJ2 cd12720
RNA recognition motif (RRM) found in synaptojanin-2 and similar proteins; This subgroup ...
1040-1113 1.21e-10

RNA recognition motif (RRM) found in synaptojanin-2 and similar proteins; This subgroup corresponds to the RRM of synaptojanin-2, also termed synaptic inositol-1,4,5-trisphosphate 5-phosphatase 2, a ubiquitously expressed central regulatory enzyme in the phosphoinositide-signaling cascade. As a novel Rac1 effector regulating the early step of clathrin-mediated endocytosis, synaptojanin-2 acts as a polyphosphoinositide phosphatase directly and specifically interacting with Rac1 in a GTP-dependent manner. It mediates the inhibitory effect of Rac1 on endocytosis and plays an important role in the Rac1-mediated control of cell growth. Synaptojanin-2 shows high sequence homology to the N-terminal Sac1p homology domain, the central inositol 5-phosphatase domain, and the putative RNA recognition motif (RRM) of synaptojanin-1, but differs in the proline-rich region.


Pssm-ID: 410119 [Multi-domain]  Cd Length: 78  Bit Score: 59.03  E-value: 1.21e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720386233 1040 DGTVLVSIKS-SAQESTFFDDALIDELLRQFAHFGEVILIRFVEDKMWVTFLEGSSALNALSLNGKELLNRTITI 1113
Cdd:cd12720      1 DATVVVNLLSpTLEEKNDFPEDLSTELVQCFQSYGTVILVRFNRGQMLVTFEDSRSALRVLDLDGIKVNGRAVKI 75
PHA03247 PHA03247
large tegument protein UL36; Provisional
1201-1397 6.25e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.64  E-value: 6.25e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1201 PTVPEYSAPSLPIRPSRAPSRTPgPPSSQGSPVDTQPAAQKDSS--QTLEPKRPPPPRPVAPPARPAPPQRPPPPSGARS 1278
Cdd:PHA03247  2690 PTVGSLTSLADPPPPPPTPEPAP-HALVSATPLPPGPAAARQASpaLPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1279 PAPARKEFGGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKP 1358
Cdd:PHA03247  2769 PAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1720386233 1359 QAAF--------------PQQPSLPTPAQKLQDPLVPIAAPTMPPSG---PQPNLE 1397
Cdd:PHA03247  2849 SLPLggsvapggdvrrrpPSRSPAAKPAAPARPPVRRLARPAVSRSTesfALPPDQ 2904
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1211-1392 1.46e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 56.04  E-value: 1.46e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1211 LPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTlEPKRPPPPRPVAPPARPAPPQRPPPPSGARSPAP--------A 1282
Cdd:PRK12323   361 LAFRPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAA-PAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPealaaarqA 439
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1283 RKEFGGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSqSKPSETLKGPAVLPEPLKPQAAF 1362
Cdd:PRK12323   440 SARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDD-PPPWEELPPEFASPAPAQPDAAP 518
                          170       180       190
                   ....*....|....*....|....*....|
gi 1720386233 1363 PQQPSLPTPAQKLQDPLVPIAAPTMPPSGP 1392
Cdd:PRK12323   519 AGWVAESIPDPATADPDDAFETLAPAPAAA 548
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1200-1394 3.31e-07

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 54.99  E-value: 3.31e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1200 SPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSG-ARS 1278
Cdd:PRK07764   596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAkAGG 675
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1279 PAPArkefggrNQPSPQAGLAGPGPAGyGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKP 1358
Cdd:PRK07764   676 AAPA-------APPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDD 747
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720386233 1359 QAAFPQQPSLPTPAQKLQDPLVPIAAPTMPPSGPQP 1394
Cdd:PRK07764   748 PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEE 783
EEP cd08372
Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; This large superfamily includes ...
787-884 1.10e-06

Exonuclease-Endonuclease-Phosphatase (EEP) domain superfamily; This large superfamily includes the catalytic domain (exonuclease/endonuclease/phosphatase or EEP domain) of a diverse set of proteins including the ExoIII family of apurinic/apyrimidinic (AP) endonucleases, inositol polyphosphate 5-phosphatases (INPP5), neutral sphingomyelinases (nSMases), deadenylases (such as the vertebrate circadian-clock regulated nocturnin), bacterial cytolethal distending toxin B (CdtB), deoxyribonuclease 1 (DNase1), the endonuclease domain of the non-LTR retrotransposon LINE-1, and related domains. These diverse enzymes share a common catalytic mechanism of cleaving phosphodiester bonds; their substrates range from nucleic acids to phospholipids and perhaps proteins.


Pssm-ID: 197306 [Multi-domain]  Cd Length: 241  Bit Score: 51.33  E-value: 1.10e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  787 VFIRPqhaPFIRDVAVDTVKTGMGgATGNKGAVAIRMLFHTTSLCFVCSHFAAGQSQVKERNEDFVEIARKLSFpmgRML 866
Cdd:cd08372     71 ILSKT---PKFKIVEKHQYKFGEG-DSGERRAVVVKFDVHDKELCVVNAHLQAGGTRADVRDAQLKEVLEFLKR---LRQ 143
                           90
                   ....*....|....*...
gi 1720386233  867 FSHDYVFWCGDFNYRIDL 884
Cdd:cd08372    144 PNSAPVVICGDFNVRPSE 161
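
The paired `gi` and `Cdd:` rows of an alignment block can be compared column by column to estimate how many aligned positions are identical. The following is a minimal Python sketch, assuming that `-` marks a gap and that lowercase letters mark residues outside the aligned region (an assumption about the display convention, not something stated on this page); the function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query, subject):
    """Return (identical, aligned) column counts for two equal-length alignment rows."""
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Count only columns where both rows show an uppercase residue;
        # gaps ('-') and lowercase letters are skipped (assumed unaligned).
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Final block of the EEP (cd08372) alignment above, residues 867-884 of the query.
print(aligned_identity("FSHDYVFWCGDFNYRIDL", "PNSAPVVICGDFNVRPSE"))  # (7, 18)
```

Only the sequence portion of each row is passed in; the leading identifier and the start and end coordinates have to be stripped first.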
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1157-1394 1.23e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.23  E-value: 1.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1157 DMEGDVDDySAEVEELLPQHLQPSSSSGLGTSPSSSPRTSPCQSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSP---V 1233
Cdd:pfam03154  153 DNESDSDS-SAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPhtlI 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1234 DTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGARSPAPARKEFGGRNQPSPQAGLAGPGPAGYGAARPTI 1313
Cdd:pfam03154  232 QQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPP 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1314 PARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEP-LKPQAAFPqQPSLPTPAQKLQDPLVPIAAP-TMPPSG 1391
Cdd:pfam03154  312 GPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPhIKPPPTTP-IPQLPNPQSHKHPPHLSGPSPfQMNSNL 390

                   ...
gi 1720386233 1392 PQP 1394
Cdd:pfam03154  391 PPP 393
PHA03247 PHA03247
large tegument protein UL36; Provisional
1207-1396 2.09e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 2.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1207 SAPSLPIRPsRAPSRTPGPPSSQGSPVDTQPAAqkdssqtlepkrpppprpvapparpappqrPPPPSGARSPAPARKEF 1286
Cdd:PHA03247  2590 DAPPQSARP-RAPVDDRGDPRGPAPPSPLPPDT------------------------------HAPDPPPPSPSPAANEP 2638
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1287 -GGRNQPSPQAGLAGPGPAGYGAARP---TIPARAGVISAPQSQARVCAGRPT---------PDSQSKPSETLKGPAV-- 1351
Cdd:PHA03247  2639 dPHPPPTVPPPERPRDDPAPGRVSRPrraRRLGRAAQASSPPQRPRRRAARPTvgsltsladPPPPPPTPEPAPHALVsa 2718
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720386233 1352 LPEPLKPQA---AFPQQPSLPTPAQKLQDPLVPI-----------------AAPTMPPSGPQPNL 1396
Cdd:PHA03247  2719 TPLPPGPAAarqASPALPAAPAPPAVPAGPATPGgparparppttagppapAPPAAPAAGPPRRL 2783
PHA03378 PHA03378
EBNA-3B; Provisional
1119-1394 2.52e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 52.38  E-value: 2.52e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1119 DWIKHLEEEMSLEKISVTLPSSASSTLLGEDAEVA--ADFDMEGDvddysaEVEELLPQHLQPSSSSGLGTSPSSSPRts 1196
Cdd:PHA03378   507 DLLEKDDEDMEQRVMATLLPPSPPQPRAGRRAPCVytEDLDIESD------EPASTEPVHDQLLPAPGLGPLQIQPLT-- 578
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1197 pcqSPTVPEY--SAPSLPIRPSRA--PSRTPGPPSSQGSPVDTQPAAQkdssqtlepkrPPPPRPVAPPARPAPPQRPPP 1272
Cdd:PHA03378   579 ---SPTTSQLasSAPSYAQTPWPVphPSQTPEPPTTQSHIPETSAPRQ-----------WPMPLRPIPMRPLRMQPITFN 644
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1273 PSGARSP-APARKE--FGGRNQPSPQAGLAGPGPAGYGAAR------------PTIPARAGVISAPQSQARVCAGRPTPD 1337
Cdd:PHA03378   645 VLVFPTPhQPPQVEitPYKPTWTQIGHIPYQPSPTGANTMLpiqwapgtmqppPRAPTPMRPPAAPPGRAQRPAAATGRA 724
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720386233 1338 SQSKPSET-LKGPAVLPEPLKPQAAFPQQPSLPTPAQKLQDPlvPIAAPTMPPSGPQP 1394
Cdd:PHA03378   725 RPPAAAPGrARPPAAAPGRARPPAAAPGRARPPAAAPGRARP--PAAAPGAPTPQPPP 780
INPP5A cd09092
Type I inositol polyphosphate 5-phosphatase I; Type I inositol polyphosphate 5-phosphatase I ...
824-1009 1.72e-05

Type I inositol polyphosphate 5-phosphatase I; Type I inositol polyphosphate 5-phosphatase I (INPP5A) hydrolyzes the 5-phosphate from inositol 1,3,4,5-tetrakisphosphate [I(1,3,4,5)P4] and inositol 1,4,5-trisphosphate [I(1,4,5)P3]. It belongs to a family of Mg2+-dependent inositol polyphosphate 5-phosphatases, which hydrolyze the 5-phosphate from the inositol ring of various 5-position phosphorylated phosphoinositides (PIs) and inositol phosphates (IPs), and to the large EEP (exonuclease/endonuclease/phosphatase) superfamily that contains functionally diverse enzymes that share a common catalytic mechanism of cleaving phosphodiester bonds. As the substrates of INPP5A mobilize intracellular calcium ions, INPP5A is a calcium signal-terminating enzyme. In platelets, phosphorylated pleckstrin binds and activates INPP5A in a 1:1 complex, and accelerates the degradation of the calcium ion-mobilizing I(1,4,5)P3.


Pssm-ID: 197326  Cd Length: 383  Bit Score: 49.00  E-value: 1.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  824 LFHTTSLCFVCSHFAAGQSQVKERNEDFVeIARKLSFPMGRMLFshdYVFwcGDFNYRIDL------------------- 884
Cdd:cd09092    176 LFHDASNLAACESSPSVYSQNRHRALGYV-LERLTDERFEKVPF---FVF--GDFNFRLDTksvvetlcakatmqtvrka 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  885 -PNEEVKELIRQQNWD----------SLIAGDQLINQKNAGQIFRGF-----------LEGKVTFAPTYKYdlfSEDYDT 942
Cdd:cd09092    250 dSNIVVKLEFREKDNDnkvvlqiekkKFDYFNQDVFRDNNGKALLKFdkelevfkdvlYELDISFPPSYPY---SEDPEQ 326
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  943 SE---KCRTPAWTDRVLwrrrkwpFDRSAEDLDLLNasfqDESKILYTwtpgtllHYGRaELKTSDHRPV 1009
Cdd:cd09092    327 GTqymNTRCPAWCDRIL-------MSHSARELKSEN----EEKSVTYD-------MIGP-NVCMGDHKPV 377
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1291-1394 5.07e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.06  E-value: 5.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1291 QPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQArvcagRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPT 1370
Cdd:PRK07764   403 AAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAP-----APPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPT 477
                           90       100
                   ....*....|....*....|....
gi 1720386233 1371 PAQklqdplVPIAAPTMPPSGPQP 1394
Cdd:PRK07764   478 AAP------APAPPAAPAPAAAPA 495
RRM3_Prp24 cd12298
RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar ...
1062-1115 6.62e-05

RNA recognition motif 3 in fungal pre-messenger RNA splicing protein 24 (Prp24) and similar proteins; This subfamily corresponds to the RRM3 of Prp24, also termed U4/U6 snRNA-associated-splicing factor PRP24 (U4/U6 snRNP), an RNA-binding protein with four well conserved RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains). It facilitates U6 RNA base-pairing with U4 RNA during spliceosome assembly. Prp24 specifically binds free U6 RNA primarily with RRMs 1 and 2 and facilitates pairing of U6 RNA bases with U4 RNA bases. Additionally, it may also be involved in dissociation of the U4/U6 complex during spliceosome activation.


Pssm-ID: 409739 [Multi-domain]  Cd Length: 78  Bit Score: 42.63  E-value: 6.62e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720386233 1062 IDELLRQFAHFGEV---ILIRFVEDKM--------WVTFLEGSSALNALSLNGKELLNRTITITL 1115
Cdd:cd12298     14 EEALRGIFEKFGEIesiNIPKKQKNRKgrhnngfaFVTFEDADSAESALQLNGTLLDNRKISVSL 78
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
1274-1395 1.26e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.70  E-value: 1.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1274 SGARSPAPARKEFGGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLP 1353
Cdd:PHA03307   808 AADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAA 887
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1720386233 1354 EPLKPQAAFPQQPSLPTPAQKLQDPLVPiaaptMPPSGPQPN 1395
Cdd:PHA03307   888 PPKAAAAAPPAGAPAPRPRPAPRVKLGP-----MPPGGPDPR 924
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
1275-1394 1.45e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 43.87  E-value: 1.45e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1275 GARSPAPARKEFGGRNQPSPQAGLAGP-GPAGYGAARPTIPARAGvisAPQSQARVCAGRPTPDSQSKPSETLKGPAvlp 1353
Cdd:pfam15240   59 PASDDPPGPPPPGGPQQPPPQGGKQKPqGPPPQGGPRPPPGKPQG---PPPQGGNQQQGPPPPGKPQGPPPQGGGPP--- 132
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1720386233 1354 eplkPQAAFPQQPSLPtPAQKLQDPlvpiaaPTMPPSGPQP 1394
Cdd:pfam15240  133 ----PQGGNQQGPPPP-PPGNPQGP------PQRPPQPGNP 162
PHA03378 PHA03378
EBNA-3B; Provisional
1201-1392 1.51e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.60  E-value: 1.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1201 PTVPEYSAPSLPIRPSRAPSRTPGPPSSQG--SPVDTQPAAQKdssqtlepkrpppprpvapparpappqrppPPSGARS 1278
Cdd:PHA03378   673 PYQPSPTGANTMLPIQWAPGTMQPPPRAPTpmRPPAAPPGRAQ------------------------------RPAAATG 722
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1279 PAPARKEFGGRNQPsPQaglAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQskpsetlkgPAVLPEPL-K 1357
Cdd:PHA03378   723 RARPPAAAPGRARP-PA---AAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP---------PQAPPAPQqR 789
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1720386233 1358 PQAAfpqqpslPTPAQKLQDPlvPIAAPTMPPSGP 1392
Cdd:PHA03378   790 PRGA-------PTPQPPPQAG--PTSMQLMPRAAP 815
RRM_SF cd00590
RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP ...
1063-1113 1.60e-04

RNA recognition motif (RRM) superfamily; RRM, also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety of heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).


Pssm-ID: 409669 [Multi-domain]  Cd Length: 72  Bit Score: 41.50  E-value: 1.60e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1720386233 1063 DELLRQFAHFGEVILIRFVEDKM-------WVTFLEGSSALNALS-LNGKELLNRTITI 1113
Cdd:cd00590     13 EDLRELFSKFGEVVSVRIVRDRDgkskgfaFVEFESPEDAEKALEaLNGTELGGRPLKV 71
PHA03247 PHA03247
large tegument protein UL36; Provisional
1207-1395 1.77e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.77e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1207 SAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTlepkrpppprPVAPPARPAPPQRPPPPSGARSPAParkef 1286
Cdd:PHA03247  2794 SRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTS----------AQPTAPPPPPGPPPPSLPLGGSVAP----- 2858
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1287 GG--RNQPSPQAGLAGPgpagygAARPTIPARAgvISAPQSQARVCAGRPTPDSQSKPSETLKGPAVLPEPLKPQAAFPq 1364
Cdd:PHA03247  2859 GGdvRRRPPSRSPAAKP------AAPARPPVRR--LARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP- 2929
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1720386233 1365 QPSLPTPAQKlQDPLVPIA--APTMPPSGPQPN 1395
Cdd:PHA03247  2930 QPPPPPPPRP-QPPLAPTTdpAGAGEPSGAVPQ 2961
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
1080-1396 2.27e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 45.72  E-value: 2.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1080 FVEDKMWVTFLEGSSALNAL-------SLNGKELLNRTIT--ITLKSPDWIKHLEE-EMSLEKISVTLPSSASSTLLGEd 1149
Cdd:pfam17823   28 FVLNKMWNGAGKQNASGDAVpradnksSEQ*NFCAATAAPapVTLTKGTSAAHLNStEVTAEHTPHGTDLSEPATREGA- 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1150 AEVAADFDMEGDVDDYSAEVEELLPqhlqpSSSSGLGTSPSSSPRTSPCQSPTVpeySAPSLPIRPSRAPsrTPGPPSSQ 1229
Cdd:pfam17823  107 ADGAASRALAAAASSSPSSAAQSLP-----AAIAALPSEAFSAPRAAACRANAS---AAPRAAIAAASAP--HAASPAPR 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1230 GSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGARSPAPARKefgGRNQPSPQAGLAGPG---PAGY 1306
Cdd:pfam17823  177 TAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAV---GNSSPAAGTVTAAVGtvtPAAL 253
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1307 G---AARPTIPARAGVISAPQSQARvcagRPTPdSQSKPSETL-KGPAvlpEPLKPQAAFP------QQPSLPTPAQKLQ 1376
Cdd:pfam17823  254 AtlaAAAGTVASAAGTINMGDPHAR----RLSP-AKHMPSDTMaRNPA---APMGAQAQGPiiqvstDQPVHNTAGEPTP 325
                          330       340
                   ....*....|....*....|
gi 1720386233 1377 DPLVPIAAPTMPPSGPQPNL 1396
Cdd:pfam17823  326 SPSNTTLEPNTPKSVASTNL 345
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1199-1394 6.16e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 44.37  E-value: 6.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1199 QSPTVPEYSAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQrppppsgarS 1278
Cdd:NF033839   285 KEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEV---------K 355
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1279 PAPARkefggrnqPSPQaglAGPGPAGygaARPTIPARAGvisAPQSQARVCAGRPTPDSQSKPsETLKgPAVLPEPLKP 1358
Cdd:NF033839   356 PQPEK--------PKPE---VKPQPEK---PKPEVKPQPE---TPKPEVKPQPEKPKPEVKPQP-EKPK-PEVKPQPEKP 416
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1720386233 1359 QAAFPQQPSLPTPAQKLQdPLVPIAAPTMPPSGPQP 1394
Cdd:NF033839   417 KPEVKPQPEKPKPEVKPQ-PEKPKPEVKPQPEKPKP 451
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1206-1389 8.38e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 8.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1206 YSAPSLPIRPSrAPSRTPGPPSSQGSpVDTQPAAqkdSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGARSPAPARKE 1285
Cdd:PRK10263   333 WAAPVEPVTQT-PPVASVDVPPAQPT-VAWQPVP---GPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPY 407
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1286 FGGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQArvcagrPTPDSQSKPSETLKGPAVLPEPLKPQAAFPQQ 1365
Cdd:PRK10263   408 YAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQA------EEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQP 481
                          170       180
                   ....*....|....*....|....
gi 1720386233 1366 PSLPTPAQKLQDPLVPIAAPTMPP 1389
Cdd:PRK10263   482 QPVEQQPVVEPEPVVEETKPARPP 505
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1201-1393 8.47e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 8.47e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1201 PTVPEysAPSLPIRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPPPPSGARSPA 1280
Cdd:PRK07764   615 PAAPA--APAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAA 692
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1281 PArkefgGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVcAGRPTPDSQSKPSETLKGPAVLPEPLKPQA 1360
Cdd:PRK07764   693 PA-----GAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAAD-DPVPLPPEPDDPPDPAGAPAQPPPPPAPAP 766
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1720386233 1361 AFPQQPSLPTPAQKLQDPLVPIAAPTMPPSGPQ 1393
Cdd:PRK07764   767 AAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRR 799
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
1279-1393 9.34e-04

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 41.56  E-value: 9.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1279 PAParkefGGRNQPSPQAGLAGP-GPAGYGAARPTIPARAGvisAPQSQARVCAGRPTPDSQSKPSETLKGPAvlpeplk 1357
Cdd:pfam15240   68 PPP-----GGPQQPPPQGGKQKPqGPPPQGGPRPPPGKPQG---PPPQGGNQQQGPPPPGKPQGPPPQGGGPP------- 132
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1720386233 1358 PQAAFPQQPSLPtPAQKLQDPLVPIAAPTMPPSGPQ 1393
Cdd:pfam15240  133 PQGGNQQGPPPP-PPGNPQGPPQRPPQPGNPQGPPQ 167
PHA03247 PHA03247
large tegument protein UL36; Provisional
1174-1398 1.03e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1174 PQHLQPSSSSGLGTSPSSSPRTSPCQSPTVPEYSAPSLP-IRPSRAPSRTPGPPSSQGSPVDTQPAAQKDSSQTLepkrp 1252
Cdd:PHA03247  2621 THAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGrVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL----- 2695
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1253 ppprpvapparpappqrpppPSGARSPAPARKEfggrnQPSPQAGLAG----PGPAGYGAARPTIPAR------------ 1316
Cdd:PHA03247  2696 --------------------TSLADPPPPPPTP-----EPAPHALVSAtplpPGPAAARQASPALPAApappavpagpat 2750
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1317 -AGVISAPQSQARVCAGRPTP--DSQSKPSETLKGPAV---------LPEPLKPQ-------AAFPQQPSLPTPAQKLQD 1377
Cdd:PHA03247  2751 pGGPARPARPPTTAGPPAPAPpaAPAAGPPRRLTRPAVaslsesresLPSPWDPAdppaavlAPAAALPPAASPAGPLPP 2830
                          250       260
                   ....*....|....*....|.
gi 1720386233 1378 PLVPIAAPTMPPSGPQPNLET 1398
Cdd:PHA03247  2831 PTSAQPTAPPPPPGPPPPSLP 2851
RRM smart00360
RNA recognition motif;
1058-1113 1.60e-03

RNA recognition motif;


Pssm-ID: 214636 [Multi-domain]  Cd Length: 73  Bit Score: 38.34  E-value: 1.60e-03
                            10        20        30        40        50        60
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720386233  1058 DDALIDELLRQFAHFGEVILIRFVEDKMW--------VTFLEGSSALNALS-LNGKELLNRTITI 1113
Cdd:smart00360    9 PDTTEEELRELFSKFGKVESVRLVRDKETgkskgfafVEFESEEDAEKALEaLNGKELDGRPLKV 73
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
1207-1395 1.88e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 1.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1207 SAPSLPIRPSRAPSRTPGPPSSQGSPVD-TQPAAQKDSSQTLEpkrpppprpvapparpappqrppppsgARSPAPArke 1285
Cdd:pfam05109  430 TSPTLNTTGFAAPNTTTGLPSSTHVPTNlTAPASTGPTVSTAD---------------------------VTSPTPA--- 479
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1286 fGGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARV---CAGRPTPDSQSkPSETLKGPAVLPEPLKPQAAF 1362
Cdd:pfam05109  480 -GTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSptpAVTTPTPNATS-PTLGKTSPTSAVTTPTPNATS 557
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1720386233 1363 PqQPSLPTPAQKLQDPLVPIAAPTMPPSGPQPN 1395
Cdd:pfam05109  558 P-TPAVTTPTPNATIPTLGKTSPTSAVTTPTPN 589
PHA03264 PHA03264
envelope glycoprotein D; Provisional
1276-1385 2.22e-03

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 42.30  E-value: 2.22e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1276 ARSPAPARKEfggRNQPSPQAGLAGPGPAG-YGAARPTIPARAGVISAPQSQARVCAGRPTPdsqskpsetlkgPAVLPE 1354
Cdd:PHA03264   272 GGSPAPPGDD---RPEAKPEPGPVEDGAPGrETGGEGEGPEPAGRDGAAGGEPKPGPPRPAP------------DADRPE 336
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1720386233 1355 PLKPQAAFPQQPslPTPAQklqdPLVPIAAP 1385
Cdd:PHA03264   337 GWPSLEAITFPP--PTPAT----PAVPRARP 361
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
1297-1397 2.70e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 42.06  E-value: 2.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1297 GLAGPGPAGYGAARPTIPaRAGVISAPQSQARVCAGRP-TPDSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPTPAQKL 1375
Cdd:NF033839   278 GLTQDTPKEPGNKKPSAP-KPGMQPSPQPEKKEVKPEPeTPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKP 356
                           90       100
                   ....*....|....*....|..
gi 1720386233 1376 QdPLVPiaAPTMPPSGPQPNLE 1397
Cdd:NF033839   357 Q-PEKP--KPEVKPQPEKPKPE 375
PRK10263 PRK10263
DNA translocase FtsK; Provisional
1274-1426 2.73e-03


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.38  E-value: 2.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1274 SGARSPAPARKEF----GGRNQPSPQAGLAGPGPAGYGAARPTIPARAgviSAPQSQARVCAGRPTPDSQSKPSETLKGP 1349
Cdd:PRK10263   295 SGNRATQPEYDEYdpllNGAPITEPVAVAAAATTATQSWAAPVEPVTQ---TPPVASVDVPPAQPTVAWQPVPGPQTGEP 371
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1350 AVLPEP----LKPQAAFP--------QQPSLPTPAQKLQDPLVPIAAPTMPPSGPQPNLETPPQPPPRSRSSQSLPSDSS 1417
Cdd:PRK10263   372 VIAPAPegypQQSQYAQPavqyneplQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEE 451
                          170
                   ....*....|
gi 1720386233 1418 PQ-LQQEQPT 1426
Cdd:PRK10263   452 QQsTFAPQST 461
RRM2_SREK1 cd12260
RNA recognition motif 2 (RRM2) found in splicing regulatory glutamine/lysine-rich protein 1 ...
1063-1114 3.10e-03

RNA recognition motif 2 (RRM2) found in splicing regulatory glutamine/lysine-rich protein 1 (SREK1) and similar proteins; This subfamily corresponds to the RRM2 of SREK1, also termed serine/arginine-rich-splicing regulatory protein 86-kDa (SRrp86), or splicing factor arginine/serine-rich 12 (SFRS12), or splicing regulatory protein 508 amino acid (SRrp508). SREK1 belongs to a family of proteins containing regions rich in serine-arginine dipeptides (SR protein family), which is involved in bridge-complex formation and splicing by mediating protein-protein interactions across either introns or exons. It is a unique SR family member and it may play a crucial role in determining tissue-specific patterns of alternative splicing. SREK1 can alter splice site selection by both positively and negatively modulating the activity of other SR proteins. For instance, SREK1 can activate SRp20 and repress SC35 in a dose-dependent manner both in vitro and in vivo. In addition, SREK1 contains two (some contain only one) RNA recognition motifs (RRMs), also termed RBDs (RNA binding domains) or RNPs (ribonucleoprotein domains), and two serine-arginine (SR)-rich domains (SR domains) separated by an unusual glutamic acid-lysine (EK) rich region. The RRM and SR domains are highly conserved among other members of the SR superfamily. However, the EK domain is unique to SREK1. It modulates SR domain function through its involvement in the inhibition of both constitutive and alternative splicing and in splice-site selection.


Pssm-ID: 409705 [Multi-domain]  Cd Length: 85  Bit Score: 38.06  E-value: 3.10e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1720386233 1063 DELLRQFAHFGEVILIRFVEDK------MWVTFLEGSSALNALSLNGKELLNRTITIT 1114
Cdd:cd12260     19 DQLLEFFSQAGEVKYVRMAGDEtqptryAFVEFAEQTSVINALKLNGKMFGGRPLKVN 76
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1283-1394 3.32e-03


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.90  E-value: 3.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1283 RKEFGGRNQPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQAR-VCAGRPTPDSQ----SKPSETLKGPAVLPEPLK 1357
Cdd:PRK07764   575 AEELGGDWQVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAApAAPAAPAPAGAaaapAEASAAPAPGVAAPEHHP 654
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1720386233 1358 PQAAFPQQPSLPTPAQKLQDPLVPIAAPTMPPSGPQP 1394
Cdd:PRK07764   655 KHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPA 691
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
1287-1394 4.03e-03


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.76  E-value: 4.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1287 GGRNQPSPQAGLAGPGPAGYGAA-------RPTIPARAGVISAPQSQA-RVCAGRPTPDSQSKP---SETLKGPAVLPEP 1355
Cdd:PRK07003   369 GGGVPARVAGAVPAPGARAAAAVgasavpaVTAVTGAAGAALAPKAAAaAAATRAEAPPAAPAPpatADRGDDAADGDAP 448
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1720386233 1356 LKPQAAFPQQPSLPTPAQKLQDPLVPiaAPTMPPSGPQP 1394
Cdd:PRK07003   449 VPAKANARASADSRCDERDAQPPADS--GSASAPASDAP 485
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
1171-1395 4.05e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 41.68  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1171 ELLPQHLqPSSSSGLGTSPSSSPRTSPCQSPTVPeysaPSL--PIRPSRAPSRTpGPPSSQgSPVDTQPAAQkdSSQTLE 1248
Cdd:pfam03154  236 TLHPQRL-PSPHPPLQPMTQPPPPSQVSPQPLPQ----PSLhgQMPPMPHSLQT-GPSHMQ-HPVPPQPFPL--TPQSSQ 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1249 PKRPPPPRPVAPPARPAPPQRPPPPSGARSPAPARKefggrnQPSPQAGLAGPgpagYGAARPTIPARAgvISAPQSQ-- 1326
Cdd:pfam03154  307 SQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPRE------QPLPPAPLSMP----HIKPPPTTPIPQ--LPNPQSHkh 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1327 -ARVCAGRP-------TPDSQSKPSETLKG---PAVLPEPLK--PQA----AFPQQP-------SLPTPAQKL------- 1375
Cdd:pfam03154  375 pPHLSGPSPfqmnsnlPPPPALKPLSSLSThhpPSAHPPPLQlmPQSqqlpPPPAQPpvltqsqSLPPPAASHpptsglh 454
                          250       260       270
                   ....*....|....*....|....*....|
gi 1720386233 1376 ---------QDPLVPIAAPT-MPPSGPQPN 1395
Cdd:pfam03154  455 qvpsqspfpQHPFVPGGPPPiTPPSGPPTS 484
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
1292-1389 5.04e-03


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 41.33  E-value: 5.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1292 PSPQAGLAGPGPAGYGAARPTIparagvisAPQSQARVCAGRPTPDSQSKPsETLKGPAVLPEPLKPQAAFPQQPSLPTP 1371
Cdd:PRK14950   362 PVPAPQPAKPTAAAPSPVRPTP--------APSTRPKAAAAANIPPKEPVR-ETATPPPVPPRPVAPPVPHTPESAPKLT 432
                           90
                   ....*....|....*...
gi 1720386233 1372 AQKLQDPLVPIAAPTMPP 1389
Cdd:PRK14950   433 RAAIPVDEKPKYTPPAPP 450
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
1291-1395 5.18e-03


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.40  E-value: 5.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1291 QPSPQAGLAGPGPAGYGAARPTIPARAGVISAPQSQARVCAGRPTPDSQSKPSET-LKGPAVLPEPLKPQAAFPQQPSL- 1368
Cdd:PRK12323   364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAvAAAPARRSPAPEALAAARQASARg 443
                           90       100
                   ....*....|....*....|....*....
gi 1720386233 1369 --PTPAQKLQDPLVPIAAPTMPPSGPQPN 1395
Cdd:PRK12323   444 pgGAPAPAPAPAAAPAAAARPAAAGPRPV 472
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
1294-1394 6.27e-03


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 40.91  E-value: 6.27e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1294 PQAGLAGPGPAGygAARPTIPARAGVISAPQSQARVCAgrptpdSQSKPSETLKGPAVLPEPLKPQAAFPQQPSLPtpaq 1373
Cdd:PRK14971   363 TQKGDDASGGRG--PKQHIKPVFTQPAAAPQPSAAAAA------SPSPSQSSAAAQPSAPQSATQPAGTPPTVSVD---- 430
                           90       100
                   ....*....|....*....|.
gi 1720386233 1374 klqdplVPIAAPTMPPSGPQP 1394
Cdd:PRK14971   431 ------PPAAVPVNPPSTAPQ 445
PHA03247 PHA03247
large tegument protein UL36; Provisional
1201-1404 7.86e-03


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.08  E-value: 7.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1201 PTVPEysAPSLPIRPSRAPSR--TPGPPSSQ-------GSPVDTQPAAQKDSSQTLEPKRPPPPRPVAPPARPAPPQRPP 1271
Cdd:PHA03247  2742 PAVPA--GPATPGGPARPARPptTAGPPAPAppaapaaGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP 2819
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1272 PPSG---------ARSPAPARKEFGGRNQPSPQAGLAGPG-------PAGYGAARPTIP--------ARAGVISAPQSQA 1327
Cdd:PHA03247  2820 PAASpagplppptSAQPTAPPPPPGPPPPSLPLGGSVAPGgdvrrrpPSRSPAAKPAAParppvrrlARPAVSRSTESFA 2899
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233 1328 RVCAG---RPTPDSQSKPSETLKGPAV-LPEPLKPQAAFPQQPSLPT--------PAQKLQDP----LVP--IAAPTMPP 1389
Cdd:PHA03247  2900 LPPDQperPPQPQAPPPPQPQPQPPPPpQPQPPPPPPPRPQPPLAPTtdpagagePSGAVPQPwlgaLVPgrVAVPRFRV 2979
                          250
                   ....*....|....*.
gi 1720386233 1390 SGPQPNLET-PPQPPP 1404
Cdd:PHA03247  2980 PQPAPSREApASSTPP 2995
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
1311-1396 7.90e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Mutations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 38.62  E-value: 7.90e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720386233  1311 PTIPARAGVIsaPQSQARVCAGRP--TPDSQSKPSetLKGPAvlPEPLKPQAAFPQQPSLPTPAQKLQDPLVPI----AA 1384
Cdd:smart00818   59 PVLPAQQPVV--PQQPLMPVPGQHsmTPTQHHQPN--LPQPA--QQPFQPQPLQPPQPQQPMQPQPPVHPIPPLppqpPL 132
                            90
                    ....*....|..
gi 1720386233  1385 PTMPPSGPQPNL 1396
Cdd:smart00818  133 PPMFPMQPLPPL 144
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
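
For readers who want to post-process this listing, the following is a minimal Python sketch, not an NCBI tool. It parses the per-hit summary lines (the ones beginning with "Pssm-ID:") and checks each hit against the E-value threshold shown above. The function names parse_summary and passes_threshold are hypothetical, the line layout is assumed to match what appears in this listing, and treating the threshold as inclusive (<=) is an assumption.

```python
import re

# Matches the per-hit summary line as it appears in this listing, e.g.
# "Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 1.88e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<kind>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "kind": m.group("kind"),  # e.g. "Multi-domain"; None if the bracket is absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    example = ("Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  "
               "Bit Score: 42.98  E-value: 1.88e-03")
    hit = parse_summary(example)
    print(hit, passes_threshold(hit))
```

Every hit in this listing reports an E-value below 0.01, which is consistent with the preset threshold.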
