|
Name |
Accession |
Description |
Interval |
E-value |
| Neogenin_C |
pfam06583 |
Neogenin C-terminus; This family represents the C-terminus of eukaryotic neogenin precursor ... |
652-949 |
4.33e-127 |
|
Neogenin C-terminus; This family represents the C-terminus of eukaryotic neogenin precursor proteins, which contains several potential phosphorylation sites. Neogenin is a member of the N-CAM family of cell adhesion molecules (and therefore contains multiple copies of pfam00047 and pfam00041) and is closely related to the DCC tumour suppressor gene product - these proteins may play an integral role in regulating differentiation programmes and/or cell migration events within many adult and embryonic tissues.
Pssm-ID: 461954 [Multi-domain] Cd Length: 289 Bit Score: 385.81 E-value: 4.33e-127
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 652 LRPPDLWIHHEEMEMKNIEKPAGTDPAGRGSPI-QSCQDLTPVSHSQSESQMGSKSASHSGQDTEEAGSSmstlersLAA 730
Cdd:pfam06583 1 LKPPDLWIHHEQMELKNIEKSPSPNPSGTDSPIgQSSQDLPPVDHSQSESQIHQKSNSYSGNDSDEKSST-------LAG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 731 RRATRTKLMIPMEAQSNNPAVVSAIPVPTLESAQ---YPGILPSPTCGYPHPQFTLrpvPFPTLSVDRGFGAGRSQSVSE 807
Cdd:pfam06583 74 RRGTRPKMMLPMDSQPSNQPVVSAIPIPSLDSSHqyaHPGILPSPTCGYLHNQFSL---PFPGTPVPRSDTAPSAESVEN 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 808 GPTAQQQPMLPPAQPEHP----SSEEAPSRTIPTACVRPTHPLRSFANPLLPPPMSaiepkvPYTPLLSQPGPTLPKTHV 883
Cdd:pfam06583 151 TPLQSQLPYQPSSQSESGslssAVEEEPNRSIPTAKVRPGHPLKSFSVPAPPPQSA------PSTPLQQQHRPTLSKSPV 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958740175 884 KTASLGLAGKARSPLlPVSVPTAPEVSEESHKPTEDPASVYEQDDLSEQMASLEGLMKQLNAITGS 949
Cdd:pfam06583 225 KTASLGTAGKARSPL-PVSVPNAPDTSEETERLLEDAAPSYETDELSEEMANLEGLMKDLNAITAS 289
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1-405 |
4.12e-22 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 102.00 E-value: 4.12e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 1 MYTFRVVAYNEWGPGESSQPIKVATQPELqvPGPVENLHAVSASPTSILITWEPPayANGPVQGYRLFCTEVSTGKEQNI 80
Cdd:COG3401 204 TYYYRVAATDTGGESAPSNEVSVTTPTTP--PSAPTGLTATADTPGSVTLSWDPV--TESDATGYRVYRSNSGDGPFTKV 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 81 -EVDGLSYKLEGLKKFTEYTLRFLAYNRYG-PGVSTDDITVVTLSDVPsAPPQNVSLEVVNSRSIKVSWlpppSGTQNGF 158
Cdd:COG3401 280 aTVTTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP-AAPSGLTATAVGSSSITLSW----TASSDAD 354
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 159 ITGYKIrHRKTTRRGEMETL--EPNNLWYLFTGLEKGSQYSFQVSAMTVNGT-GPPSNWYTAETPENDLDES--QVPDQP 233
Cdd:COG3401 355 VTGYNV-YRSTSGGGTYTKIaeTVTTTSYTDTGLTPGTTYYYKVTAVDAAGNeSAPSEEVSATTASAASGESltASVDAV 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 234 SSLHVRPQTNCIIMSWTPPLNPNIVVRGYIIGYGV-GSPYAETVRVDSKQRYYSIERLESSSHYVISLKAFNNAGEGVPL 312
Cdd:COG3401 434 PLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVpFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGASSVTNSVSVIG 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 313 YESATTRSITDPTDPVDYYPLLDDFPTSGPDVSTPMLPPVGVQAVAltheavrvSWADNSVPKNQKTSDVRLYTVRWRTS 392
Cdd:COG3401 514 ASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSASS--------SVSGAGLGSGNLYLITTLGGSLLTTT 585
|
410
....*....|...
gi 1958740175 393 FSASAKYKSEDTT 405
Cdd:COG3401 586 STNTNDVAGVHGG 598
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
32-121 |
4.31e-18 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 80.23 E-value: 4.31e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 32 PGPVENLHAVSASPTSILITWEPPAYANGPVQGYRLFCTEVSTGKEQNIEV---DGLSYKLEGLKKFTEYTLRFLAYNRY 108
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGSGDWKEVEVtpgSETSYTLTGLKPGTEYEFRVRAVNGG 80
|
90
....*....|...
gi 1958740175 109 GPGVSTDDITVVT 121
Cdd:cd00063 81 GESPPSESVTVTT 93
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
129-209 |
3.22e-15 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 71.49 E-value: 3.22e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 129 PPQNVSLEVVNSRSIKVSWLPPPSGTQNGFITGYKIRHRKTTRRGEMETLEPNNLWYLFTGLEKGSQYSFQVSAMTVNGT 208
Cdd:smart00060 3 PPSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82
|
.
gi 1958740175 209 G 209
Cdd:smart00060 83 G 83
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
450-545 |
8.85e-14 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 67.91 E-value: 8.85e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 450 SAPKDLTVitREGKPRAVIVSWQPPLEANGKITAYILFYTLDKNipiDDWI-METISGDRLTHQIMDLSLDTMYYFRIQA 528
Cdd:cd00063 2 SPPTNLRV--TDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGS---GDWKeVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
|
90
....*....|....*..
gi 1958740175 529 RNAKGVGPLSDPILFRT 545
Cdd:cd00063 77 VNGGGESPPSESVTVTT 93
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
33-111 |
1.02e-13 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 67.44 E-value: 1.02e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 33 GPVENLHAVSASPTSILITWEPPAYANGPVQGYRLFCTEVSTGK-EQNIEVDG--LSYKLEGLKKFTEYTLRFLAYNRYG 109
Cdd:pfam00041 1 SAPSNLTVTDVTSTSLTVSWTPPPDGNGPITGYEVEYRPKNSGEpWNEITVPGttTSVTLTGLKPGTEYEVRVQAVNGGG 80
|
..
gi 1958740175 110 PG 111
Cdd:pfam00041 81 EG 82
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
450-538 |
3.24e-13 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 65.90 E-value: 3.24e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 450 SAPKDLTVITREgkPRAVIVSWQPPLEANGKITAYILFY-TLDKNipiDDWIMETISGDRLTHQIMDLSLDTMYYFRIQA 528
Cdd:pfam00041 1 SAPSNLTVTDVT--STSLTVSWTPPPDGNGPITGYEVEYrPKNSG---EPWNEITVPGTTTSVTLTGLKPGTEYEVRVQA 75
|
90
....*....|
gi 1958740175 529 RNAKGVGPLS 538
Cdd:pfam00041 76 VNGGGEGPPS 85
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
450-535 |
8.36e-11 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 59.17 E-value: 8.36e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 450 SAPKDLTVITREgkPRAVIVSWQPPLEANGkiTAYILFYTLDKNIPIDDWIMETISGDRLTHQIMDLSLDTMYYFRIQAR 529
Cdd:smart00060 2 SPPSNLRVTDVT--STSVTLSWEPPPDDGI--TGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
|
....*.
gi 1958740175 530 NAKGVG 535
Cdd:smart00060 78 NGAGEG 83
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
646-931 |
9.41e-06 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 49.69 E-value: 9.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 646 KGSQKDLRPPDlwihhEEMEMKNIEKPAGtdPAGRGSPiqscqdltPVSHSQSESQMGSKSASHSGQDTEEAGSSMSTLE 725
Cdd:PTZ00449 490 KKSKKKLAPIE-----EEDSDKHDEPPEG--PEASGLP--------PKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKE 554
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 726 RSLAARratrtklmiPMEAQSNNPavvSAIPVPTlESAQYPGILPSPTcGYPHPQFTLRPVpfptlsvdrgfgagrsqsV 805
Cdd:PTZ00449 555 GEVGKK---------PGPAKEHKP---SKIPTLS-KKPEFPKDPKHPK-DPEEPKKPKRPR------------------S 602
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 806 SEGPTAQQQPMLP-----PAQPEHPSSEEAPSRTIPTAcvRPTHPLRSFANPLLPPPMSAIEPKVPYTPLL--------- 871
Cdd:PTZ00449 603 AQRPTRPKSPKLPelldiPKSPKRPESPKSPKRPPPPQ--RPSSPERPEGPKIIKSPKPPKSPKPPFDPKFkekfyddyl 680
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 872 -----SQPGPTLPKTHVKTASLGLAGKARSPLLPVSV-----PTAPEVSEESHKPTEDPASvyEQDDLSE 931
Cdd:PTZ00449 681 daaakSKETKTTVVLDESFESILKETLPETPGTPFTTprplpPKLPRDEEFPFEPIGDPDA--EQPDDIE 748
|
|
| COG4733 |
COG4733 |
Phage-related protein, tail protein J [Mobilome: prophages, transposons]; |
344-533 |
2.41e-04 |
|
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
Pssm-ID: 443767 [Multi-domain] Cd Length: 978 Bit Score: 44.94 E-value: 2.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 344 VSTPMLPPVGVQAVALTheaVRVSWADNSVpkNQKTSDVRLyTVRWRTS---FSASAKYKSED--------TTSLSYTAT 412
Cdd:COG4733 519 IDAGAFDDVPPQWPPVN---VTTSESLSVV--AQGTAVTTL-TVSWDAPagaVAYEVEWRRDDgnwvsvprTSGTSFEVP 592
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 413 GLKPNTmYEFSVM-VTKNRRSSTWSMTAHAT-TYEAAPTSAPKDLTVitrEGKPRAVIVSWQPPLEANgkITAYILFYTl 490
Cdd:COG4733 593 GIYAGD-YEVRVRaINALGVSSAWAASSETTvTGKTAPPPAPTGLTA---TGGLGGITLSWSFPVDAD--TLRTEIRYS- 665
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1958740175 491 dkniPIDDW---IMETISGDRLTHQIMDLSLDTMYYFRIQARNAKG 533
Cdd:COG4733 666 ----TTGDWasaTVAQALYPGNTYTLAGLKAGQTYYYRARAVDRSG 707
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Neogenin_C |
pfam06583 |
Neogenin C-terminus; This family represents the C-terminus of eukaryotic neogenin precursor ... |
652-949 |
4.33e-127 |
|
Neogenin C-terminus; This family represents the C-terminus of eukaryotic neogenin precursor proteins, which contains several potential phosphorylation sites. Neogenin is a member of the N-CAM family of cell adhesion molecules (and therefore contains multiple copies of pfam00047 and pfam00041) and is closely related to the DCC tumour suppressor gene product - these proteins may play an integral role in regulating differentiation programmes and/or cell migration events within many adult and embryonic tissues.
Pssm-ID: 461954 [Multi-domain] Cd Length: 289 Bit Score: 385.81 E-value: 4.33e-127
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 652 LRPPDLWIHHEEMEMKNIEKPAGTDPAGRGSPI-QSCQDLTPVSHSQSESQMGSKSASHSGQDTEEAGSSmstlersLAA 730
Cdd:pfam06583 1 LKPPDLWIHHEQMELKNIEKSPSPNPSGTDSPIgQSSQDLPPVDHSQSESQIHQKSNSYSGNDSDEKSST-------LAG 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 731 RRATRTKLMIPMEAQSNNPAVVSAIPVPTLESAQ---YPGILPSPTCGYPHPQFTLrpvPFPTLSVDRGFGAGRSQSVSE 807
Cdd:pfam06583 74 RRGTRPKMMLPMDSQPSNQPVVSAIPIPSLDSSHqyaHPGILPSPTCGYLHNQFSL---PFPGTPVPRSDTAPSAESVEN 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 808 GPTAQQQPMLPPAQPEHP----SSEEAPSRTIPTACVRPTHPLRSFANPLLPPPMSaiepkvPYTPLLSQPGPTLPKTHV 883
Cdd:pfam06583 151 TPLQSQLPYQPSSQSESGslssAVEEEPNRSIPTAKVRPGHPLKSFSVPAPPPQSA------PSTPLQQQHRPTLSKSPV 224
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958740175 884 KTASLGLAGKARSPLlPVSVPTAPEVSEESHKPTEDPASVYEQDDLSEQMASLEGLMKQLNAITGS 949
Cdd:pfam06583 225 KTASLGTAGKARSPL-PVSVPNAPDTSEETERLLEDAAPSYETDELSEEMANLEGLMKDLNAITAS 289
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
1-405 |
4.12e-22 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 102.00 E-value: 4.12e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 1 MYTFRVVAYNEWGPGESSQPIKVATQPELqvPGPVENLHAVSASPTSILITWEPPayANGPVQGYRLFCTEVSTGKEQNI 80
Cdd:COG3401 204 TYYYRVAATDTGGESAPSNEVSVTTPTTP--PSAPTGLTATADTPGSVTLSWDPV--TESDATGYRVYRSNSGDGPFTKV 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 81 -EVDGLSYKLEGLKKFTEYTLRFLAYNRYG-PGVSTDDITVVTLSDVPsAPPQNVSLEVVNSRSIKVSWlpppSGTQNGF 158
Cdd:COG3401 280 aTVTTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPP-AAPSGLTATAVGSSSITLSW----TASSDAD 354
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 159 ITGYKIrHRKTTRRGEMETL--EPNNLWYLFTGLEKGSQYSFQVSAMTVNGT-GPPSNWYTAETPENDLDES--QVPDQP 233
Cdd:COG3401 355 VTGYNV-YRSTSGGGTYTKIaeTVTTTSYTDTGLTPGTTYYYKVTAVDAAGNeSAPSEEVSATTASAASGESltASVDAV 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 234 SSLHVRPQTNCIIMSWTPPLNPNIVVRGYIIGYGV-GSPYAETVRVDSKQRYYSIERLESSSHYVISLKAFNNAGEGVPL 312
Cdd:COG3401 434 PLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVpFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGASSVTNSVSVIG 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 313 YESATTRSITDPTDPVDYYPLLDDFPTSGPDVSTPMLPPVGVQAVAltheavrvSWADNSVPKNQKTSDVRLYTVRWRTS 392
Cdd:COG3401 514 ASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSASS--------SVSGAGLGSGNLYLITTLGGSLLTTT 585
|
410
....*....|...
gi 1958740175 393 FSASAKYKSEDTT 405
Cdd:COG3401 586 STNTNDVAGVHGG 598
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
70-501 |
9.77e-19 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 91.22 E-value: 9.77e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 70 TEVSTGKEQNIEVDGLSYKLEGLKKFTEYTLRFLAYNRYGPGVSTDDITVVTLSDVPSaPPQNVSLEVVNSRSIKVSWLP 149
Cdd:COG3401 177 TAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTTPTTPPS-APTGLTATADTPGSVTLSWDP 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 150 PPsgtqNGFITGYKIrHRKTTRRGEMETL-EPNNLWYLFTGLEKGSQYSFQVSAMTVNGT-GPPSNWYTAETpendldES 227
Cdd:COG3401 256 VT----ESDATGYRV-YRSNSGDGPFTKVaTVTTTSYTDTGLTNGTTYYYRVTAVDAAGNeSAPSNVVSVTT------DL 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 228 QVPDQPSSLHVRPQTNCIIM-SWTPPLNPNIVvrGYII--GYGVGSPYaETVRVDSKQRYYSIERLESSSHYVISLKAFN 304
Cdd:COG3401 325 TPPAAPSGLTATAVGSSSITlSWTASSDADVT--GYNVyrSTSGGGTY-TKIAETVTTTSYTDTGLTPGTTYYYKVTAVD 401
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 305 NAGEGVPLYESATTRSITDPTDPVDYYPLLDDFPTSGPDVSTPML----PPVGVQAVALTHEAVRVSWADNSVPKNQKTS 380
Cdd:COG3401 402 AAGNESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASaasnPGVSAAVLADGGDTGNAVPFTTTSSTVTATT 481
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 381 DVRLYTVRWRTSFSASAKYKSEDTTSLSYTATGLKPNTMYEFSVMVTkNRRSSTWSMTAHATTYEAAPTSAPKDLTVITR 460
Cdd:COG3401 482 TDTTTANLSVTTGSLVGGSGASSVTNSVSVIGASAAAAVGGAPDGTP-NVTGASPVTVGASTGDVLITDLVSLTTSASSS 560
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1958740175 461 EGKPRAVIVSWQPPLEANGKITAYILFYTLDKNIPIDDWIM 501
Cdd:COG3401 561 VSGAGLGSGNLYLITTLGGSLLTTTSTNTNDVAGVHGGTLL 601
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
32-121 |
4.31e-18 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 80.23 E-value: 4.31e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 32 PGPVENLHAVSASPTSILITWEPPAYANGPVQGYRLFCTEVSTGKEQNIEV---DGLSYKLEGLKKFTEYTLRFLAYNRY 108
Cdd:cd00063 1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGSGDWKEVEVtpgSETSYTLTGLKPGTEYEFRVRAVNGG 80
|
90
....*....|...
gi 1958740175 109 GPGVSTDDITVVT 121
Cdd:cd00063 81 GESPPSESVTVTT 93
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
129-219 |
2.33e-16 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 75.23 E-value: 2.33e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 129 PPQNVSLEVVNSRSIKVSWLPPPSGtqNGFITGYKIRHRKTTRRG--EMETLEPNNLWYLFTGLEKGSQYSFQVSAMTVN 206
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSWTPPEDD--GGPITGYVVEYREKGSGDwkEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGG 80
|
90
....*....|...
gi 1958740175 207 GTGPPSNWYTAET 219
Cdd:cd00063 81 GESPPSESVTVTT 93
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
129-209 |
3.22e-15 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 71.49 E-value: 3.22e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 129 PPQNVSLEVVNSRSIKVSWLPPPSGTQNGFITGYKIRHRKTTRRGEMETLEPNNLWYLFTGLEKGSQYSFQVSAMTVNGT 208
Cdd:smart00060 3 PPSNLRVTDVTSTSVTLSWEPPPDDGITGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82
|
.
gi 1958740175 209 G 209
Cdd:smart00060 83 G 83
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
96-541 |
3.56e-14 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 76.58 E-value: 3.56e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 96 TEYTLRFLAYNRYGPGVSTDDITVVTLSDVPSAPPQNVSLEVVNSRSIKVSWLPPPSGTQNGFITGYKIRHRKTTRRGEm 175
Cdd:COG3401 1 TGSSYLTSLDAGIAASAAANTAVNALSKAGGSGKTILVYLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGV- 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 176 eTLEPNNLWYLFTGLEKGSQYSFQVSAMTVNGTGPPSNWYTAETPEN-DLDESQVPDQPSSLHVRPQTNCIIMSWTPPLN 254
Cdd:COG3401 80 -AAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTAtTATAVAGGAATAGTYALGAGLYGVDGANASGT 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 255 PNIVVRGYIIGYGVGSPYAETV-----RVDSKQRYYSIERLESSSHYVISLKAFNNAGEGVPlyesATTRSITDPTdpvd 329
Cdd:COG3401 159 TASSVAGAGVVVSPDTSATAAVattslTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAP----SNEVSVTTPT---- 230
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 330 yypllddfptsgpdvsTPMLPPVGVQAVALTHEAVRVSWADNSVPknqktsDVRLYTVRWRTSfsASAKYKS-EDTTSLS 408
Cdd:COG3401 231 ----------------TPPSAPTGLTATADTPGSVTLSWDPVTES------DATGYRVYRSNS--GDGPFTKvATVTTTS 286
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 409 YTATGLKPNTMYEFSVM-VTKNRRSSTWSMTAHATTYEAAPTsAPKDLTVITREgkPRAVIVSWQPPleANGKITAYILF 487
Cdd:COG3401 287 YTDTGLTNGTTYYYRVTaVDAAGNESAPSNVVSVTTDLTPPA-APSGLTATAVG--SSSITLSWTAS--SDADVTGYNVY 361
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1958740175 488 YTLDKNIPIDdWIMETISGdrLTHQIMDLSLDTMYYFRIQARNAKGV-GPLSDPI 541
Cdd:COG3401 362 RSTSGGGTYT-KIAETVTT--TSYTDTGLTPGTTYYYKVTAVDAAGNeSAPSEEV 413
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
450-545 |
8.85e-14 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 67.91 E-value: 8.85e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 450 SAPKDLTVitREGKPRAVIVSWQPPLEANGKITAYILFYTLDKNipiDDWI-METISGDRLTHQIMDLSLDTMYYFRIQA 528
Cdd:cd00063 2 SPPTNLRV--TDVTSTSVTLSWTPPEDDGGPITGYVVEYREKGS---GDWKeVEVTPGSETSYTLTGLKPGTEYEFRVRA 76
|
90
....*....|....*..
gi 1958740175 529 RNAKGVGPLSDPILFRT 545
Cdd:cd00063 77 VNGGGESPPSESVTVTT 93
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
33-111 |
1.02e-13 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 67.44 E-value: 1.02e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 33 GPVENLHAVSASPTSILITWEPPAYANGPVQGYRLFCTEVSTGK-EQNIEVDG--LSYKLEGLKKFTEYTLRFLAYNRYG 109
Cdd:pfam00041 1 SAPSNLTVTDVTSTSLTVSWTPPPDGNGPITGYEVEYRPKNSGEpWNEITVPGttTSVTLTGLKPGTEYEVRVQAVNGGG 80
|
..
gi 1958740175 110 PG 111
Cdd:pfam00041 81 EG 82
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
32-111 |
1.59e-13 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 66.87 E-value: 1.59e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 32 PGPVENLHAVSASPTSILITWEPPAYAN--GPVQGYRL-FCTEVSTGKEQNIEVDGLSYKLEGLKKFTEYTLRFLAYNRY 108
Cdd:smart00060 1 PSPPSNLRVTDVTSTSVTLSWEPPPDDGitGYIVGYRVeYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1958740175 109 GPG 111
Cdd:smart00060 81 GEG 83
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
450-538 |
3.24e-13 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 65.90 E-value: 3.24e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 450 SAPKDLTVITREgkPRAVIVSWQPPLEANGKITAYILFY-TLDKNipiDDWIMETISGDRLTHQIMDLSLDTMYYFRIQA 528
Cdd:pfam00041 1 SAPSNLTVTDVT--STSLTVSWTPPPDGNGPITGYEVEYrPKNSG---EPWNEITVPGTTTSVTLTGLKPGTEYEVRVQA 75
|
90
....*....|
gi 1958740175 529 RNAKGVGPLS 538
Cdd:pfam00041 76 VNGGGEGPPS 85
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
129-212 |
6.80e-13 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 65.13 E-value: 6.80e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 129 PPQNVSLEVVNSRSIKVSWLPPPSGtqNGFITGYKIRHRKTTRRGEM--ETLEPNNLWYLFTGLEKGSQYSFQVSAMTVN 206
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSWTPPPDG--NGPITGYEVEYRPKNSGEPWneITVPGTTTSVTLTGLKPGTEYEVRVQAVNGG 79
|
....*.
gi 1958740175 207 GTGPPS 212
Cdd:pfam00041 80 GEGPPS 85
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
450-535 |
8.36e-11 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 59.17 E-value: 8.36e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 450 SAPKDLTVITREgkPRAVIVSWQPPLEANGkiTAYILFYTLDKNIPIDDWIMETISGDRLTHQIMDLSLDTMYYFRIQAR 529
Cdd:smart00060 2 SPPSNLRVTDVT--STSVTLSWEPPPDDGI--TGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
|
....*.
gi 1958740175 530 NAKGVG 535
Cdd:smart00060 78 NGAGEG 83
|
|
| FN3 |
COG3401 |
Fibronectin type 3 domain [General function prediction only]; |
2-303 |
4.76e-10 |
|
Fibronectin type 3 domain [General function prediction only];
Pssm-ID: 442628 [Multi-domain] Cd Length: 603 Bit Score: 63.48 E-value: 4.76e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 2 YTFRVVAYNEWG-PGESSQPIKVATQPELqvPGPVENLHAVSASPTSILITWEPPayANGPVQGYRLFCTEVSTGKEQNI 80
Cdd:COG3401 298 YYYRVTAVDAAGnESAPSNVVSVTTDLTP--PAAPSGLTATAVGSSSITLSWTAS--SDADVTGYNVYRSTSGGGTYTKI 373
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 81 --EVDGLSYKLEGLKKFTEYTLRFLAYNRYGP-GVSTDDITVVTLSDVPSAPPQNVSLEVVNSRSIKVSW-----LPPPS 152
Cdd:COG3401 374 aeTVTTTSYTDTGLTPGTTYYYKVTAVDAAGNeSAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAaasaaSNPGV 453
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 153 GTQNGFITGYKIRHRKTTRRGEMETLEPNNLWYLFTGLEKGSQYSFQVSAMTVNGTGPPSNWYTAETPeNDLDESQVPDQ 232
Cdd:COG3401 454 SAAVLADGGDTGNAVPFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGASSVTNSVSVIGASAAAAVG-GAPDGTPNVTG 532
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1958740175 233 PSSLHVRPQTNCIIMSWTPPLNPNIVVRGYIIGYGVGSPYAETVRVDSKQRYYSIERLESSSHYVISLKAF 303
Cdd:COG3401 533 ASPVTVGASTGDVLITDLVSLTTSASSSVSGAGLGSGNLYLITTLGGSLLTTTSTNTNDVAGVHGGTLLVL 603
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
230-309 |
9.59e-10 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 56.08 E-value: 9.59e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 230 PDQPSSLHVRPQT-NCIIMSWTPPLNPNIV--VRGYIIGYGVGSPYAETVRVDSKQRYYSIERLESSSHYVISLKAFNNA 306
Cdd:smart00060 1 PSPPSNLRVTDVTsTSVTLSWEPPPDDGITgyIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80
|
...
gi 1958740175 307 GEG 309
Cdd:smart00060 81 GEG 83
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
350-443 |
1.35e-09 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 55.97 E-value: 1.35e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 350 PPVGVQAVALTHEAVRVSWadnsVPKNQKTSDVRLYTVRWR-TSFSASAKYKSEDTTSLSYTATGLKPNTMYEFSVMVTK 428
Cdd:cd00063 3 PPTNLRVTDVTSTSVTLSW----TPPEDDGGPITGYVVEYReKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVN 78
|
90
....*....|....*
gi 1958740175 429 NRRSSTWSMTAHATT 443
Cdd:cd00063 79 GGGESPPSESVTVTT 93
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
230-318 |
1.23e-08 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 53.27 E-value: 1.23e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 230 PDQPSSLHVRPQT-NCIIMSWTPPLNPNIVVRGYIIGY-GVGSPYAETVRV-DSKQRYYSIERLESSSHYVISLKAFNNA 306
Cdd:cd00063 1 PSPPTNLRVTDVTsTSVTLSWTPPEDDGGPITGYVVEYrEKGSGDWKEVEVtPGSETSYTLTGLKPGTEYEFRVRAVNGG 80
|
90
....*....|..
gi 1958740175 307 GEGVPLYESATT 318
Cdd:cd00063 81 GESPPSESVTVT 92
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
231-311 |
2.41e-07 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 49.34 E-value: 2.41e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 231 DQPSSLHVRPQT-NCIIMSWTPPLNPNIVVRGYIIGYGV--GSPYAETVRVDSKQRYYSIERLESSSHYVISLKAFNNAG 307
Cdd:pfam00041 1 SAPSNLTVTDVTsTSLTVSWTPPPDGNGPITGYEVEYRPknSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80
|
....
gi 1958740175 308 EGVP 311
Cdd:pfam00041 81 EGPP 84
|
|
| FN3 |
smart00060 |
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ... |
350-433 |
1.08e-06 |
|
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Pssm-ID: 214495 [Multi-domain] Cd Length: 83 Bit Score: 47.22 E-value: 1.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 350 PPVGVQAVALTHEAVRVSWadNSVPKNQKTSDVRLYTVRWRTSFSASAKYkSEDTTSLSYTATGLKPNTMYEFSVMVTKN 429
Cdd:smart00060 3 PPSNLRVTDVTSTSVTLSW--EPPPDDGITGYIVGYRVEYREEGSEWKEV-NVTPSSTSYTLTGLKPGTEYEFRVRAVNG 79
|
....
gi 1958740175 430 RRSS 433
Cdd:smart00060 80 AGEG 83
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
646-931 |
9.41e-06 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 49.69 E-value: 9.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 646 KGSQKDLRPPDlwihhEEMEMKNIEKPAGtdPAGRGSPiqscqdltPVSHSQSESQMGSKSASHSGQDTEEAGSSMSTLE 725
Cdd:PTZ00449 490 KKSKKKLAPIE-----EEDSDKHDEPPEG--PEASGLP--------PKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKE 554
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 726 RSLAARratrtklmiPMEAQSNNPavvSAIPVPTlESAQYPGILPSPTcGYPHPQFTLRPVpfptlsvdrgfgagrsqsV 805
Cdd:PTZ00449 555 GEVGKK---------PGPAKEHKP---SKIPTLS-KKPEFPKDPKHPK-DPEEPKKPKRPR------------------S 602
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 806 SEGPTAQQQPMLP-----PAQPEHPSSEEAPSRTIPTAcvRPTHPLRSFANPLLPPPMSAIEPKVPYTPLL--------- 871
Cdd:PTZ00449 603 AQRPTRPKSPKLPelldiPKSPKRPESPKSPKRPPPPQ--RPSSPERPEGPKIIKSPKPPKSPKPPFDPKFkekfyddyl 680
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 872 -----SQPGPTLPKTHVKTASLGLAGKARSPLLPVSV-----PTAPEVSEESHKPTEDPASvyEQDDLSE 931
Cdd:PTZ00449 681 daaakSKETKTTVVLDESFESILKETLPETPGTPFTTprplpPKLPRDEEFPFEPIGDPDA--EQPDDIE 748
|
|
| COG4733 |
COG4733 |
Phage-related protein, tail protein J [Mobilome: prophages, transposons]; |
126-558 |
1.32e-05 |
|
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
Pssm-ID: 443767 [Multi-domain] Cd Length: 978 Bit Score: 49.17 E-value: 1.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 126 PSAPPQNV----SLEVVNS----RSIKVSWLPPPSGTqngfitGYKIRHRK--TTRRGEMETLEPNnlwYLFTGLEKGsQ 195
Cdd:COG4733 529 PQWPPVNVttseSLSVVAQgtavTTLTVSWDAPAGAV------AYEVEWRRddGNWVSVPRTSGTS---FEVPGIYAG-D 598
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 196 YSFQVSAmtVNGTGPPSNWYTAETPENDLDESqVPDQPSSLHVRPQTNCIIMSWTPPLNPNivVRGYIIGYGVGSPY--A 273
Cdd:COG4733 599 YEVRVRA--INALGVSSAWAASSETTVTGKTA-PPPAPTGLTATGGLGGITLSWSFPVDAD--TLRTEIRYSTTGDWasA 673
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 274 ETVRVDSKQRYYSIERLESSSHYVISLKAFNNAGEGVPLYESATT--------RSITDPTDPVDYYPLLDDF--PTSGPD 343
Cdd:COG4733 674 TVAQALYPGNTYTLAGLKAGQTYYYRARAVDRSGNVSAWWVSGQAsadaagilDAITGQILETELGQELDAIiqNATVAE 753
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 344 VSTPMLPPVGVQAVALTHEAVRVSWADNSVPKNQKTSDVRLYTVRWRTS-FSASAKYKSEDTTSLSYTATGLKPNTMYEF 422
Cdd:COG4733 754 VVAATVTDVTAQIDTAVLFAGVATAAAIGAEARVAATVAESATAAAATGtAADAAGDASGGVTAGTSGTTGAGDTAASTT 833
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 423 SVMVTKNRRSSTWSMTAHATTYEAAPTSAPKDLTVITREGKPRAVIVSWQPPLEANGKITAYILFYTLDKNIPIDDWIME 502
Cdd:COG4733 834 RVAAAVVLAGVVVYGDAIIESGNTGDIVATGDIASAAAGAVATTVSGTTAADVSAVADSTAASLTAIVIAATTIIDAIGD 913
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*.
gi 1958740175 503 TISGDRlthQIMDLSLDTMYYFRIQARNAKGVGPLSDPILFRTLKVEHPDKMANDQ 558
Cdd:COG4733 914 GTTREP---AGDIGASGGAQGFAVTIVGSFDGAGAVATVDAGQSVVDGVGTAVEAA 966
|
|
| fn3 |
pfam00041 |
Fibronectin type III domain; |
350-424 |
1.73e-05 |
|
Fibronectin type III domain;
Pssm-ID: 394996 [Multi-domain] Cd Length: 85 Bit Score: 43.94 E-value: 1.73e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1958740175 350 PPVGVQAVALTHEAVRVSWAdnsvPKNQKTSDVRLYTVRWRTSFSASA-KYKSEDTTSLSYTATGLKPNTMYEFSV 424
Cdd:pfam00041 2 APSNLTVTDVTSTSLTVSWT----PPPDGNGPITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRV 73
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
692-920 |
2.26e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.31 E-value: 2.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 692 PVSHSQSESQMGSKSASHSGQDTEEAGSSMSTLERSlAARRATRTKLMIPMEAQSNNPAVVSAIPVPTLESAQYPGILPS 771
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRR-ARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPE 2709
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 772 PTcgyPHPQFTLRPVPfptlsvdrgFGAGRSQSVSEGPTAQQQPMLPPAQPEHPSSEEAPSR-TIPTACVRPTHPLRSFA 850
Cdd:PHA03247 2710 PA---PHALVSATPLP---------PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARpPTTAGPPAPAPPAAPAA 2777
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958740175 851 NP---LLPPPMSAIEPKVPYTPLLSQPGPTLPKTHVKTASLGLAGKARSPLLP--VSVPTAPEVSEESHKPTEDP 920
Cdd:PHA03247 2778 GPprrLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPptSAQPTAPPPPPGPPPPSLPL 2852
|
|
| COG4733 |
COG4733 |
Phage-related protein, tail protein J [Mobilome: prophages, transposons]; |
344-533 |
2.41e-04 |
|
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
Pssm-ID: 443767 [Multi-domain] Cd Length: 978 Bit Score: 44.94 E-value: 2.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 344 VSTPMLPPVGVQAVALTheaVRVSWADNSVpkNQKTSDVRLyTVRWRTS---FSASAKYKSED--------TTSLSYTAT 412
Cdd:COG4733 519 IDAGAFDDVPPQWPPVN---VTTSESLSVV--AQGTAVTTL-TVSWDAPagaVAYEVEWRRDDgnwvsvprTSGTSFEVP 592
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 413 GLKPNTmYEFSVM-VTKNRRSSTWSMTAHAT-TYEAAPTSAPKDLTVitrEGKPRAVIVSWQPPLEANgkITAYILFYTl 490
Cdd:COG4733 593 GIYAGD-YEVRVRaINALGVSSAWAASSETTvTGKTAPPPAPTGLTA---TGGLGGITLSWSFPVDAD--TLRTEIRYS- 665
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1958740175 491 dkniPIDDW---IMETISGDRLTHQIMDLSLDTMYYFRIQARNAKG 533
Cdd:COG4733 666 ----TTGDWasaTVAQALYPGNTYTLAGLKAGQTYYYRARAVDRSG 707
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
672-922 |
3.14e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.93 E-value: 3.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 672 PAGTDPAGRGSPIQSCQDLTPVSHSQSESQMGSKSASHSgqdTEEAGSSMSTLERSLAARRATRTKLMIPMEAQSNNPAV 751
Cdd:PHA03247 2734 ALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP---AAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAA 2810
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 752 VSAiPVPTLESAQYP-GILPSPTCGYPHPQFTLRPVPFPTLSVDRGFGAGrsQSVSEGPTAQQQPMLPpAQPEHPS---- 826
Cdd:PHA03247 2811 VLA-PAAALPPAASPaGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG--GDVRRRPPSRSPAAKP-AAPARPPvrrl 2886
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 827 SEEAPSRTIPTACVRPTHPLRSFANPLLPPPMSAIEPKVPYTPllsQPGPT--------LPKTHVKTASLGLAGKARSPL 898
Cdd:PHA03247 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP---QPPPPppprpqppLAPTTDPAGAGEPSGAVPQPW 2963
|
250 260
....*....|....*....|....*...
gi 1958740175 899 LPVSVPTAPEVSE----ESHKPTEDPAS 922
Cdd:PHA03247 2964 LGALVPGRVAVPRfrvpQPAPSREAPAS 2991
|
|
| Interfer-bind |
pfam09294 |
Interferon-alpha/beta receptor, fibronectin type III; Members of this family adopt a secondary ... |
129-220 |
4.04e-04 |
|
Interferon-alpha/beta receptor, fibronectin type III; Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha.
Pssm-ID: 462746 Cd Length: 103 Bit Score: 40.79 E-value: 4.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 129 PPQnVSLEVVNsRSIKVSWLPPPSGTQNGFIT-------GYKIRHRKTTRRGEMETLEPNNLWYLFTGLEKGSQYSFQVS 201
Cdd:pfam09294 5 PPE-VELEVEG-GSLNVTVKDPETREGKNLSLrdlygslQYRVSYWKNSSNGEKKNTTSTNSFVVLSDLEPGTTYCVSVQ 82
|
90 100
....*....|....*....|.
gi 1958740175 202 A--MTVNGTGPPSNWYTAETP 220
Cdd:pfam09294 83 AfsPLDNKSSQRSPPQCIRTT 103
|
|
| Pur_ac_phosph_N |
pfam16656 |
Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple ... |
364-443 |
1.12e-03 |
|
Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple acid phosphatase proteins.
Pssm-ID: 465220 [Multi-domain] Cd Length: 93 Bit Score: 38.93 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 364 VRVSWADNSVPKNQKtsdVRLYTVRWRTSFSASAK------YKSEDTTSLSYTATGLKPNTMYEFSVMVTknrrSSTWSM 437
Cdd:pfam16656 15 MTVSWVTPSAVTSPV---VQYGTSSSALTSTATATsstyttGDGGTGYIHRATLTGLEPGTTYYYRVGDD----NGGWSE 87
|
....*.
gi 1958740175 438 TAHATT 443
Cdd:pfam16656 88 VYSFTT 93
|
|
| Pur_ac_phosph_N |
pfam16656 |
Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple ... |
130-219 |
1.98e-03 |
|
Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple acid phosphatase proteins.
Pssm-ID: 465220 [Multi-domain] Cd Length: 93 Bit Score: 38.54 E-value: 1.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 130 PQNVSLEVVN-SRSIKVSWLPPPSGTQNGFITGYKIRHRKTTRRGEMETLEPNNLW------YLFTGLEKGSQYSFQVSA 202
Cdd:pfam16656 1 PEQVHLSLTGdSTSMTVSWVTPSAVTSPVVQYGTSSSALTSTATATSSTYTTGDGGtgyihrATLTGLEPGTTYYYRVGD 80
|
90
....*....|....*..
gi 1958740175 203 mtvnGTGPPSNWYTAET 219
Cdd:pfam16656 81 ----DNGGWSEVYSFTT 93
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
742-882 |
2.20e-03 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 41.95 E-value: 2.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 742 MEAQSNNPAvvsaiPVPTLESAQYPGILPSPtcgyPHPQFTLRPVPFPTLSVDRGFGAGRSQSVSEG--PTAQQQPMLPP 819
Cdd:pfam09770 203 MRAQAKKPA-----QQPAPAPAQPPAAPPAQ----QAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGhpVTILQRPQSPQ 273
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1958740175 820 AQPEHPSSEEAPSRTIPTACVRPTHPLRSFANPLLPPPmsAIEPKVPYTPLLSQPGPTLPKTH 882
Cdd:pfam09770 274 PDPAQPSIQPQAQQFHQQPPPVPVQPTQILQNPNRLSA--ARVGYPQNPQPGVQPAPAHQAHR 334
|
|
| COG4733 |
COG4733 |
Phage-related protein, tail protein J [Mobilome: prophages, transposons]; |
2-117 |
2.48e-03 |
|
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
Pssm-ID: 443767 [Multi-domain] Cd Length: 978 Bit Score: 41.85 E-value: 2.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 2 YTFRVVAYNEWG-PGESSQPIKVATQPELQVPGPVENLHAvSASPTSILITWEPPAYAngPVQGYRLFCTEVSTGKEQNI 80
Cdd:COG4733 599 YEVRVRAINALGvSSAWAASSETTVTGKTAPPPAPTGLTA-TGGLGGITLSWSFPVDA--DTLRTEIRYSTTGDWASATV 675
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1958740175 81 EVD---GLSYKLEGLKKFTEYTLRFLAYNRYG-------PGVSTDDI 117
Cdd:COG4733 676 AQAlypGNTYTLAGLKAGQTYYYRARAVDRSGnvsawwvSGQASADA 722
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
676-907 |
3.20e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.46 E-value: 3.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 676 DPAGRGSPIQSCQDLTPVSHSQSEsqmGSKSASHSGQDTEEAGSSMSTLERSLAARRATRTKlmipMEAQSNNPAVVSAI 755
Cdd:PHA03247 2607 DPRGPAPPSPLPPDTHAPDPPPPS---PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPR----RARRLGRAAQASSP 2679
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 756 PVPTLESAQYPGILPSPTCGYPHPQ-FTLRPVPFPTLS-VDRGFGAGRSQSVSEGPTAQQQPMLPPAQPEHPSSEEAPSR 833
Cdd:PHA03247 2680 PQRPRRRAARPTVGSLTSLADPPPPpPTPEPAPHALVSaTPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPAR 2759
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1958740175 834 -TIPTACVRPTHPLRSFANPLLPPPMSAIEPKVPYTPLLSQPGPTLPKTHVKTASLGLAGKARSPLLPVSVPTAP 907
Cdd:PHA03247 2760 pPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSA 2834
|
|
| COG4733 |
COG4733 |
Phage-related protein, tail protein J [Mobilome: prophages, transposons]; |
46-223 |
3.23e-03 |
|
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
Pssm-ID: 443767 [Multi-domain] Cd Length: 978 Bit Score: 41.47 E-value: 3.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 46 TSILITWEPPAYANG-PVQGYRlfctevSTGKEQNI-EVDGLSYKLEGLKKfTEYTLRFLAYNRYG---PGVSTDDITVV 120
Cdd:COG4733 552 TTLTVSWDAPAGAVAyEVEWRR------DDGNWVSVpRTSGTSFEVPGIYA-GDYEVRVRAINALGvssAWAASSETTVT 624
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 121 TLSDVPSAPpqnVSLEVVNS-RSIKVSWLPPPsgtqNGFITGYKIRHRKTTRRGEME---TLEPNNLWYLfTGLEKGSQY 196
Cdd:COG4733 625 GKTAPPPAP---TGLTATGGlGGITLSWSFPV----DADTLRTEIRYSTTGDWASATvaqALYPGNTYTL-AGLKAGQTY 696
|
170 180
....*....|....*....|....*..
gi 1958740175 197 SFQVSAmtVNGTGPPSNWYTAETPEND 223
Cdd:COG4733 697 YYRARA--VDRSGNVSAWWVSGQASAD 721
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
741-932 |
4.88e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 41.08 E-value: 4.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 741 PMEAQSNNPAVVSAIPVPTLESAQYPGILPSPTCGYPhpqftlrPVPFPTLSVDRGFGAGRSQSVSEGPTAQQQPMLPPA 820
Cdd:PHA03247 2705 PPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAP-------PAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAA 2777
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 821 QPE-----------HPSSEEAPS------RTIPTACVRPTHPLRSFANPLLPPPMSAIEPKVPYTPLLSQPGPTLPKTHV 883
Cdd:PHA03247 2778 GPPrrltrpavaslSESRESLPSpwdpadPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVA 2857
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 1958740175 884 KTASLGLAGKARSPLLPVSVPTAPEVSEESHKPTEDPASVYEQDDLSEQ 932
Cdd:PHA03247 2858 PGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPE 2906
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
715-943 |
6.00e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 40.24 E-value: 6.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 715 EEAGSSMsTLERSLAARR------ATRTKLMIPMEAQSNNPAVVSAIPVPTLESAQYPGILPSPTCGYPHPQF---TLRP 785
Cdd:PRK12323 349 EYAGFTM-TLLRMLAFRPgqsgggAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAaapARRS 427
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 786 VPFPTLSVDRGFGAGRSQSVSEgPTAQQQPMLPPAQPEHPSSEEAPSRTIPTACVRPTHPLRSFANPLLPPPMSAIEPKV 865
Cdd:PRK12323 428 PAPEALAAARQASARGPGGAPA-PAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEF 506
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 866 PYTPL--------------LSQPGPTLPKTHVKTASlglAGKARSPLLPVSVPTAPEVSEESHKPTEDPASVYEQDDLSE 931
Cdd:PRK12323 507 ASPAPaqpdaapagwvaesIPDPATADPDDAFETLA---PAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFDGDWPA 583
|
250
....*....|....
gi 1958740175 932 QMASL--EGLMKQL 943
Cdd:PRK12323 584 LAARLpvRGLAQQL 597
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
669-907 |
9.05e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 40.14 E-value: 9.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 669 IEKPAGTDPAGRGSPIQSCQ------DLTPVSHSQSESQMGSKSASHSGQDTEEAGSSMSTLERSLAARRATRTKLMIPM 742
Cdd:pfam03154 289 MQHPVPPQPFPLTPQSSQSQvppgpsPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPN 368
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 743 EAQSNNPAVVSAiPVPTlesaQYPGILPSPTCGYP-------HPQfTLRPVPFPTLSVDRGFGAGRSQSvsegPTAQQQP 815
Cdd:pfam03154 369 PQSHKHPPHLSG-PSPF----QMNSNLPPPPALKPlsslsthHPP-SAHPPPLQLMPQSQQLPPPPAQP----PVLTQSQ 438
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1958740175 816 MLPPAQPEHPSSeeAPSRTIPTACVRPTHPLRSFANPLLPPP----------MSAIEPKVPYTPLLSQPGP-----TLPK 880
Cdd:pfam03154 439 SLPPPAASHPPT--SGLHQVPSQSPFPQHPFVPGGPPPITPPsgpptstssaMPGIQPPSSASVSSSGPVPaavscPLPP 516
|
250 260
....*....|....*....|....*..
gi 1958740175 881 THVKTASLGLAGKARSPLLPVSVPTAP 907
Cdd:pfam03154 517 VQIKEEALDEAEEPESPPPPPRSPSPE 543
|
|
|