| Name | Accession | Description | Interval | E-value |
| VWD | smart00216 | von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... | 387-550 | 2.27e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.27e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182170 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
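As a reading aid for the alignment blocks in this listing: each row carries a label, a start coordinate, the aligned residues, and an end coordinate. The sketch below checks that a row's coordinates agree with its residue count. It assumes that dashes are gaps and that lowercase letters are residues not aligned to the model, which is the usual CD-Search convention but is not stated in this output. The names `parse_row`, `residue_count`, and `is_consistent` are hypothetical helpers, not part of any NCBI tool.

```python
import re

# One alignment row: label, start coordinate, aligned string, end coordinate.
ROW = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$"
)


def parse_row(line):
    """Split an alignment row into (label, start, aligned string, end)."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def residue_count(aligned):
    """Count residues; '-' is treated as a gap (assumption), any letter as a residue."""
    return sum(1 for ch in aligned if ch != "-")


def is_consistent(line):
    """True when start + residues - 1 equals the end coordinate."""
    _, start, aligned, end = parse_row(line)
    return start + residue_count(aligned) - 1 == end


# First row pair of the VWD (smart00216) hit at 387-550, copied from the listing.
query_row = ("gi 1907182170 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAV"
             "YDKSGYSHSETSLVAIMYLSKKDKIVI 466")
model_row = ("Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVL"
             "LKNVPCGGGATCLKSVKVELNGDEIEL 78")

print(is_consistent(query_row), is_consistent(model_row))
```

A row that fails this check usually points to characters lost during extraction rather than to a problem with the hit itself.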
|
|
| C8 | smart00832 | This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... | 1058-1130 | 2.92e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 110.12 E-value: 2.92e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182170 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD | smart00216 | von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... | 857-1019 | 3.71e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation. :
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.71e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182170 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| VWD | pfam00094 | von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... | 45-194 | 2.15e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods. :
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.15e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| C8 | smart00832 | This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... | 589-661 | 9.74e-22 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. :
Pssm-ID: 214843 Cd Length: 76 Bit Score: 91.63 E-value: 9.74e-22
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| TIL | pfam01826 | Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... | 303-358 | 1.64e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. :
Pssm-ID: 460351 Cd Length: 55 Bit Score: 76.27 E-value: 1.64e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
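The pfam01826 description above gives the disulphide pattern as cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. The sketch below applies that pattern positionally to the two aligned rows of this hit, numbering the cysteines in order of appearance. It assumes dashes are gaps and lowercase letters are ordinary residues; `cysteine_positions` and `disulfide_pairs` are hypothetical helper names. The resulting pairs are what the stated pattern implies, not experimentally confirmed bonds for this protein.

```python
# Pairing pattern quoted in the pfam01826 description (1-based cysteine order).
TIL_PATTERN = [(1, 7), (2, 6), (3, 5), (4, 10), (8, 9)]


def cysteine_positions(aligned, start=1):
    """Return sequence coordinates of every cysteine in an aligned row.

    '-' is skipped as a gap (assumption); lowercase letters count as residues.
    """
    positions = []
    pos = start
    for ch in aligned:
        if ch == "-":
            continue
        if ch.upper() == "C":
            positions.append(pos)
        pos += 1
    return positions


def disulfide_pairs(aligned, start=1, pattern=TIL_PATTERN):
    """Map the ordinal pairing pattern onto sequence coordinates."""
    cys = cysteine_positions(aligned, start)
    if len(cys) != 10:
        raise ValueError(f"expected 10 cysteines, found {len(cys)}")
    return [(cys[a - 1], cys[b - 1]) for a, b in pattern]


# Rows copied from the TIL (pfam01826) hit at 303-358.
model = "CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC"
query = "CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC"

print(disulfide_pairs(model, start=1))
print(disulfide_pairs(query, start=303))
```

Both rows contain exactly ten cysteines, so the pattern can be applied without further alignment; a hit with fewer cysteines would raise `ValueError` instead of producing a misleading pairing.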
|
|
| TIL | cd19941 | trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... | 765-828 | 2.83e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.96 E-value: 2.83e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| C8 | pfam08742 | C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... | 246-298 | 8.65e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826. :
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.47 E-value: 8.65e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT | smart00041 | C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... | 4233-4312 | 4.02e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers. :
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 4.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4233 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4311
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182170 4312 P 4312
Cdd:smart00041 79 P 79
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 3706-4128 | 1.89e-08 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 1.89e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3706 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3770
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3771 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3848
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3849 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3927
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3928 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3999
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4000 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4078
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 4079 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4128
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| TIL | cd19941 | trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... | 665-722 | 1.74e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, shows no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation. :
Pssm-ID: 410995 Cd Length: 55 Bit Score: 50.39 E-value: 1.74e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| Chi1 super family | cl43877 | Chitinase [Carbohydrate transport and metabolism]; | 2341-2562 | 1.03e-06 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.76 E-value: 1.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2341 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2420
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2421 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2500
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 2501 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2562
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| VWC_out | smart00215 | von Willebrand factor (vWF) type C domain; | 360-404 | 2.64e-06 |
|
von Willebrand factor (vWF) type C domain; :
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.56 E-value: 2.64e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182170 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 super family | cl43877 | Chitinase [Carbohydrate transport and metabolism]; | 3250-3450 | 3.91e-06 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.83 E-value: 3.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3250 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3329
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3330 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3409
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182170 3410 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3450
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 super family | cl43877 | Chitinase [Carbohydrate transport and metabolism]; | 2647-2835 | 1.60e-05 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.91 E-value: 1.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2647 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2726
Cdd:COG3469 40 TTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGA 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2727 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2806
Cdd:COG3469 120 GSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT 195
|
170 180
....*....|....*....|....*....
gi 1907182170 2807 TTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469 196 PS---------ATTTATTTGPPTPGLPKH 215
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 2715-3170 | 5.40e-05 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 5.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2715 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2786
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2787 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2860
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2861 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2940
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2941 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3017
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3018 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3097
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3098 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3166
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182170 3167 PSAP 3170
Cdd:PHA03247 3016 ETDP 3019
|
|
| 2A1904 super family | cl36772 | K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... | 2068-2422 | 1.73e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds] The actual alignment was detected with superfamily member TIGR00927:
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 48.07 E-value: 1.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2068 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2142
Cdd:TIGR00927 75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2143 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2213
Cdd:TIGR00927 149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2214 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2293
Cdd:TIGR00927 223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2294 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2366
Cdd:TIGR00927 302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 2367 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2422
Cdd:TIGR00927 382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Herpes_BLLF1 super family | cl37540 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 1554-1969 | 4.38e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 4.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1554 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 1633
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1634 STSTTTGnilPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 1713
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1714 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 1792
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1793 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 1867
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1868 TPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 1946
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182170 1947 HLPFSSTSSVTPTSKVIITPTPQ 1969
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Chi1 super family | cl43877 | Chitinase [Carbohydrate transport and metabolism]; | 3533-3753 | 1.47e-03 |
|
Chitinase [Carbohydrate transport and metabolism]; The actual alignment was detected with superfamily member COG3469:
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3533 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3612
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3613 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3692
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 3693 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 3753
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| ROM1 super family | cl34999 | RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... | 1407-1625 | 4.95e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms]; The actual alignment was detected with superfamily member COG5422:
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 43.34 E-value: 4.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182170 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
|
| Name | Accession | Description | Interval | E-value |
| VWD | smart00216 | von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... | 387-550 | 2.27e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.27e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182170 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| VWD | pfam00094 | von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... | 398-550 | 4.14e-29 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 115.55 E-value: 4.14e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
|
|
| C8 | smart00832 | This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... | 1058-1130 | 2.92e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 110.12 E-value: 2.92e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182170 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD | smart00216 | von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... | 857-1019 | 3.71e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.71e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182170 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| C8 | pfam08742 | C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... | 1062-1129 | 1.23e-25 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 102.46 E-value: 1.23e-25
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| VWD | pfam00094 | von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... | 45-194 | 2.15e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.15e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD | smart00216 | von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... | 45-193 | 1.12e-24 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 103.25 E-value: 1.12e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216 12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182170 120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216 90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
9.74e-22 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 91.63 E-value: 9.74e-22
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
592-661 |
2.59e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 90.13 E-value: 2.59e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742 1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
869-1019 |
1.67e-20 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 90.89 E-value: 1.67e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094 78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
1.64e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 76.27 E-value: 1.64e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
303-358 |
1.34e-15 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 73.51 E-value: 1.34e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
2.83e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.96 E-value: 2.83e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
765-828 |
3.84e-13 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 66.64 E-value: 3.84e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826 1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
8.65e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 of them due to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.47 E-value: 8.65e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4233-4312 |
4.02e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 4.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4233 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4311
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182170 4312 P 4312
Cdd:smart00041 79 P 79
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
246-299 |
5.07e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 55.42 E-value: 5.07e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832 25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3706-4128 |
1.89e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 1.89e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3706 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3770
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3771 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3848
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3849 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3927
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3928 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3999
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4000 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4078
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 4079 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4128
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.74e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitor), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 50.39 E-value: 1.74e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
3803-4119 |
9.18e-07 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 55.39 E-value: 9.18e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3803 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 3872
Cdd:TIGR00927 68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3873 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 3929
Cdd:TIGR00927 148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3930 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 3997
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3998 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4075
Cdd:TIGR00927 306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 4076 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4119
Cdd:TIGR00927 386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2341-2562 |
1.03e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.76 E-value: 1.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2341 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2420
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2421 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2500
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 2501 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2562
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
2.64e-06 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.56 E-value: 2.64e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182170 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3250-3450 |
3.91e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.83 E-value: 3.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3250 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3329
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3330 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3409
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182170 3410 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3450
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2177-2573 |
8.85e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 52.23 E-value: 8.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2177 TTTAATQTKSSFSTDRTSTPhlSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTTSGttSSPQTPRTTHPFSTVAVS 2256
Cdd:pfam05109 428 TTTSPTLNTTGFAAPNTTTG--LPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGTTSG--ASPVTPSPSPRDNGTESK 502
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2257 NTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPfssTSSVTPTSEVIITPTPQHTLssaststtmgnilPTTIGQTGS 2336
Cdd:pfam05109 503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSP---TLGKTSPTSAVTTPTPNATS-------------PTPAVTTPT 566
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2337 PHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 2416
Cdd:pfam05109 567 PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2417 PRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTG 2496
Cdd:pfam05109 647 RPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEV 724
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2497 GLT-GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT----KHTTGVSLETSVQTTI---ASPTPSAPQTSLATHLPFSST 2568
Cdd:pfam05109 725 NVTkGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSttggKHTTGHGARTSTEPTTdygGDSTTPRTRYNATTYLPPSTS 804
|
....*
gi 1907182170 2569 SAVTP 2573
Cdd:pfam05109 805 SKLRP 809
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
665-722 |
1.10e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 45.46 E-value: 1.10e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2647-2835 |
1.60e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.91 E-value: 1.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2647 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2726
Cdd:COG3469 40 TTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGA 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2727 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2806
Cdd:COG3469 120 GSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT 195
|
170 180
....*....|....*....|....*....
gi 1907182170 2807 TTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469 196 PS---------ATTTATTTGPPTPGLPKH 215
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2715-3170 |
5.40e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 5.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2715 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2786
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2787 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2860
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2861 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2940
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2941 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3017
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3018 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3097
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3098 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3166
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182170 3167 PSAP 3170
Cdd:PHA03247 3016 ETDP 3019
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
3871-4105 |
9.62e-05 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 48.73 E-value: 9.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3871 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 3945
Cdd:COG5422 59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3946 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 4025
Cdd:COG5422 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4026 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4100
Cdd:COG5422 211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286
|
....*
gi 1907182170 4101 PTIHM 4105
Cdd:COG5422 287 MRLQL 291
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
1987-3607 |
1.64e-04 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 48.22 E-value: 1.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1987 TTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 2066
Cdd:COG3210 80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2067 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2146
Cdd:COG3210 160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2147 VASTSMplmTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2226
Cdd:COG3210 240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2227 GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVII 2306
Cdd:COG3210 317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2307 TPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 2386
Cdd:COG3210 397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2387 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLF 2466
Cdd:COG3210 477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2467 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2546
Cdd:COG3210 557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2547 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2626
Cdd:COG3210 637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2627 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2697
Cdd:COG3210 717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2698 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 2777
Cdd:COG3210 797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2778 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 2857
Cdd:COG3210 877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2858 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 2937
Cdd:COG3210 957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2938 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 3017
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3018 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3097
Cdd:COG3210 1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3098 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3177
Cdd:COG3210 1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3178 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3257
Cdd:COG3210 1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
|
1290 1300 1310 1320 1330 1340 1350 1360
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3258 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3337
Cdd:COG3210 1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
|
1370 1380 1390 1400 1410 1420 1430 1440
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3338 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3417
Cdd:COG3210 1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
|
1450 1460 1470 1480 1490 1500 1510 1520
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3418 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3497
Cdd:COG3210 1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
|
1530 1540 1550 1560 1570 1580 1590 1600
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3498 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3577
Cdd:COG3210 1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
|
1610 1620 1630
....*....|....*....|....*....|
gi 1907182170 3578 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3607
Cdd:COG3210 1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2068-2422 |
1.73e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 48.07 E-value: 1.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2068 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2142
Cdd:TIGR00927 75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2143 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2213
Cdd:TIGR00927 149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2214 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2293
Cdd:TIGR00927 223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2294 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2366
Cdd:TIGR00927 302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 2367 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2422
Cdd:TIGR00927 382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2030-2248 |
1.91e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.44 E-value: 1.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2030 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2109
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2110 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVMYTTTAATQTKSSF 2188
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2189 STDRTSTPHlsqSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2248
Cdd:COG3469 159 ATGGTTTTS---TTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1896-2311 |
1.96e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 1.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1896 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSA 1975
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1976 STSTTTGnilPTTIGKTGSPHTSVPviyTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 2055
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2056 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2134
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2135 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2209
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2210 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTT 2288
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182170 2289 HLPFSSTSSVTPTSEVIITPTPQ 2311
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2358-2837 |
3.96e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 3.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2358 STDRT-STPTSAPHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHT 2434
Cdd:PHA03247 2563 APDRSvPPPRPAPRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAA 2635
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2435 TGLPSGTSVQTTTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSP 2514
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2515 QTPRTTHPFSTVAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAstst 2594
Cdd:PHA03247 2703 PPPPTPEPAPHALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP---- 2760
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2595 ttgnilPTTigqTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGT 2674
Cdd:PHA03247 2761 ------PTT---AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2675 PVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQ 2754
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERP 2908
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2755 STPI----PATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQ 2829
Cdd:PHA03247 2909 PQPQapppPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREA 2988
|
....*...
gi 1907182170 2830 TSLATHLP 2837
Cdd:PHA03247 2989 PASSTPPL 2996
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1554-1969 |
4.38e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 4.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1554 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 1633
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1634 STSTTTGnilPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 1713
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1714 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 1792
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1793 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 1867
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1868 TPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 1946
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182170 1947 HLPFSSTSSVTPTSKVIITPTPQ 1969
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3542-3915 |
6.74e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 45.72 E-value: 6.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3542 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3621
Cdd:pfam17823 45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3622 THLPLFSTLSVTPTTEGLNTPTSphslSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHL 3701
Cdd:pfam17823 125 SAAQSLPAAIAALPSEAFSAPRA----AACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAAS 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3702 SQSSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFP 3780
Cdd:pfam17823 201 SAPATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRL 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3781 TPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQG 3858
Cdd:pfam17823 281 SPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSA 360
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182170 3859 --------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 3915
Cdd:pfam17823 361 spvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3533-3753 |
1.47e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3533 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3612
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3613 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3692
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 3693 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 3753
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2918-3132 |
1.82e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2997
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2998 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3076
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182170 3077 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3132
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
|
|
| VWC |
pfam00093 |
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ... |
360-395 |
1.86e-03 |
|
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.
Pssm-ID: 278520 Cd Length: 57 Bit Score: 39.33 E-value: 1.86e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1907182170 360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093 1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1688-1906 |
1.91e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1688 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 1767
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1768 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 1847
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 1848 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 1906
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2644-2968 |
2.40e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 44.22 E-value: 2.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2644 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2717
Cdd:TIGR00927 91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2718 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 2783
Cdd:TIGR00927 169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2784 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 2863
Cdd:TIGR00927 246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2864 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 2933
Cdd:TIGR00927 326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182170 2934 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2968
Cdd:TIGR00927 405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2774-3137 |
2.67e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 2.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2774 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 2853
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2854 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2933
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2934 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 3009
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3010 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3081
Cdd:pfam03154 413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182170 3082 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3137
Cdd:pfam03154 488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3087-3447 |
2.81e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 2.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3087 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3166
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3167 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3246
Cdd:pfam03154 236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3247 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3326
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3327 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3392
Cdd:pfam03154 380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182170 3393 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3447
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2511-2827 |
2.83e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 43.75 E-value: 2.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2511 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2574
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2575 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2646
Cdd:pfam05109 502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2647 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2726
Cdd:pfam05109 580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2727 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2804
Cdd:pfam05109 656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
|
330 340
....*....|....*....|...
gi 1907182170 2805 KHTTGVSLETSVQTTIASPTPSA 2827
Cdd:pfam05109 731 PPKNATSPQAPSGQKTAVPTVTS 753
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
3841-4069 |
3.60e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.84 E-value: 3.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3841 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 3920
Cdd:NF033849 250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3921 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 3995
Cdd:NF033849 330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 3996 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4069
Cdd:NF033849 410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1515-1949 |
4.35e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 4.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1515 SQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSntkHTTGVSLETSVQTTIASPT 1594
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG---RAAQASSPPQRPRRRAARP 2690
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1595 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPqhtlssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFS 1674
Cdd:PHA03247 2691 TVGSLTSLADPPPPPPTPEPAPHALVSATPLP------PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1675 TDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkptmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLP 1754
Cdd:PHA03247 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------------------PSPWDP-ADPPAAVLAPAAALPPAASP 2824
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1755 SGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpTTEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTY 1834
Cdd:PHA03247 2825 AGPLPPPTSAQPTAPPPPPG-----PPPPSL----PLGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPA 2890
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1835 TttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVS 1914
Cdd:PHA03247 2891 V---------SRSTESFALPPDQPERPPQPQAPPP-PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
|
410 420 430
....*....|....*....|....*....|....*.
gi 1907182170 1915 NTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 1949
Cdd:PHA03247 2961 QPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
4.95e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 43.34 E-value: 4.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182170 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2353-2695 |
9.95e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 42.29 E-value: 9.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2353 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 2426
Cdd:TIGR00927 73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2427 AVSGTVHTTGLPSGTSVQTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSL 2492
Cdd:TIGR00927 151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2493 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVT 2572
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2573 PTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAiTQTKTSFSTDRTSTSTSAPHLS 2645
Cdd:TIGR00927 308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASA-TFRGLEKNPSTAPSTPATPRVR 386
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 2646 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPrTTHP 2695
Cdd:TIGR00927 387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
387-550 |
2.27e-29 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 116.73 E-value: 2.27e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 387 WTCTKQPCPGHCSLEGGSFVTTFDARPYRFHGTCTYTLLQSpqLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVI 466
Cdd:smart00216 1 WCCTQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD--CSSEPTFSVLLKNVPCGGGATCLKSVKVELNGDEIEL 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 467 SEDEV-ITNNGDTKLLPYKTHNITI-FRQTSTHLQMATTFGLELVfQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDD 544
Cdd:smart00216 79 KDDNGkVTVNGQQVSLPYKTSDGSIqIRSSGGYLVVITSLGLIQV-TFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDD 157
|
....*.
gi 1907182170 545 FTTSMG 550
Cdd:smart00216 158 FRTPDG 163
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
398-550 |
4.14e-29 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 115.55 E-value: 4.14e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 398 CSLEGGSFVTTFDARPYRFHGTCTYTLLQSPQLPNEGTLMAVYDKSGYSHSETSLVAIMYLSKKDKIVISEDEVITNNGD 477
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGASGVCLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 478 TKLLPYKTHNITIFRQTSTHLQMATTFGLELVFQMQPVFQVYITVGPQFKGQTRGLCGNFNGDTTDDFTTSMG 550
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDG 153
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
1058-1130 |
2.92e-28 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 110.12 E-value: 2.92e-28
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182170 1058 WAERKCNIINSQ--TFAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFCP 1130
Cdd:smart00832 3 YACSQCGILLSPrgPFAACHSVVDPEPFFENCVYDTCAC--GGDCECLCDALAAYAAACAEAGVCIsPWRTPTFCP 76
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
857-1019 |
3.71e-28 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 113.27 E-value: 3.71e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 857 WTCqLSTQCPSTCVLYGEGHIITFDGQRFVFDGDCEYMLATDdcgaNSSQPTFKVLTENVICGkSGVTCSRAIKISLGGL 936
Cdd:smart00216 1 WCC-TQEECSPTCSVSGDPHYTTFDGVAYTFPGNCYYVLAQD----CSSEPTFSVLLKNVPCG-GGATCLKSVKVELNGD 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 937 FITMADSN--YTVSGE-------EPLVHLKVKPSPLNLVLdidIPGRLNLTLVWNKHMSVSIKIrRATQQDALCGLCGNA 1007
Cdd:smart00216 75 EIELKDDNgkVTVNGQqvslpykTSDGSIQIRSSGGYLVV---ITSLGLIQVTFDGLTLLSVQL-PSKYRGKTCGLCGNF 150
|
170
....*....|..
gi 1907182170 1008 NGNMKDDFETRS 1019
Cdd:smart00216 151 DGEPEDDFRTPD 162
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
1062-1129 |
1.23e-25 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 102.46 E-value: 1.23e-25
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1062 KCNIINSQT-FAACHSKVYHLPYYEACVRDACGCdtGGDCECLCDAVAAYAKACLDKGVCV-DWRTPDFC 1129
Cdd:pfam08742 1 KCGLLSDSGpFAPCHSVVDPEPYFEACVYDMCSC--GGDDECLCAALAAYARACQAAGVCIgDWRTPTFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
45-194 |
2.15e-25 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 105.15 E-value: 2.15e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDVPATFSIQLRRDMEGN----ISRIIMELGASVVTVNKETISVRDIG 120
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLAKDCSEEPDFSFSVTNKNCNGGAsgvcLKSVTVIVGDLEITLQKGGTVLVNGQ 80
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 121 VVSLPYTSNGLQITPYGQSVQLVAKQLELELVITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDGK 194
Cdd:pfam00094 81 KVSLPYKSDGGEVEILGSGFVVVDLSPGVGLQVDGDGRGQLFVTLSPSYQGKTCGLCGNYNGNQEDDFMTPDGT 154
|
|
| VWD |
smart00216 |
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D ... |
45-193 |
1.12e-24 |
|
von Willebrand factor (vWF) type D domain; Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
Pssm-ID: 214566 [Multi-domain] Cd Length: 163 Bit Score: 103.25 E-value: 1.12e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 45 CSTWGAGHFSTFDGHEYNFQGMCNYIFTATCGDDvpATFSIQLRRDMEG----NISRIIMELGASVVTVNKETISVR-DI 119
Cdd:smart00216 12 CSVSGDPHYTTFDGVAYTFPGNCYYVLAQDCSSE--PTFSVLLKNVPCGggatCLKSVKVELNGDEIELKDDNGKVTvNG 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182170 120 GVVSLPYTSNGLQITPYgQSVQLVAKQLELELV-ITWGPDAHLTVQVETKYMGKLCGLCGNFDGKIDNEFLSEDG 193
Cdd:smart00216 90 QQVSLPYKTSDGSIQIR-SSGGYLVVITSLGLIqVTFDGLTLLSVQLPSKYRGKTCGLCGNFDGEPEDDFRTPDG 163
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
589-661 |
9.74e-22 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 91.63 E-value: 9.74e-22
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 589 AETHCSMLLKKGSVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:smart00832 4 ACSQCGILLSPRGPFAACHSVVDPEPFFENCVYDTCACGGDCECLCDALAAYAAACAEAGVCISPWRTP-TFC 75
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
592-661 |
2.59e-21 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 90.13 E-value: 2.59e-21
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 592 HCSMLLKKGsVFEKCHSVVNPQPFYKRCVYQACNYEETFPHICSALGAYAHACSARGILLWGWRNSvDNC 661
Cdd:pfam08742 1 KCGLLSDSG-PFAPCHSVVDPEPYFEACVYDMCSCGGDDECLCAALAAYARACQAAGVCIGDWRTP-TFC 68
|
|
| VWD |
pfam00094 |
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is ... |
869-1019 |
1.67e-20 |
|
von Willebrand factor type D domain; Swiss:P17554 contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
Pssm-ID: 459671 [Multi-domain] Cd Length: 154 Bit Score: 90.89 E-value: 1.67e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 869 CVLYGEGHIITFDGQRFVFDGDCEYMLAtDDCGANSSqPTFKVLTENVICGKSGVtCSRAIKISLGGLFITMADSNY-TV 947
Cdd:pfam00094 1 CSVSGDPHYVTFDGVKYTFPGTCTYVLA-KDCSEEPD-FSFSVTNKNCNGGASGV-CLKSVTVIVGDLEITLQKGGTvLV 77
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 948 SGEEplVHLKVKPSPLNL------VLDIDIPGRLNLTLVWNKHMSVSIKIRRaTQQDALCGLCGNANGNMKDDFETRS 1019
Cdd:pfam00094 78 NGQK--VSLPYKSDGGEVeilgsgFVVVDLSPGVGLQVDGDGRGQLFVTLSP-SYQGKTCGLCGNYNGNQEDDFMTPD 152
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
303-358 |
1.64e-16 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 76.27 E-value: 1.64e-16
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPpdVCPEPCVEGCVCPPGFVRNS---GGKCVPPSDC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
303-358 |
1.34e-15 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 73.51 E-value: 1.34e-15
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 303 CPANQVYQECGEVCIKTCSNPQH--SCSSPCTFGCFCPHGTLLDDisgNQSCVPVNQC 358
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANPNAppPCTKQCVEGCFCPEGYVRNS---GGKCVPPSQC 55
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
765-828 |
2.83e-13 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 66.96 E-value: 2.83e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:cd19941 1 CPPNEVYSEC--------GSACPPTCANPNAPPPC-TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
765-828 |
3.84e-13 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 66.64 E-value: 3.84e-13
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 765 CPEPKTFQSCsqssedkfGAACAPTCQMLATGIDCvPTKCESGCVCPKGLYENSDGQCVPAEEC 828
Cdd:pfam01826 1 CPANEVYSEC--------GSACPPTCANLSPPDVC-PEPCVEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| C8 |
pfam08742 |
C8 domain; This domain contains 8 conserved cysteine residues, but this family only contains 7 ... |
246-298 |
8.65e-11 |
|
C8 domain; This domain contains 8 conserved cysteine residues, but this family contains only 7 of them to avoid overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin. It is often found on proteins containing pfam00094 and pfam01826.
Pssm-ID: 462584 Cd Length: 68 Bit Score: 60.47 E-value: 8.65e-11
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALC 298
Cdd:pfam08742 18 VDPEPYFEACVYDM--CSCGGDDECLCAALAAYARACQAAGVCIGDWRTPTFC 68
|
|
| CT |
smart00041 |
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta ... |
4233-4312 |
4.02e-10 |
|
C-terminal cystine knot-like domain (CTCK); The structures of transforming growth factor-beta (TGFbeta), nerve growth factor (NGF), platelet-derived growth factor (PDGF) and gonadotropin all form 2 highly twisted antiparallel pairs of beta-strands and contain three disulphide bonds. The domain is non-globular and little is conserved among these presumed homologues except for their cysteine residues. CT domains are predicted to form homodimers.
Pssm-ID: 214482 Cd Length: 82 Bit Score: 58.95 E-value: 4.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4233 EHQITYQGCVAN-VTLTRCQGFCASSVSFNkdTLQLESSCGCCQPLSTYKKQLSLPCPDpdapGQQLTLTLQVFSSCVCS 4311
Cdd:smart00041 5 RQTITYNGCTSVtVKNAFCEGKCGSASSYS--IQDVQHSCSCCQPHKTKTRQVRLRCPD----GSTVKKTVMHIEECGCE 78
|
.
gi 1907182170 4312 P 4312
Cdd:smart00041 79 P 79
|
|
| C8 |
smart00832 |
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have ... |
246-299 |
5.07e-09 |
|
This domain contains 8 conserved cysteine residues; Not all of the conserved cysteines have been included in the alignment model. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
Pssm-ID: 214843 Cd Length: 76 Bit Score: 55.42 E-value: 5.07e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 246 VPKETLMLSCQADMaaCARPGQPNCSCATLSEYSRRCSMTGQPVRNWRTPALCP 299
Cdd:smart00832 25 VDPEPFFENCVYDT--CACGGDCECLCDALAAYAAACAEAGVCISPWRTPTFCP 76
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3706-4128 |
1.89e-08 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 1.89e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3706 TVTPTQPTP---------------IPATTNSPMTTVGLTGTPvvHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFR 3770
Cdd:PHA03247 2567 SVPPPRPAPrpsepavtsrarrpdAPPQSARPRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPP 2644
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3771 TSEQSTTTFPTPSAPQTSLV--TSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPvstilqttievTTPPNTSTPVTH 3848
Cdd:PHA03247 2645 TVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADP-----------PPPPPTPEPAPH 2713
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3849 S-TSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAIS 3927
Cdd:PHA03247 2714 AlVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSE 2793
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3928 RTTGISGTPFRTPMKTTITFPTPSSLQTSM-ATLFPPfstsvmssteifntPTNPHSVSSASTSRPLSTSLPT------- 3999
Cdd:PHA03247 2794 SRESLPSPWDPADPPAAVLAPAAALPPAASpAGPLPP--------------PTSAQPTAPPPPPGPPPPSLPLggsvapg 2859
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4000 -TIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTAS 4078
Cdd:PHA03247 2860 gDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 4079 PPSSAPTfvSPTAASTVISSALPTI---HMTP---------TPSSRPTSSTGLLSTSKTTSH 4128
Cdd:PHA03247 2940 QPPLAPT--TDPAGAGEPSGAVPQPwlgALVPgrvavprfrVPQPAPSREAPASSTPPLTGH 2999
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
3625-4084 |
2.75e-08 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 60.57 E-value: 2.75e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3625 PLFSTLSVTPTTEGLNTPT-SPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPvtytttaatqtkssfsTDRTSTPHLSQ 3703
Cdd:PHA03307 54 TVVAGAAACDRFEPPTGPPpGPGTEAPANESRSTPTWSLSTLAPASPAREGSP----------------TPPGPSSPDPP 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3704 SSTVTPTQPTPIPATTNSPMTTvglTGTPVVHTPSGTSSIAHTPHTthslPTAASSSTTLSTAPQFRTSEQSTttfPTPS 3783
Cdd:PHA03307 118 PPTPPPASPPPSPAPDLSEMLR---PVGSPGPPPAASPPAAGASPA----AVASDAASSRQAALPLSSPEETA---RAPS 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3784 APQTSLVTSLPPFSTSSVSPTdeihitstnPHTVSSVSMSRPVSTILQTtievttpPNTSTPVTHSTSATTEAQGSFSTE 3863
Cdd:PHA03307 188 SPPAEPPPSTPPAAASPRPPR---------RSSPISASASSPAPAPGRS-------AADDAGASSSDSSSSESSGCGWGP 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3864 RTSTSyLSHPssttvhqstaGPVITSIKSTMGVTGTPPvhttsGTTSSPQTPHSTHPISTAAISRttGISGTPFRTPMKT 3943
Cdd:PHA03307 252 ENECP-LPRP----------APITLPTRIWEASGWNGP-----SSRPGPASSSSSPRERSPSPSP--SSPGSGPAPSSPR 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3944 TITFPTPSSLQTSMATLfppfSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdintTSATTQA 4023
Cdd:PHA03307 314 ASSSSSSSRESSSSSTS----SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPS------SPAASAG 383
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 4024 HSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASP-PSSAP 4084
Cdd:PHA03307 384 RPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPSGEPwPGSPP 445
|
|
| TIL |
cd19941 |
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich ... |
665-722 |
1.74e-07 |
|
trypsin inhibitor-like cysteine rich domain; TIL (trypsin inhibitor-like) cysteine rich domains are found in smapins (small serine proteinase inhibitors), or Ascaris trypsin inhibitor (ATI)-like proteins, whose members include anticoagulant proteins, elastase inhibitors, trypsin inhibitors, thrombin inhibitors, and chymotrypsin inhibitors. The TIL domain is also found in some large modular glycoproteins, including the von Willebrand factor (VWF), mucin-6, mucin-19, and SCO-spondin, among others. The TIL domain is characterized by the presence of five disulfide bonds (two of which are located on either side of the reactive site) in a single small protein domain of 61-62 residues. The cysteine residues that form the disulfide bonds are linked in the pattern: cysteines 1-7, 2-6, 3-5, 4-10 and 8-9. TILs can occur as a single domain or in multiple tandem arrangements. The disulfide bonds account for the unusual resistance to proteolysis and heat denaturation of these proteins. Smapins possess an unusual fold and, with the exception of the reactive site, show no similarity to other serine protease inhibitors. The serine protease inhibitors comprise a large family of molecules involved in inflammatory responses, blood clotting, and complement activation.
Pssm-ID: 410995 Cd Length: 55 Bit Score: 50.39 E-value: 1.74e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 665 CTGNRTFSYDSQACDRTCLSLsDRETEChvSPVPVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:cd19941 1 CPPNEVYSECGSACPPTCANP-NAPPPC--TKQCVEGCFCPEGYVRNSGGKCVPPSQC 55
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca2+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
3803-4119 |
9.18e-07 |
|
K+-dependent Na+/Ca2+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 55.39 E-value: 9.18e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3803 PTDEIHITSTNPHTVSS---VSMSRPVSTIL--QTTIEVT-----TPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSH 3872
Cdd:TIGR00927 68 SNDEMMMVSSDPPKSSSemeGEMLAPQATVGrdEATPSIAmentpSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPAT 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3873 PSSTTVH-QSTAG-PVITSIKSTM--GVTGTPPVHTTSGT---TSSP----------------QTPHSTHPISTAAISRT 3929
Cdd:TIGR00927 148 PSRALNHyISTSGrQRVKSYTPKPrgEVKSSSPTQTREKVrkyTPSPlgrmvnsyapstfmtmPRSHGITPRTTVKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3930 TGISGTPFRTPMKTTITFPTPSSLQtSMATLFPPFSTSVMSsTEIFNTPTN---------PHSVSSASTSRP---LSTSL 3997
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLK-GMTDNTPTFLTREVE-TDLLTSPRSvvekntlttPRRVESNSSTNHwglVGKNN 305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3998 PTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL--QYTPTPSSVSHSPLLTTP 4075
Cdd:TIGR00927 306 LTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASATfrGLEKNPSTAPSTPATPRV 385
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 4076 TASPPSSA-------PTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGL 4119
Cdd:TIGR00927 386 RAVLTTQVhhcvvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQPDL 436
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2341-2562 |
1.03e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 54.76 E-value: 1.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2341 VPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTT 2420
Cdd:COG3469 7 AASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2421 HPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTG 2500
Cdd:COG3469 87 AAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGG 162
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 2501 TPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 2562
Cdd:COG3469 163 TTTTSTTTTTTSASTTPSATTTATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
360-404 |
2.64e-06 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 47.56 E-value: 2.64e-06
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 1907182170 360 CMLNGMVYGPGEITKTACQTCQCTMGRWTCTKQPC-PGHCSLEGGS 404
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCTKVWCgPKPCLLHNLS 46
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3250-3450 |
3.91e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.83 E-value: 3.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3250 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3329
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3330 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3409
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATT 183
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1907182170 3410 PFSTVAVSNTKHTTgvsletsvqTTIASPTPSAPQTSLATH 3450
Cdd:COG3469 184 TATATTASGATTPS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2328-2521 |
5.40e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 5.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2328 PTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTT 2407
Cdd:COG3469 26 AATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGA 105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2408 SGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPA 2487
Cdd:COG3469 106 NTGTSTVTTT----STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182170 2488 TTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2521
Cdd:COG3469 182 TTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2362-2585 |
8.79e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 51.68 E-value: 8.79e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2362 TSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 2441
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2442 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2521
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 2522 PFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfSSTSAVTPTSEVIITPTPQH 2585
Cdd:COG3469 156 TETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT----PSATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2177-2573 |
8.85e-06 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 52.23 E-value: 8.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2177 TTTAATQTKSSFSTDRTSTPhlSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTTSGttSSPQTPRTTHPFSTVAVS 2256
Cdd:pfam05109 428 TTTSPTLNTTGFAAPNTTTG--LPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGTTSG--ASPVTPSPSPRDNGTESK 502
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2257 NTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPfssTSSVTPTSEVIITPTPQHTLssaststtmgnilPTTIGQTGS 2336
Cdd:pfam05109 503 APDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSP---TLGKTSPTSAVTTPTPNATS-------------PTPAVTTPT 566
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2337 PHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 2416
Cdd:pfam05109 567 PNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSL 646
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2417 PRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTG 2496
Cdd:pfam05109 647 RPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEV 724
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2497 GLT-GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT----KHTTGVSLETSVQTTI---ASPTPSAPQTSLATHLPFSST 2568
Cdd:pfam05109 725 NVTkGTPPKNATSPQAPSGQKTAVPTVTSTGGKANSttggKHTTGHGARTSTEPTTdygGDSTTPRTRYNATTYLPPSTS 804
|
....*
gi 1907182170 2569 SAVTP 2573
Cdd:pfam05109 805 SKLRP 809
|
|
| TIL |
pfam01826 |
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well ... |
665-722 |
1.10e-05 |
|
Trypsin Inhibitor like cysteine rich domain; This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
Pssm-ID: 460351 Cd Length: 55 Bit Score: 45.46 E-value: 1.10e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 665 CTGNRTFSYDSQACDRTCLSLSDR---ETEChvspvpVDGCNCPEGTYLNHKAECVHKAQC 722
Cdd:pfam01826 1 CPANEVYSECGSACPPTCANLSPPdvcPEPC------VEGCVCPPGFVRNSGGKCVPPSDC 55
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2647-2835 |
1.60e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.91 E-value: 1.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2647 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2726
Cdd:COG3469 40 TTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGA 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2727 GPQSSLSTHLPLFSTLSVTPTTEGLNTQSTpipaTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2806
Cdd:COG3469 120 GSVTSTTSSTAGSTTTSGASATSSAGSTTT----TTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATT 195
|
170 180
....*....|....*....|....*....
gi 1907182170 2807 TTgvsletsvqTTIASPTPSAPQTSLATH 2835
Cdd:COG3469 196 PS---------ATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2372-2583 |
3.27e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.14 E-value: 3.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2372 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrtthPSTTVAVSGTVHTTGLPSGTSVQTTTNFPT 2451
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTT----GSVVVAASGSAGSGTGTTAASSTAATSSTT 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2452 HSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2531
Cdd:COG3469 77 STTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 2532 KHTTGVsleTSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTP 2583
Cdd:COG3469 157 ETATGG---TTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTT 205
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2715-3170 |
5.40e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 49.94 E-value: 5.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2715 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTS--------GTTSS 2786
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPaanepdphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2787 PQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSA------PQTSLATHLPFSSTSSVTPTSEVIITPTPqhtl 2860
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAarptvgSLTSLADPPPPPPTPEPAPHALVSATPLP---- 2722
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2861 ssASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAhqstPTAVSANSIKP 2940
Cdd:PHA03247 2723 --PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR----PAVASLSESRE 2796
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2941 TMSSTGTPVVHTTSGTTSSPQTPRTTHPSttvavsgtvhtTGLPSGTSVHTTTNFPTHSGPQSSLSTH---LPLFSTLSV 3017
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPAAALPPAASPA-----------GPLPPPTSAQPTAPPPPPGPPPPSLPLGgsvAPGGDVRRR 2865
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3018 TPTTEGLNTPTSPHSLSVASTSMPlmTVLPTTLEGTRPPHTSVPVTYTTTAATQTkssfSTDRTSAPHLSQPSTVTPTQS 3097
Cdd:PHA03247 2866 PPSRSPAAKPAAPARPPVRRLARP--AVSRSTESFALPPDQPERPPQPQAPPPPQ----PQPQPPPPPQPQPPPPPPPRP 2939
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3098 TPIPATTNSLMTTGGLTGTPP-----------VHTTSGTTSSPQTPRTTHPFSTvavsNTKHTTGVSLETSVQTTIASPT 3166
Cdd:PHA03247 2940 QPPLAPTTDPAGAGEPSGAVPqpwlgalvpgrVAVPRFRVPQPAPSREAPASST----PPLTGHSLSRVSSWASSLALHE 3015
|
....
gi 1907182170 3167 PSAP 3170
Cdd:PHA03247 3016 ETDP 3019
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3222-3409 |
5.80e-05 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 49.37 E-value: 5.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3222 TGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSS 3301
Cdd:COG3469 28 TAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANT 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3302 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLM 3381
Cdd:COG3469 108 GTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATA 187
|
170 180
....*....|....*....|....*...
gi 1907182170 3382 TTVGLTGTPPVHTTSGTTSSPQTPRTTH 3409
Cdd:COG3469 188 TTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
3871-4105 |
9.62e-05 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 48.73 E-value: 9.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3871 SHPSS---TTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGT--TSSPQTPHSTHpISTAAISrttgISGTPFRTPMKTTI 3945
Cdd:COG5422 59 SKESFgkyALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATssTSSLNSNDGDQ-FSPASDS----LSFNPSSTQSRKDS 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3946 TFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLStslPTTIKGTGTPQTPVSDINTTSATTQAHS 4025
Cdd:COG5422 134 GPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEI---PSLGSQSMQLPSPHFRQKFSSSDTSNGF 210
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4026 SFPTTRTSTSHlslpSSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPT-----ASPPSSAPTFVSPTAASTVISSAL 4100
Cdd:COG5422 211 SYPSIRKNSRH----SSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSssnseAMSTSSKRPYIYPALLSRVAVEFK 286
|
....*
gi 1907182170 4101 PTIHM 4105
Cdd:COG5422 287 MRLQL 291
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
1987-3607 |
1.64e-04 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 48.22 E-value: 1.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1987 TTIGKTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTS 2066
Cdd:COG3210 80 GIGAAAANTAGTLETGLTSNIGGGSVNGSNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAG 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2067 GTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLS 2146
Cdd:COG3210 160 NNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAG 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2147 VASTSMplmTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLT 2226
Cdd:COG3210 240 VISTGG---TDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGT 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2227 GTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVII 2306
Cdd:COG3210 317 AAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASS 396
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2307 TPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 2386
Cdd:COG3210 397 TTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSA 476
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2387 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLF 2466
Cdd:COG3210 477 GNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTT 556
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2467 STLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTT 2546
Cdd:COG3210 557 AASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTG 636
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2547 IASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQT 2626
Cdd:COG3210 637 SAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVT 716
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2627 KTSFSTDRTS-------TSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRT--THPST 2697
Cdd:COG3210 717 GQIGALANANgdtvtfgNLGTGATLTLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATLDNagAEISI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2698 TVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPV 2777
Cdd:COG3210 797 DITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAAS 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2778 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQ 2857
Cdd:COG3210 877 ITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSA 956
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2858 HTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANS 2937
Cdd:COG3210 957 ASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGT 1036
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2938 IKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSV 3017
Cdd:COG3210 1037 AATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGV 1116
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3018 TPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSAPHLSQPSTVTPTQS 3097
Cdd:COG3210 1117 TASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGT 1196
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3098 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATH 3177
Cdd:COG3210 1197 DLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGN 1276
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3178 LPFSSTSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTGSPHTSVPVIYTTSTITQTKTSFFTDRTSTSTSAP 3257
Cdd:COG3210 1277 AGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNGGN 1356
|
1290 1300 1310 1320 1330 1340 1350 1360
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3258 HLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF 3337
Cdd:COG3210 1357 GATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGTGG 1436
|
1370 1380 1390 1400 1410 1420 1430 1440
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3338 PTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVS 3417
Cdd:COG3210 1437 TGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTAGGTT 1516
|
1450 1460 1470 1480 1490 1500 1510 1520
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3418 NTKHTTGVSLETSVQTTIASPTPSAPQTSLAThlpfsstsSVTPTSEVIITPTPQHTLSSASTSTTTGNILPTTIGQTGS 3497
Cdd:COG3210 1517 AEVAKASLEGGEGTYGGSSVAEAGTGGGILGA--------VSGAGSEGGAAGGVTGSVGVGGTDGAGGDTGGADDTGAQA 1588
|
1530 1540 1550 1560 1570 1580 1590 1600
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3498 PHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQT 3577
Cdd:COG3210 1589 PTAGNTATLTLSLAEGTNAEYGGTTNVTSGTAGNAGATGANSNTVVTTNGGEGVLALVAGGNTTNGTTLSGAVNGAGNGW 1668
|
1610 1620 1630
....*....|....*....|....*....|
gi 1907182170 3578 PRTTHPSTTVAVSGTVHTTGLPSGTSVHTT 3607
Cdd:COG3210 1669 AVDLTDATLAGLGGATTAAAGNVATGDTAP 1698
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2068-2422 |
1.73e-04 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 48.07 E-value: 1.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2068 TTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSgpqsslSTHLPLFSTLSVTPTTEGL-----NTPTSP 2142
Cdd:TIGR00927 75 VSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENTPSPPRRT------AKITPTTPKNNYSPTAAGTervkeDTPATP 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2143 -----HSLSVASTSMpLMTVLPTT---LEGTRPPHTSVPV-MYTTTAATQTKSSFSTDRTSTphLSQSSTVTPTQSTpip 2213
Cdd:TIGR00927 149 sralnHYISTSGRQR-VKSYTPKPrgeVKSSSPTQTREKVrKYTPSPLGRMVNSYAPSTFMT--MPRSHGITPRTTV--- 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2214 aTTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFS 2293
Cdd:TIGR00927 223 -KDSEITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLV 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2294 STSSVTPTSEVIITPTPQHTLSSASTSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPT 2366
Cdd:TIGR00927 302 GKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA 381
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 2367 sAPHLSETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2422
Cdd:TIGR00927 382 -TPRVRAVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2030-2248 |
1.91e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 47.44 E-value: 1.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2030 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2109
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2110 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVMYTTTAATQTKSSF 2188
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2189 STDRTSTPHlsqSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2248
Cdd:COG3469 159 ATGGTTTTS---TTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1896-2311 |
1.96e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 1.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1896 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSKVIITPTPQHTLSSA 1975
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1976 STSTTTGnilPTTIGKTGSPHTSVPviyTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 2055
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2056 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 2134
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2135 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 2209
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2210 TPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTT 2288
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182170 2289 HLPFSSTSSVTPTSEVIITPTPQ 2311
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2600-2794 |
3.11e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.67 E-value: 3.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2600 LPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSiKPTMSSTGTPVVHT 2679
Cdd:COG3469 22 LLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA-AAATSTSATLVATS 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2680 TSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTQSTPIP 2759
Cdd:COG3469 101 TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPS 180
|
170 180 190
....*....|....*....|....*....|....*
gi 1907182170 2760 ATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTH 2794
Cdd:COG3469 181 ATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3892-4129 |
3.48e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.67 E-value: 3.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3892 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSS 3971
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3972 TEIFNTPtnphsvsSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 4051
Cdd:COG3469 82 ATAAAAA-------ATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVS 154
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 4052 RSASTLQYTPTPSSVShspllTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV 4129
Cdd:COG3469 155 GTETATGGTTTTSTTT-----TTTSASTTPSATTTATATTASG-----------ATTPSATTTATTTGPPTPGLPKHV 216
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
3769-4047 |
3.58e-04 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 46.81 E-value: 3.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3769 FRTSEQSTTTFPTPSAPQTSLVTSLPPfsTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTH 3848
Cdd:COG5422 17 FGAPRKSDAFVSKQLLPPRRLQRKLNP--ISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITH 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3849 STSAT--TEAQGSFS---TERTSTSYLSHPSSTTVHQSTAGPvitsikstmgvTGTPpvhttSGTTSSPQTPHSTHPIST 3923
Cdd:COG5422 95 SPSATssTSSLNSNDgdqFSPASDSLSFNPSSTQSRKDSGPG-----------DGSP-----VQKRKNPLLPSSSTHGTH 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3924 AAISrTTGISGTPFRTPMKTTiTFPTPSSLQTSMATLFPPF--STSVMSSTEIFNTP---TNPHSVSSASTSRPLSTSLP 3998
Cdd:COG5422 159 PPIV-FTDNNGSHAGAPNARS-RKEIPSLGSQSMQLPSPHFrqKFSSSDTSNGFSYPsirKNSRHSSNSMPSFPHSSTAV 236
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1907182170 3999 TTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTL 4047
Cdd:COG5422 237 LLKRHSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYPALLSRVAVEF 285
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3545-4118 |
3.68e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 47.24 E-value: 3.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3545 PTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqSSLSTHL 3624
Cdd:PHA03247 2560 PPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP-SPAANEP 2638
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3625 PLFSTLSVTPTTEGLNTPTSPH-SLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTT------------AATQTKSSF 3691
Cdd:PHA03247 2639 DPHPPPTVPPPERPRDDPAPGRvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpppppptpePAPHALVSA 2718
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3692 STDRTSTPHLSQSSTVTPTQPTPiPATTNSPMTTVGLTGTPVVHTPSGTSSI------AHTPHTTHSLPTAASSSTTLST 3765
Cdd:PHA03247 2719 TPLPPGPAAARQASPALPAAPAP-PAVPAGPATPGGPARPARPPTTAGPPAPappaapAAGPPRRLTRPAVASLSESRES 2797
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3766 APQFRTSEQSTT--TFPTPSAPQTSLVTSL--PPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVStilqttievTTPPN 3841
Cdd:PHA03247 2798 LPSPWDPADPPAavLAPAAALPPAASPAGPlpPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVR---------RRPPS 2868
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3842 TSTPVTHSTSAtteaqgsfsteRTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHT-TSGTTSSPQTPHSTHP 3920
Cdd:PHA03247 2869 RSPAAKPAAPA-----------RPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQpQPPPPPQPQPPPPPPP 2937
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3921 ISTAAISRTTGISGTPfrtpmkttitFPTPSSLQTSMATLFPPfstsvmssteifNTPTNPHSVSSASTSRPLSTSLPTT 4000
Cdd:PHA03247 2938 RPQPPLAPTTDPAGAG----------EPSGAVPQPWLGALVPG------------RVAVPRFRVPQPAPSREAPASSTPP 2995
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4001 IKGTGTPQTpvsdinTTSATTQAHSSFPTTRtstshlslPSSMTSTLTPAS----RSASTLQYTPTPSSVSHSPLLTTPT 4076
Cdd:PHA03247 2996 LTGHSLSRV------SSWASSLALHEETDPP--------PVSLKQTLWPPDdtedSDADSLFDSDSERSDLEALDPLPPE 3061
|
570 580 590 600
....*....|....*....|....*....|....*....|....*....
gi 1907182170 4077 ASPPSSAPTFVSPTAAStviSSALPTIHMTPTP-------SSRPTSSTG 4118
Cdd:PHA03247 3062 PHDPFAHEPDPATPEAG---ARESPSSQFGPPPlsanaalSRRYVRSTG 3107
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
2358-2837 |
3.96e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.86 E-value: 3.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2358 STDRT-STPTSAPHLSEtSAVTAHQSTPTAvsansiKPTMSSTGTPVVHTTSGTTSSPQT--PRTTHPSTTVAVSGTVHT 2434
Cdd:PHA03247 2563 APDRSvPPPRPAPRPSE-PAVTSRARRPDA------PPQSARPRAPVDDRGDPRGPAPPSplPPDTHAPDPPPPSPSPAA 2635
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2435 TGLPSGTSVQTTTNFPTHSGPQSSlSTHLPLFSTLSVTPTTEGLNTQSTPIPATTnslmttggltgtPPVHTTSGTTSSP 2514
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPG-RVSRPRRARRLGRAAQASSPPQRPRRRAAR------------PTVGSLTSLADPP 2702
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2515 QTPRTTHPFSTVAVSntkhttgvsletsvqttiASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSAstst 2594
Cdd:PHA03247 2703 PPPPTPEPAPHALVS------------------ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP---- 2760
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2595 ttgnilPTTigqTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGT 2674
Cdd:PHA03247 2761 ------PTT---AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPP 2831
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2675 PVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTnfPTHSgPQSSLSTHLPLFSTLSVTPTTEGLNTQ 2754
Cdd:PHA03247 2832 TSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAA--PARP-PVRRLARPAVSRSTESFALPPDQPERP 2908
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2755 STPI----PATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSV-QTTIASPTPSAPQ 2829
Cdd:PHA03247 2909 PQPQapppPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVpRFRVPQPAPSREA 2988
|
....*...
gi 1907182170 2830 TSLATHLP 2837
Cdd:PHA03247 2989 PASSTPPL 2996
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
1554-1969 |
4.38e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.45 E-value: 4.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1554 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 1633
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1634 STSTTTGnilPTTIGQTGSPHTSVPviyTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMS 1713
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSP---TPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLG 575
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1714 STGTpvvhTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTnfPTHSGPQSSLSTH-LPLFSTLSVTPTTE 1792
Cdd:pfam05109 576 KTSP----TSAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSP--PKNATSAVTTGQHnITSSSTSSMSLRPS 649
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1793 GLNTPTSPHSLSVASTSMPLMT-VLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSST----VTPTQS 1867
Cdd:pfam05109 650 SISETLSPSTSDNSTSHMPLLTsAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTkpgeVNVTKG 729
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1868 TPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPR-TTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLAT 1946
Cdd:pfam05109 730 TPPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKhTTGHGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809
|
410 420
....*....|....*....|...
gi 1907182170 1947 HLPFSSTSSVTPTSKVIITPTPQ 1969
Cdd:pfam05109 810 RWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2228-2559 |
4.55e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 46.68 E-value: 4.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2228 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLTTHLPFSSTSSVTPTSEVIIT 2307
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2308 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2387
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2388 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPqsslSTH- 2462
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHp 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2463 -----LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2537
Cdd:pfam03154 413 pplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGI 492
|
330 340
....*....|....*....|..
gi 1907182170 2538 SLETSVQTTIASPTPSAPQTSL 2559
Cdd:pfam03154 493 QPPSSASVSSSGPVPAAVSCPL 514
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3892-4190 |
5.45e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 46.47 E-value: 5.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3892 STMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPfRTPMKTTITfPTPSSLqTSMATLFPPFST---SV 3968
Cdd:PHA03247 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPP-QRPRRRAAR-PTVGSL-TSLADPPPPPPTpepAP 2712
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3969 MSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLT 4048
Cdd:PHA03247 2713 HALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4049 PASRSASTLQYTPTPSSVSHSPLLTTPTASPPssAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSH 4128
Cdd:PHA03247 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASP--AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 4129 VPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLPTSA 4190
Cdd:PHA03247 2871 PAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3250-3473 |
5.56e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.90 E-value: 5.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3250 TSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGT 3329
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3330 SVQTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTTH 3409
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSG 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 3410 PFSTVAVSNTKHTTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 3473
Cdd:COG3469 156 TETATGGTTTTSTTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3542-3915 |
6.74e-04 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 45.72 E-value: 6.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3542 QSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLS 3621
Cdd:pfam17823 45 DAVPRADNKSSEQ*NFCAATAAPAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPS 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3622 THLPLFSTLSVTPTTEGLNTPTSphslSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHL 3701
Cdd:pfam17823 125 SAAQSLPAAIAALPSEAFSAPRA----AACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAAS 200
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3702 SQSSTVTPTQPTPIPAT-TNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFP 3780
Cdd:pfam17823 201 SAPATLTPARGISTAATaTGHPAAGTALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRL 280
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3781 TPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVS--MSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQG 3858
Cdd:pfam17823 281 SPAKHMPSDTMARNPAAPMGAQAQGPIIQVSTDQPVHNTAGepTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSA 360
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907182170 3859 --------SFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTP 3915
Cdd:pfam17823 361 spvpvlhtSMIPEVEATSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDP 425
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2125-2312 |
6.82e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.51 E-value: 6.82e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2125 STLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPvmYTTTAATQTKSSFSTDRTSTPHLSQSSTV 2204
Cdd:COG3469 33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT--ATAAAAAATSTSATLVATSTASGANTGTS 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2205 TPTQ-STPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQ 2283
Cdd:COG3469 111 TVTTtSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTA 190
|
170 180
....*....|....*....|....*....
gi 1907182170 2284 TSLTThlpfSSTSSVTPTSEVIITPTPQH 2312
Cdd:COG3469 191 SGATT----PSATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3908-4187 |
7.20e-04 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 46.06 E-value: 7.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3908 TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMatlfPPFSTSVMSSTEIFNTPTNPHSVSSA 3987
Cdd:pfam05109 392 TVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTTGFAA----PNTTTGLPSSTHVPTNLTAPASTGPT 467
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3988 STSRPLSTSLPTTIKGTGTPQTPVSDI----------NTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTL 4057
Cdd:pfam05109 468 VSTADVTSPTPAGTTSGASPVTPSPSPrdngteskapDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSA 547
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4058 QYTPTPSSVSHSPLLTTPTASPPSSAPTFVSPTAASTvissalptihmTPTPSSRPTSSTGLLSTSKTTSHV--PTFSSF 4135
Cdd:pfam05109 548 VTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVT-----------TPTPNATSPTVGETSPQANTTNHTlgGTSSTP 616
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 4136 SSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4187
Cdd:pfam05109 617 VVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMP 668
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2647-2858 |
9.03e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.13 E-value: 9.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2647 TSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2726
Cdd:COG3469 14 GASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTS 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2727 GPQSSLSTHLPLFSTLSVTPTTeglntqSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKH 2806
Cdd:COG3469 94 ATLVATSTASGANTGTSTVTTT------STGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTS 167
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 2807 TTGVSLeTSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEviiTPTPQH 2858
Cdd:COG3469 168 TTTTTT-SASTTPSATTTATATTASGATTPSATTTATTTGPPT---PGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3704-3951 |
1.15e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 45.13 E-value: 1.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3704 SSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTThSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPS 3783
Cdd:COG3469 3 SVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSV-VVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3784 APQTSLVTSLPPFSTSSVSPTDeihitstnphtvssvsmsrPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTE 3863
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTAS-------------------GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATS 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3864 RTSTSYLSHPSSTTVHqstagpvitsikstmGVTGTPPVHTTSGTTSSPQTPhSTHPISTAAISRTTGISGTPFRTPMKT 3943
Cdd:COG3469 143 SAGSTTTTTTVSGTET---------------ATGGTTTTSTTTTTTSASTTP-SATTTATATTASGATTPSATTTATTTG 206
|
....*...
gi 1907182170 3944 TITFPTPS 3951
Cdd:COG3469 207 PPTPGLPK 214
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3695-4093 |
1.25e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.14 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3695 RTSTPHLSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQ 3774
Cdd:pfam03154 168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHP 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3775 STTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPV-THSTSAT 3853
Cdd:pfam03154 248 PLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGqSQQRIHT 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3854 TEAQGSFSTERTSTSYLSHPSSTTVhqstagPVItsikstmgvtgTPPVHTTSGTTSSPQT-PHSTHPISTAAISRTTGI 3932
Cdd:pfam03154 328 PPSQSQLQSQQPPREQPLPPAPLSM------PHI-----------KPPPTTPIPQLPNPQShKHPPHLSGPSPFQMNSNL 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3933 SGTPFRTPMKTTITFPTPSS-------LQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTT--IKG 4003
Cdd:pfam03154 391 PPPPALKPLSSLSTHHPPSAhppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHpfVPG 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4004 TGTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSvshspllTTPTASPPSSA 4083
Cdd:pfam03154 471 GPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPES-------PPPPPRSPSPE 543
|
410
....*....|.
gi 1907182170 4084 PTFV-SPTAAS 4093
Cdd:pfam03154 544 PTVVnTPSHAS 554
|
|
| Hamartin |
pfam04388 |
Hamartin protein; This family includes the hamartin protein which is thought to function as a ... |
3836-4119 |
1.26e-03 |
|
Hamartin protein; This family includes the hamartin protein, which is thought to function as a tumour suppressor. The hamartin protein interacts with the tuberin protein (pfam03542). Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterized by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation of either the TSC1 or the TSC2 tumour suppressor gene. TSC1 encodes a protein, hamartin, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. The TSC2 gene codes for tuberin (pfam03542). These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.
Pssm-ID: 461287 [Multi-domain] Cd Length: 730 Bit Score: 45.05 E-value: 1.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3836 VTTPPNTSTpvTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTG--TPPvhTTSGT--TSS 3911
Cdd:pfam04388 276 PTASPYTDQ--QSSYGSSTSTPSSTPRLQLSSSSGTSPPYLSPPSIRLKTDSFPLWSPSSVCGmtTPP--TSPGMvpTTP 351
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3912 PQTPHST-HPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTS 3990
Cdd:pfam04388 352 SELSPSSsHLSSRGSSPPEAAGEATPETTPAKDSPYLKQPPPLSDSHVHRALPASSQPSSPPRKDGRSQSSFPPLSKQAP 431
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3991 RPLSTSLPTTIKGTGTP--QTPVSDINTT----------------------SATTQAHSSFPTTR------TSTSHLSLP 4040
Cdd:pfam04388 432 TNPNSRGLLEPPGDKSSvtLSELPDFIKDlalssedsvegaeeeaaisqelSEITTEKNETDCSRggldmpFSRTMESLA 511
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 4041 SSMTSTLTPASRSASTLQYTPTPSSVSHSPLLTTPTASPPSSAPTFvSPTAASTVISSalPTIHMTPTPSSRPTSSTGL 4119
Cdd:pfam04388 512 GSQRSRNRIASYCSSTSQSDSHGPATTPESKPSALAEDGLRRTKSC-SFKQSFTPIEQ--PIESSDDCPTDEQDGENGL 587
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3533-3753 |
1.47e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.74 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3533 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 3612
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3613 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 3692
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 3693 TDRTSTPhlSQSSTVTPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSL 3753
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKHVL 217
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2660-2834 |
1.55e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2660 VSANSIKPTMSSTGTPVVHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 2738
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2739 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 2816
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182170 2817 QTTIASPTPSAPQTSLAT 2834
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3874-4101 |
1.61e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3874 SSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTpfrtpmkttiTFPTPSSL 3953
Cdd:COG3469 4 VSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTA----------ASSTAATS 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3954 QTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGTGTPQTPVSDINTTSATTQAHSSFPTTRTS 4033
Cdd:COG3469 74 STTSTTATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTV 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 4034 TSHLSLPSSMTSTLTPASRSASTLQYTPTPSsvshspllTTPTASPPSSAPTFVSPTAASTVISSALP 4101
Cdd:COG3469 154 SGTETATGGTTTTSTTTTTTSASTTPSATTT--------ATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3399-3857 |
1.62e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 44.91 E-value: 1.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3399 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 3478
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3479 STSTTTGnilPTTIGQTGSPHTSVPVIYTTS----AITQTKTSFSTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIK 3554
Cdd:pfam05109 502 KAPDMTS---PTSAVTTPTPNATSPTPAVTTptpnATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTS 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3555 PTmSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTP 3634
Cdd:pfam05109 579 PT-SAVTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSP 657
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3635 TTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTP 3714
Cdd:pfam05109 658 STSDNSTSHMPLLTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNATS 737
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3715 --IPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSttlSTAPQFRTSEQSTTTFPTPSAPQTSLVTS 3792
Cdd:pfam05109 738 pqAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGARTSTEPTTDYGGD---STTPRTRYNATTYLPPSTSSKLRPRWTFT 814
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 3793 LPPFSTSS----VSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQ 3857
Cdd:pfam05109 815 SPPVTTAQatvpVPPTSQPRFSNLSMLVLQWASLAVLTLLLLLVMADCAFRRNLSTSHTYTTPPYDDAE 883
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2397-2615 |
1.74e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2397 SSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSTHLPlfSTLSVTPTTE 2476
Cdd:COG3469 5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAAT--SSTTSTTATA 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2477 GLNTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQ 2556
Cdd:COG3469 83 TAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGG 162
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 2557 TSlathlpfsSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSPHTSVP 2615
Cdd:COG3469 163 TT--------TTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3275-3449 |
1.77e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3275 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNF-PTHSGPQSSLSTHLPL 3353
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTtAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3354 FSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPVHTTSGTTSSPQTPRTT--HPFSTVAVSNTKHTTGVSLETSV 3431
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTsgASATSSAGSTTTTTTVSGTETAT 160
|
170
....*....|....*...
gi 1907182170 3432 QTTIASPTPSAPQTSLAT 3449
Cdd:COG3469 161 GGTTTTSTTTTTTSASTT 178
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2918-3132 |
1.82e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2918 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 2997
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2998 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPH-TSVPVTYTTTAATQTKSSF 3076
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGAsATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182170 3077 STDRTSAPHLSQPSTVTPTQSTPIPATTnslmTTGGLTGTPPVHTTSGTTSSPQTP 3132
Cdd:COG3469 159 ATGGTTTTSTTTTTTSASTTPSATTTAT----ATTASGATTPSATTTATTTGPPTP 210
|
|
| VWC |
pfam00093 |
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with ... |
360-395 |
1.86e-03 |
|
von Willebrand factor type C domain; The high cutoff was used to prevent overlap with pfam00094.
Pssm-ID: 278520 Cd Length: 57 Bit Score: 39.33 E-value: 1.86e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1907182170 360 CMLNGMVYGPGEITKTA-CQTCQCTMGRWTCTKQPCP 395
Cdd:pfam00093 1 CVQNGVVYENGETWKPDlCTICTCDDGKVLCDKIICP 37
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1688-1906 |
1.91e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 44.36 E-value: 1.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1688 SETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPT 1767
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1768 HSGPQSSLSTHLPLfsTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFS 1847
Cdd:COG3469 81 TATAAAAAATSTSA--TLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTET 158
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1907182170 1848 TDRTSTPhlSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTH 1906
Cdd:COG3469 159 ATGGTTT--TSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2870-3061 |
2.08e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2870 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 2949
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2950 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 3027
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182170 3028 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 3061
Cdd:COG3469 182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1640-1831 |
2.08e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1640 GNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPhlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPV 1719
Cdd:COG3469 24 GAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST--AATSSTTSTTATATAAAAAATSTSATLVATST 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1720 VHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP--SGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTP 1797
Cdd:COG3469 102 ASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSgaSATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSA 181
|
170 180 190
....*....|....*....|....*....|....
gi 1907182170 1798 TSPHSLSVASTSMPLMTVLPTTleGTRPPHTSVP 1831
Cdd:COG3469 182 TTTATATTASGATTPSATTTAT--TTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2055-2264 |
2.08e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2055 SSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSThlplfSTLSVTPTTE 2134
Cdd:COG3469 5 STAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASST-----AATSSTTSTT 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2135 GLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSVPVMYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQSTPIPA 2214
Cdd:COG3469 80 ATATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETA 159
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2215 TTNSlmTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGV 2264
Cdd:COG3469 160 TGGT--TTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGP 207
|
|
| VWC_out |
smart00215 |
von Willebrand factor (vWF) type C domain; |
830-887 |
2.15e-03 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214565 Cd Length: 67 Bit Score: 39.47 E-value: 2.15e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 830 CDYAGVSYPGGFELHTDCKTCTCSQGRWTCQlSTQCPSTCVLYGEGHIITFDGQRFVF 887
Cdd:smart00215 1 CWNNGSYYPPGAKWDDDCNRCTCLNGRVSCT-KVWCGPKPCLLHNLSGECPLGQGCVP 57
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2644-2968 |
2.40e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 44.22 E-value: 2.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2644 LSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTSSSPQTPRT------THPSTTVAVSGTVHTTGLPSGTSVQ 2717
Cdd:TIGR00927 91 LAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSRALNHYISTSGRQRVKSYT 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2718 TTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSLMTTGGLTGTPPVHTTSGT 2783
Cdd:TIGR00927 169 PKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEITATYKMLETNPSKRTAGK 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2784 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQHTLSSA 2863
Cdd:TIGR00927 246 TTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQV 325
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2864 STSTTMGNILPTTIGQTG-------SPHTSVPVIYTTSAITQTKTSFSTDRTSTPTsAPHLSETSAVTAHQST---PTAV 2933
Cdd:TIGR00927 326 TISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASATFRGLEKNPSTAPSTPA-TPRVRAVLTTQVHHCVvvkPAPA 404
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182170 2934 SANSIKPTMSSTGTPVVHTTSGTTSSPQTPrTTHP 2968
Cdd:TIGR00927 405 VPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
| VWC |
smart00214 |
von Willebrand factor (vWF) type C domain; |
360-395 |
2.42e-03 |
|
von Willebrand factor (vWF) type C domain;
Pssm-ID: 214564 Cd Length: 59 Bit Score: 39.04 E-value: 2.42e-03
10 20 30
....*....|....*....|....*....|....*...
gi 1907182170 360 CMLNGMVYGPGEITKT-ACQTCQCTMGRW-TCTKQPCP 395
Cdd:smart00214 1 CVHNGRVYNDGETWKPdPCQICTCLDGTTvLCDPVECP 38
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3628-3795 |
2.49e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.97 E-value: 2.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3628 STLSVTPTTEGLNTPTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTV 3707
Cdd:COG3469 33 TLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTGTSTV 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3708 TPTQPTPIPATTNSPMTTVGLTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQF---RTSEQSTTTFPTPSA 3784
Cdd:COG3469 113 TTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSastTPSATTTATATTASG 192
|
170
....*....|.
gi 1907182170 3785 PQTSLVTSLPP 3795
Cdd:COG3469 193 ATTPSATTTAT 203
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
2774-3137 |
2.67e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 2.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2774 TPPVHTTSGTTSSPqTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIIT 2853
Cdd:pfam03154 186 PPPPGTTQAATAGP-TPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQ 264
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2854 PTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAV 2933
Cdd:pfam03154 265 PLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQ 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2934 SANSIKPtMSSTGTPVVHTT-SGTTSSPQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHL 3009
Cdd:pfam03154 338 QPPREQP-LPPAPLSMPHIKpPPTTPIPQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHP 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3010 PlfsTLSVTPTTEGLNTP-------TSPHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrT 3081
Cdd:pfam03154 413 P---PLQLMPQSQQLPPPpaqppvlTQSQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--S 487
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1907182170 3082 SAPHLSQPSTVTPTQSTPIPATTNSLMttggltgtPPVHTTSGTTSSPQTPRTTHP 3137
Cdd:pfam03154 488 AMPGIQPPSSASVSSSGPVPAAVSCPL--------PPVQIKEEALDEAEEPESPPP 535
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3087-3447 |
2.81e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 43.99 E-value: 2.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3087 SQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSgtTSSPQTPRTTHPFSTVAVSNTKHTTgvsletsvqttIASPT 3166
Cdd:pfam03154 169 TQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPS--VPPQGSPATSQPPNQTQSTAAPHTL-----------IQQTP 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3167 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtLSSASTSTTMGNILpttigQTGSPHTSVPViyttstitqTKTSFF 3246
Cdd:pfam03154 236 TLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ--PSLHGQMPPMPHSL-----QTGPSHMQHPV---------PPQPFP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3247 TDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLP 3326
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3327 SGTSVQTTTNFPTHSG--PQSSLSTH------------LPLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLTGTPPV 3392
Cdd:pfam03154 380 GPSPFQMNSNLPPPPAlkPLSSLSTHhppsahppplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQ 459
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1907182170 3393 HTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSL 3447
Cdd:pfam03154 460 SPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2511-2827 |
2.83e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 43.75 E-value: 2.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2511 TSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLA---THLPFSSTSAVTPT------------- 2574
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAdvtSPTPAGTTSGASPVtpspsprdngtes 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2575 --------SEVIITPTPQHTFSSASTSTTTGNILPTTIGQTGSphTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHLSE 2646
Cdd:pfam05109 502 kapdmtspTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSP--TSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2647 TSAVTAHQSTPTAVSANSIKPTMSSTGtpvvHTTSGTSSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHS 2726
Cdd:pfam05109 580 TSAVTTPTPNATSPTVGETSPQANTTN----HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETL 655
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2727 GPQSS--LSTHLPLFStlSVTPTTEGLNTQSTPIPATTNSLMTTgglTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNT 2804
Cdd:pfam05109 656 SPSTSdnSTSHMPLLT--SAHPTGGENITQVTPASTSTHHVSTS---SPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGT 730
|
330 340
....*....|....*....|...
gi 1907182170 2805 KHTTGVSLETSVQTTIASPTPSA 2827
Cdd:pfam05109 731 PPKNATSPQAPSGQKTAVPTVTS 753
|
|
| COG5099 |
COG5099 |
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal ... |
3690-4069 |
3.36e-03 |
|
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis];
Pssm-ID: 227430 [Multi-domain] Cd Length: 777 Bit Score: 43.58 E-value: 3.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3690 SFSTDRTSTPHLSQSSTVTpTQPTPIPATTNSPMTTVGlTGTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTapqf 3769
Cdd:COG5099 38 STPNSFSPIPSKASSSATF-TLNLPINNSVNHKITSSS-SSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPST---- 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3770 rtseqSTTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHI-TSTNPHTVSSVSMSRPVSTILQttiEVTTPPNTSTPVTH 3848
Cdd:COG5099 112 -----SSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSnKLPLPNPNHSNSATTNQSGSSF---INTPASSSSQPLTN 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3849 STSATTEAQGSFSTERTSTSYLSHPSSTTVhqSTAGPVITSIkstmGVTGTPPVHTTSGTTSSPQTPHsTHPISTAAISR 3928
Cdd:COG5099 184 LVVSSIKRFPYLTSLSPFFNYLIDPSSDSA--TASADTSPSF----NPPPNLSPNNLFSTSDLSPLPD-TQSVENNIILN 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3929 TTGISGTPFRTPMKTTI--TFPTPSSLQTSMATLFPP-FSTSVMSSTEIFNT----PTNPHSVSSASTSRPLSTSLPTTI 4001
Cdd:COG5099 257 SSSSINELTSIYGSVPSirNLRGLNSALVSFLNVSSSsLAFSALNGKEVSPTgspsTRSFARVLPKSSPNNLLTEILTTG 336
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 4002 KGTGTPQTPVSDINTTSATTQAHSsfpttrTSTSHLSLPSSMTSTLTPASRSASTLQYTPTPSSVSHS 4069
Cdd:COG5099 337 VNPPQSLPSLLNPVFLSTSTGFSL------TNLSGYLNPNKNLKKNTLSSLSNLGYSSNVPSPSSSES 398
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
3642-4013 |
3.38e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in Chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 43.41 E-value: 3.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3642 PTSPHSLSAASTSMPLMTVLPTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTDRTSTPHLSQSSTVTPTQPTPIPATTns 3721
Cdd:pfam17823 66 APAPVTLTKGTSAAHLNSTEVTAEHTPHGTDLSEPATREGAADGAASRALAAAASSSPSSAAQSLPAAIAALPSEAFS-- 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3722 pmttvgltgTPVVHTPSGTSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPSAPQTSLVTSLPPFSTSSV 3801
Cdd:pfam17823 144 ---------APRAAACRANASAAPRAAIAAASAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGIST 214
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3802 SPTDEIHITSTNphTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTEAqGSFSTERTSTSYLSHPSSTTVHQS 3881
Cdd:pfam17823 215 AATATGHPAAGT--ALAAVGNSSPAAGTVTAAVGTVTPAALATLAAAAGTVASAA-GTINMGDPHARRLSPAKHMPSDTM 291
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3882 TAGPVITSIKSTMG----VTGTPPVHTTSG--------TTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPT 3949
Cdd:pfam17823 292 ARNPAAPMGAQAQGpiiqVSTDQPVHNTAGeptpspsnTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMI 371
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 3950 PSSLQTSMATLFPPFSTSVMSSTEifNTPTNPHSVSSASTsrPLSTSLPTTIKGTGTPQTPVSD 4013
Cdd:pfam17823 372 PEVEATSPTTQPSPLLPTQGAAGP--GILLAPEQVATEAT--AGTASAGPTPRSSGDPKTLAMA 431
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
3841-4069 |
3.60e-03 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 43.84 E-value: 3.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3841 NTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHP 3920
Cdd:NF033849 250 STSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSS 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3921 ISTAAISRTTGISGT-----PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLST 3995
Cdd:NF033849 330 SYNVSSGTGVSSSHSdgtsqSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGA 409
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1907182170 3996 SLPTTiKGTGTpQTPVSDINTTSATTQAHSSFPTTRTSTSHlSLPSSMTSTLTpASRSASTLQYTPTPSSVSHS 4069
Cdd:NF033849 410 SQGGS-EGWGS-GDSVQSVSQSYGSSSSTGTSSGHSDSSSH-STSSGQADSVS-QGTSWSEGTGTSQGQSVGTS 479
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1623-1800 |
4.01e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 43.20 E-value: 4.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1623 TPTPQHTLSSASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTA 1702
Cdd:COG3469 29 AASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATSTSATLVATSTASGANTG 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1703 VSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLF 1782
Cdd:COG3469 109 TSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATAT 188
|
170
....*....|....*...
gi 1907182170 1783 STLSVTPTTEGLNTPTSP 1800
Cdd:COG3469 189 TASGATTPSATTTATTTG 206
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1515-1949 |
4.35e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.39 E-value: 4.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1515 SQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSntkHTTGVSLETSVQTTIASPT 1594
Cdd:PHA03247 2614 PSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLG---RAAQASSPPQRPRRRAARP 2690
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1595 PSAPQTSLATHLPFSSTSSVTPTSEVIITPTPqhtlssASTSTTTGNILPTTIGQTGSPHTSVPVIYTTSAITQTKTSFS 1674
Cdd:PHA03247 2691 TVGSLTSLADPPPPPPTPEPAPHALVSATPLP------PGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTA 2764
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1675 TDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIkptmsstgtpvvhttsgttSSPQTPrTTHPSTTVAVSGTVHTTGLP 1754
Cdd:PHA03247 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESL-------------------PSPWDP-ADPPAAVLAPAAALPPAASP 2824
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1755 SGTSVHTTTNFPTHSGPQSSlsthlPLFSTLsvtpTTEGLNTPTSPHSLSVASTSMPLMTVLPttlegTRPPHTSVPVTY 1834
Cdd:PHA03247 2825 AGPLPPPTSAQPTAPPPPPG-----PPPPSL----PLGGSVAPGGDVRRRPPSRSPAAKPAAP-----ARPPVRRLARPA 2890
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1835 TttaatqtksSFSTDRTSTPHLSQSSTVTPTQSTPiPATTNSLMTTGGLTGTPPVHTNSGTTSSPQTPRTTHPFSTVAVS 1914
Cdd:PHA03247 2891 V---------SRSTESFALPPDQPERPPQPQAPPP-PQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
|
410 420 430
....*....|....*....|....*....|....*.
gi 1907182170 1915 NTKHTTGVSLETSV-QTTIASPTPSAPQTSLATHLP 1949
Cdd:PHA03247 2961 QPWLGALVPGRVAVpRFRVPQPAPSREAPASSTPPL 2996
|
|
| ROM1 |
COG5422 |
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction ... |
1407-1625 |
4.95e-03 |
|
RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms];
Pssm-ID: 227709 [Multi-domain] Cd Length: 1175 Bit Score: 43.34 E-value: 4.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1407 STGPPLGTSVQTTINFPTLSAPQTSLVT-----PHPGLSSSSTALTSEILKTPTSSQMVSSASPQT-IFSSIHPKTTLEA 1480
Cdd:COG5422 28 SKQLLPPRRLQRKLNPISIRNGADNDIInseskESFGKYALGHQIFSSFSSSPKLFQRRNSAGPIThSPSATSSTSSLNS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1481 TTPQHTAPLITSITSSITQAQSSFSTDKTYTSQHS------QPSTMTaHQSRSLPTVTTSTKSTMGLTG---TPPVHTTS 1551
Cdd:COG5422 108 NDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQkrknplLPSSST-HGTHPPIVFTDNNGSHAGAPNarsRKEIPSLG 186
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1907182170 1552 GTTSSPQTPRTTHPFSTVAVSNT---KHTTGVSLETSvqttiaSPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPT 1625
Cdd:COG5422 187 SQSMQLPSPHFRQKFSSSDTSNGfsyPSIRKNSRHSS------NSMPSFPHSSTAVLLKRHSGSSGASLISSNITPS 257
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
2684-3133 |
5.60e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.98 E-value: 5.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2684 SSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQSSLSThlplfstlsvtptteglntQSTPIPATTn 2763
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPNTTTGLPSSTHVPTNLTAPASTGPTVSTAD-------------------VTSPTPAGT- 481
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2764 slmtTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIASPTPSAPQTSLATHLPfsstss 2843
Cdd:pfam05109 482 ----TSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTTPTPNATSPTLGKTSP------ 544
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2844 vtptSEVIITPTPQHTLssaststtmgnilPTTIGQTGSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAV 2923
Cdd:pfam05109 545 ----TSAVTTPTPNATS-------------PTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNATSPTVGETSPQANTTNH 607
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2924 TAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPQS 3003
Cdd:pfam05109 608 TLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITQVTPA 687
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3004 SLSTHLPLFSTLSVTPTTEGL-------NTPTSPHSLSVASTSMPLMTVLPTTLEGTRpphTSVP-VTYTTTAATQTKSS 3075
Cdd:pfam05109 688 STSTHHVSTSSPAPRPGTTSQasgpgnsSTSTKPGEVNVTKGTPPKNATSPQAPSGQK---TAVPtVTSTGGKANSTTGG 764
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 3076 FSTDRTSAPHLSQPST------VTP----TQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPR 3133
Cdd:pfam05109 765 KHTTGHGARTSTEPTTdyggdsTTPrtryNATTYLPPSTSSKLRPRWTFTSPPVTTAQATVPVPPTSQ 832
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
3910-4187 |
6.09e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 43.00 E-value: 6.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3910 SSPQTPHSTHPISTAAISRTTGISGTPF----RTPMKTTITFP---TPSSLQTSMATLFPPFSTSVMSSTEIF--NTPTN 3980
Cdd:PHA03247 2567 SVPPPRPAPRPSEPAVTSRARRPDAPPQsarpRAPVDDRGDPRgpaPPSPLPPDTHAPDPPPPSPSPAANEPDphPPPTV 2646
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3981 PHSVSSASTSRPLSTSLPTTIKGTGTPQTPvsdinttSATTQA--HSSFPTTRTSTSHLSLPSSMTSTltPASRSASTLQ 4058
Cdd:PHA03247 2647 PPPERPRDDPAPGRVSRPRRARRLGRAAQA-------SSPPQRprRRAARPTVGSLTSLADPPPPPPT--PEPAPHALVS 2717
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4059 YTPTP----SSVSHSPLLTTPTASPPSSAPTFV--------SPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTT 4126
Cdd:PHA03247 2718 ATPLPpgpaAARQASPALPAAPAPPAVPAGPATpggparpaRPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1907182170 4127 SHVPTFSSFSSKSTTAHLTSLTTQAATSGLLSSTMGMTNLPSSGSPDINHTTRPPGSSPLP 4187
Cdd:PHA03247 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAP 2858
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
2527-2730 |
6.20e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.43 E-value: 6.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2527 AVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVTPTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQ 2606
Cdd:COG3469 12 AGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAAAAATS 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2607 TGSPHTSVPVIYTTSAITQTKTSFSTDRTSTSTSAPHlSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTSSS 2686
Cdd:COG3469 92 TSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSS-TAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTT 170
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1907182170 2687 PQTPRTTHPSTTVAVSGTVHTTGLPSGTSVQTTTNFPTHSGPQS 2730
Cdd:COG3469 171 TTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3776-3998 |
6.20e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.43 E-value: 6.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3776 TTTFPTPSAPQTSLVTSLPPFSTSSVSPTDEIHITSTNPHTVSSVSMSRPVSTILQTTIEVTTPPNTSTPVTHSTSATTE 3855
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3856 AQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSpqtphsthpiSTAAISRTTGISGT 3935
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTS----------GASATSSAGSTTTT 150
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 3936 PFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLP 3998
Cdd:COG3469 151 TTVSGTETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLP 213
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3825-4036 |
6.58e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.43 E-value: 6.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3825 PVSTILQTTIEVTTPPNTSTPVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPpvhT 3904
Cdd:COG3469 11 TAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTATATAAA---A 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3905 TSGTTSSPQTPHSTHPiSTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSV 3984
Cdd:COG3469 88 AATSTSATLVATSTAS-GANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTT 166
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1907182170 3985 SSASTSRPLSTSLPTTIKGTGTPQTPVSdinTTSATTQAHSSFPTTRTSTSH 4036
Cdd:COG3469 167 STTTTTTSASTTPSATTTATATTASGAT---TPSATTTATTTGPPTPGLPKH 215
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3085-3434 |
6.74e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.60 E-value: 6.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3085 HLSQPSTVTPTQSTPIPATTNSLMTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVavsntkhTTGVSLETSVQTTIAS 3164
Cdd:pfam05109 457 NLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAV-------TTPTPNATSPTPAVTT 529
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3165 PTPSAPQTSLATHLPfsstssvtptSEVIITPTPQHTLssaststtmgnilPTTIGQTGSPHTSVPVIYTTSTITQTKTS 3244
Cdd:pfam05109 530 PTPNATSPTLGKTSP----------TSAVTTPTPNATS-------------PTPAVTTPTPNATIPTLGKTSPTSAVTTP 586
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3245 FFTDRTSTSTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHTTG 3324
Cdd:pfam05109 587 TPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSH 666
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3325 LPSGTSVQTTTNFPTHSGPQSSLSTHlpLFSTLSVTPTTEGLNTQSTPIPATTNSLMTTVGLT-GTPPVHTTSGTTSSPQ 3403
Cdd:pfam05109 667 MPLLTSAHPTGGENITQVTPASTSTH--HVSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTkGTPPKNATSPQAPSGQ 744
|
330 340 350
....*....|....*....|....*....|....*
gi 1907182170 3404 TPRTTHPFSTVAVSNT----KHTTGVSLETSVQTT 3434
Cdd:pfam05109 745 KTAVPTVTSTGGKANSttggKHTTGHGARTSTEPT 779
|
|
| PLN02217 |
PLN02217 |
probable pectinesterase/pectinesterase inhibitor |
3999-4129 |
7.32e-03 |
|
probable pectinesterase/pectinesterase inhibitor
Pssm-ID: 215130 [Multi-domain] Cd Length: 670 Bit Score: 42.38 E-value: 7.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3999 TTIKGTGTPQTP--VSDINTTSATTQAHSSFPTTRTSTShlslpSSMTSTLTPAsrsastlqyTPTPSSVSHSPLLTTPT 4076
Cdd:PLN02217 548 AWIPGKGVPYIPglFAGNPGSTNSTPTGSAASSNTTFSS-----DSPSTVVAPS---------TSPPAGHLGSPPATPSK 613
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 4077 ASPPSSAPTFVSPTAASTVISSALPTIHMTPTPSSRPTSSTGLLSTSKTTSHV 4129
Cdd:PLN02217 614 IVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMV 666
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
1492-1907 |
7.59e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.45 E-value: 7.59e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1492 SITSSITQAQSSFSTDKTYTSQHSQPSTMTAHQSRSLPTVTTSTKSTMGLTGTPPVHTTSGTTSSPQ-TPRTTHPFSTVA 1570
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQgSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1571 VSNTKHTTgvsletsvqttIASPTPSAPQTSLATHLPFSSTSSVTPTSEVIITPTPQhtlssaSTSTTTGNILPTTIgQT 1650
Cdd:pfam03154 223 STAAPHTL-----------IQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQ------PSLHGQMPPMPHSL-QT 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1651 GSPHTSVPVIYTTSAITQTKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPtMSSTGTPVVHTT-SGTTSS 1729
Cdd:pfam03154 285 GPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQP-LPPAPLSMPHIKpPPTTPI 363
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1730 PQTPRT---THPSTTVAVSGTVHTTGLPSGTSVHTTTNFPTHSGPqsslSTHLPlfsTLSVTPTTEGLNTP-------TS 1799
Cdd:pfam03154 364 PQLPNPqshKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPP----SAHPP---PLQLMPQSQQLPPPpaqppvlTQ 436
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1800 PHSLSVASTSMPLMTVL-PTTLEGTRPPHTSVPVTYTTTAATQTKSSFSTdrTSTPHLSQSSTVTPTQSTPIPATTNSLM 1878
Cdd:pfam03154 437 SQSLPPPAASHPPTSGLhQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTS--SAMPGIQPPSSASVSSSGPVPAAVSCPL 514
|
410 420
....*....|....*....|....*....
gi 1907182170 1879 ttggltgtPPVHTNSGTTSSPQTPRTTHP 1907
Cdd:pfam03154 515 --------PPVQIKEEALDEAEEPESPPP 535
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
3740-4130 |
8.05e-03 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 42.45 E-value: 8.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3740 TSSIAHTPHTTHSLPTAASSSTTLSTAPQFRTSEQSTTTFPTPSAP-QTSLVTSLPPFSTSSVSPTDEihiTSTNPHTVS 3818
Cdd:pfam03154 144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPgTTQAATAGPTPSAPSVPPQGS---PATSQPPNQ 220
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3819 SVSMSRPVSTILQT-TIEVTTPPNTSTPVTHSTSATTEAQgsFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVT 3897
Cdd:pfam03154 221 TQSTAAPHTLIQQTpTLHPQRLPSPHPPLQPMTQPPPPSQ--VSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPF 298
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3898 GTPPVHTTSGTTSSPQT--PHSTHPISTAAISRTTGISGTPFR------TPMKTTITFPTPSSLQTSMATLFPPFSTSVM 3969
Cdd:pfam03154 299 PLTPQSSQSQVPPGPSPaaPGQSQQRIHTPPSQSQLQSQQPPReqplppAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHL 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3970 SSTEIFNTPTNphsVSSASTSRPLStSLPTTIKGTGTPqtPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTP 4049
Cdd:pfam03154 379 SGPSPFQMNSN---LPPPPALKPLS-SLSTHHPPSAHP--PPLQLMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSG 452
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 4050 ASRSASTLQYTPTPSSVSHSPLLTtptasPPSSAPTFVSPtaastvissALPTIHmtpTPSSRPTSSTGLLSTSKTTSHV 4129
Cdd:pfam03154 453 LHQVPSQSPFPQHPFVPGGPPPIT-----PPSGPPTSTSS---------AMPGIQ---PPSSASVSSSGPVPAAVSCPLP 515
|
.
gi 1907182170 4130 P 4130
Cdd:pfam03154 516 P 516
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
3825-4123 |
8.40e-03 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 42.21 E-value: 8.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3825 PVSTILQTTIEVT--TPPNTSTPVTHSTSATTEAQGSFSTERT-STSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPP 3901
Cdd:pfam05109 425 PESTTTSPTLNTTgfAAPNTTTGLPSSTHVPTNLTAPASTGPTvSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAP 504
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3902 VHTTSGTTSSPQTPHSTHPISTAAISRTTGISGTPFRTPMKTTITFPTPSSLQTSMATLFPPFSTSV-----MSSTEIFN 3976
Cdd:pfam05109 505 DMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIptlgkTSPTSAVT 584
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3977 TPTNPHSVSSASTSRPLSTSLPTTIKGTG-TPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSlPSSMTSTLTPASRSAS 4055
Cdd:pfam05109 585 TPTPNATSPTVGETSPQANTTNHTLGGTSsTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLR-PSSISETLSPSTSDNS 663
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1907182170 4056 TlqytptpssvSHSPLLTtptasppssaptfvsptaastvisSALPTIHMTPTPSSRPTSSTGLLSTS 4123
Cdd:pfam05109 664 T----------SHMPLLT------------------------SAHPTGGENITQVTPASTSTHHVSTS 697
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
1671-1890 |
9.81e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 9.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1671 TSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSIKPTMSSTGTPVVHTTSGTTSSPQTPRTTHPSTTVAVSGTVHT 1750
Cdd:COG3469 2 SSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTAT 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1751 TGLPSGTSVHTTTNFPTHSGPQSSLSTHLPLFSTLSVTPTTEGLNTPTSPHSLSVASTSMPLMTVLPTTLEGTRPPHTSV 1830
Cdd:COG3469 82 ATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATG 161
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 1831 PVTYTTTaatqtkSSFSTDRTSTPHLSQSSTVTPTQSTPIPATTNSLMTTGGLTGTPPVH 1890
Cdd:COG3469 162 GTTTTST------TTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPGLPKH 215
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
3845-4051 |
9.90e-03 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 42.05 E-value: 9.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3845 PVTHSTSATTEAQGSFSTERTSTSYLSHPSSTTVHQSTAGPVITSIKSTMGVTGTPPVHTTSGTTSSPQTPHSTHPISTA 3924
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 3925 AISRTTGISGTPFRTPMKTTitfpTPSSLQTSMATLFPPFSTSVMSSTEIFNTPTNPHSVSSASTSRPLSTSLPTTIKGT 4004
Cdd:COG3469 81 TATAAAAAATSTSATLVATS----TASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGT 156
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1907182170 4005 GTPQTPVSDINTTSATTQAHSSFPTTRTSTSHLSLPSSMTSTLTPAS 4051
Cdd:COG3469 157 ETATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTAT 203
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
2353-2695 |
9.95e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 42.29 E-value: 9.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2353 TKTSFSTDRTSTPTSAPHLSETSAVTAHQSTPTAVSANSikPTMSSTGTPVVHTTSGTTSSPQTPRT------THPSTTV 2426
Cdd:TIGR00927 73 MMVSSDPPKSSSEMEGEMLAPQATVGRDEATPSIAMENT--PSPPRRTAKITPTTPKNNYSPTAAGTervkedTPATPSR 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2427 AVSGTVHTTGLPSGTSVQTTT------NFPTHSGPQSSLSTHLPL--------FSTLSVTPTTEGLNTQSTpipATTNSL 2492
Cdd:TIGR00927 151 ALNHYISTSGRQRVKSYTPKPrgevksSSPTQTREKVRKYTPSPLgrmvnsyaPSTFMTMPRSHGITPRTT---VKDSEI 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2493 MTTGGLTGTPPVHTTSGTTSSPQTPRTTHPFSTVAVSNTKHTTGVSLETSVQTTIASPTPSAPQTSLATHLPFSSTSAVT 2572
Cdd:TIGR00927 228 TATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907182170 2573 PTSEVIITPTPQHTFSSASTSTTTGNILPTTIGQTG-------SPHTSVPVIYTTSAiTQTKTSFSTDRTSTSTSAPHLS 2645
Cdd:TIGR00927 308 TPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAawkirnpLSRTSAPAVRIASA-TFRGLEKNPSTAPSTPATPRVR 386
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|...
gi 1907182170 2646 ETSAVTAHQST---PTAVSANSIKPTMSSTGTPVVHTTSGTSSSPQTPrTTHP 2695
Cdd:TIGR00927 387 AVLTTQVHHCVvvkPAPAVPTTPSPSLTTALFPEAPSPSPSALPPGQP-DLHP 438
|
|
|