| Name | Accession | Description | Interval | E-value |
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 25-344 | 1.07e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 97.15 E-value: 1.07e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2462629226 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| Herpes_BLLF1 super family | cl37540 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 466-771 | 9.40e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo. The actual alignment was detected with superfamily member pfam05109:
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 9.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 466 TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG 545
Cdd:pfam05109 507 TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA 582
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 546 MaalaaaaaaTQKIPPSSAPTV-LSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVM-----VSNPATRMLKTAAAQVGT 619
Cdd:pfam05109 583 V---------TTPTPNATSPTVgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTtgqhnITSSSTSSMSLRPSSISE 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 620 SVSSAT---NTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISNLGKvmSVVQTKPVQTS 695
Cdd:pfam05109 654 TLSPSTsdnSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASGPGN--SSTSTKPGEVN 725
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462629226 696 AVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPSTTKPGTTTIIKT 771
Cdd:pfam05109 726 VTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPTTDYGGDSTTPRT 791
|
|
| PHA03247 super family | cl33720 | large tegument protein UL36; Provisional | 1128-1619 | 1.06e-03 |
|
large tegument protein UL36; Provisional. The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 1.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1128 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1203
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1204 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1275
Cdd:PHA03247 2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1276 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1351
Cdd:PHA03247 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1352 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1431
Cdd:PHA03247 2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1432 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPISSMTETAPRAL 1511
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1512 TTevPIPAKITVTIA-NTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1590
Cdd:PHA03247 3000 SL--SRVSSWASSLAlHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
|
490 500
....*....|....*....|....*....
gi 2462629226 1591 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1619
Cdd:PHA03247 3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1900-1929 | 2.88e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.88e-03
10 20 30
....*....|....*....|....*....|
gi 2462629226 1900 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1929
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1935-2040 | 7.71e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.86 E-value: 7.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1935 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 2013
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2462629226 2014 HIDYTtkpaiiFRIAARNEKGYGPATQ 2040
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
|
| Name | Accession | Description | Interval | E-value |
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 25-344 | 1.07e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 97.15 E-value: 1.07e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2462629226 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| PLN02193 | PLN02193 | nitrile-specifier protein | 27-322 | 5.12e-16 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 83.08 E-value: 5.12e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193 162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193 240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193 306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462629226 262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193 375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
|
|
| Kelch_1 | pfam01344 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 32-69 | 7.91e-05 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 41.83 E-value: 7.91e-05
10 20 30
....*....|....*....|....*....|....*....
gi 2462629226 32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 466-771 | 9.40e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 9.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 466 TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG 545
Cdd:pfam05109 507 TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA 582
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 546 MaalaaaaaaTQKIPPSSAPTV-LSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVM-----VSNPATRMLKTAAAQVGT 619
Cdd:pfam05109 583 V---------TTPTPNATSPTVgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTtgqhnITSSSTSSMSLRPSSISE 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 620 SVSSAT---NTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISNLGKvmSVVQTKPVQTS 695
Cdd:pfam05109 654 TLSPSTsdnSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASGPGN--SSTSTKPGEVN 725
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462629226 696 AVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPSTTKPGTTTIIKT 771
Cdd:pfam05109 726 VTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPTTDYGGDSTTPRT 791
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1128-1619 | 1.06e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 1.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1128 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1203
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1204 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1275
Cdd:PHA03247 2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1276 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1351
Cdd:PHA03247 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1352 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1431
Cdd:PHA03247 2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1432 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPISSMTETAPRAL 1511
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1512 TTevPIPAKITVTIA-NTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1590
Cdd:PHA03247 3000 SL--SRVSSWASSLAlHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
|
490 500
....*....|....*....|....*....
gi 2462629226 1591 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1619
Cdd:PHA03247 3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1900-1929 | 2.88e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.88e-03
10 20 30
....*....|....*....|....*....|
gi 2462629226 1900 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1929
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| FN3 | cd00063 | Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... | 1935-2040 | 7.71e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.86 E-value: 7.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1935 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 2013
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2462629226 2014 HIDYTtkpaiiFRIAARNEKGYGPATQ 2040
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
|
| Name | Accession | Description | Interval | E-value |
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 25-344 | 1.07e-21 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 97.15 E-value: 1.07e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 25 WS--GPVPRPRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQWfipaVRGDIPPGCAAYGFVC--DGTRLLVFGG 99
Cdd:COG3055 3 WSslPDLPTPRSEAAAALLDGKVYVAGGLSGGsASNSFEVYDPATNTW----SELAPLPGPPRHHAAAvaQDGKLYVFGG 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 100 MVEY---GKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLVGNKCYLFGGlandsedpkNNIPRYLNDLYIL 176
Cdd:COG3055 79 FTGAnpsSTPLNDVYVYDPATNTWTKL-------APMPTPRGGATALLLDGKIYVVGG---------WDDGGNVAWVEVY 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 177 ELRPGSgvvaWDipiTYGVLPPPRESHTAVVYTEkdnkkSKLVIYGGMSGCRLGDLWTldidtltwNKPSLsgvaPLPRS 256
Cdd:COG3055 143 DPATGT----WT---QLAPLPTPRDHLAAAVLPD-----GKILVIGGRNGSGFSNTWT--------TLAPL----PTARA 198
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 257 LHSATTIGNKMYVFGGwvplvmddvkvathekEWKCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAINTRLYI 336
Cdd:COG3055 199 GHAAAVLGGKILVFGG----------------ESGFSDEVEAYDPATNTWTAL------GELPTPRHGHAAVLTDGKVYV 256
|
330
....*....|
gi 2462629226 337 WSG--RDGYR 344
Cdd:COG3055 257 IGGetKPGVR 266
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 76-345 | 4.00e-17 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 83.67 E-value: 4.00e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 76 GDIP-PGCAAYGFVCDGtRLLVFGGMvEYGKYSNDLYELQASRWEWKRLkaktpknGPPPCPRLGHSFSLV-GNKCYLFG 153
Cdd:COG3055 7 PDLPtPRSEAAAALLDG-KVYVAGGL-SGGSASNSFEVYDPATNTWSEL-------APLPGPPRHHAAAVAqDGKLYVFG 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 154 GLandseDPKNNIPRYLNDLYILELRPGSgvvaWdipITYGVLPPPRESHTAVVYtekDNKKskLVIYGGMSGCRLGDLW 233
Cdd:COG3055 78 GF-----TGANPSSTPLNDVYVYDPATNT----W---TKLAPMPTPRGGATALLL---DGKI--YVVGGWDDGGNVAWVE 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 234 TLDIDTLTWNKPslsGVAPLPRSLHSATTIGN-KMYVFGGwvplvmddVKVATHEKEWkctntlaclnldtmawetilmd 312
Cdd:COG3055 141 VYDPATGTWTQL---APLPTPRDHLAAAVLPDgKILVIGG--------RNGSGFSNTW---------------------- 187
|
250 260 270
....*....|....*....|....*....|...
gi 2462629226 313 TLEDNIPRARAGHCAVAINTRLYIWSGRDGYRK 345
Cdd:COG3055 188 TTLAPLPTARAGHAAAVLGGKILVFGGESGFSD 220
|
|
| PLN02193 | PLN02193 | nitrile-specifier protein | 27-322 | 5.12e-16 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 83.08 E-value: 5.12e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 27 GPVPRPRHGHRAVAIKelIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGFVCDGTRLLVFGGMVE 102
Cdd:PLN02193 162 GPGLRCSHGIAQVGNK--IYSFGGEftpNQPIDKHLYVFDLETRTWSISPATGDVPHlSCLGVRMVSIGSTLYVFGGRDA 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 103 YGKYsNDLYELQASRWEWKRLkakTP-KNGppPCPRLGHSFSLVGNKCYLFGGLANDSEdpknniPRYLNDLYILELRpg 181
Cdd:PLN02193 240 SRQY-NGFYSFDTTTNEWKLL---TPvEEG--PTPRSFHSMAADEENVYVFGGVSATAR------LKTLDSYNIVDKK-- 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 182 sgvvaWDIPITygvlppPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSAT 261
Cdd:PLN02193 306 -----WFHCST------PGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASA 374
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462629226 262 TIGNKMYVFGGwvPLVMDDVkvaTHEKEWKCTNTLACLNLDTMAWETILMDTLEDNIPRAR 322
Cdd:PLN02193 375 AVGKHIVIFGG--EIAMDPL---AHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSR 430
|
|
| PLN02153 | PLN02153 | epithiospecifier protein | 12-330 | 3.41e-15 |
|
epithiospecifier protein
Pssm-ID: 177814 [Multi-domain] Cd Length: 341 Bit Score: 79.26 E-value: 3.41e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 12 AVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGG---NEGIVDELHVYNTATNQWFIPAVRGDIPP-GCAAYGF 87
Cdd:PLN02153 2 APTLQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGElkpNEHIDKDLYVFDFNTHTWSIAPANGDVPRiSCLGVRM 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 88 VCDGTRLLVFGGMVEYGKYsNDLYELQASRWEWKRLKAKTPKNGPPPcpRLGHSFSLVGNKCYLFGGLANDS-------- 159
Cdd:PLN02153 82 VAVGTKLYIFGGRDEKREF-SDFYSYDTVKNEWTFLTKLDEEGGPEA--RTFHSMASDENHVYVFGGVSKGGlmktperf 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 160 ----------------EDPKNNiprylndlyiLELRPGSG--VVAWDIPITYGVLpppreshTAVVYTEKDNKKSKLVIY 221
Cdd:PLN02153 159 rtieayniadgkwvqlPDPGEN----------FEKRGGAGfaVVQGKIWVVYGFA-------TSILPGGKSDYESNAVQF 221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 222 ggmsgcrlgdlwtLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWvplVMDDVKvaTHEKEWKCTNTLACLNL 301
Cdd:PLN02153 222 -------------FDPASGKWTEVETTGAKPSARSVFAHAVVGKYIIIFGGE---VWPDLK--GHLGPGTLSNEGYALDT 283
|
330 340
....*....|....*....|....*....
gi 2462629226 302 DTMAWETiLMDTLEDNIPRARAGHCAVAI 330
Cdd:PLN02153 284 ETLVWEK-LGECGEPAMPRGWTAYTTATV 311
|
|
| PLN02193 | PLN02193 | nitrile-specifier protein | 126-272 | 3.38e-08 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 58.43 E-value: 3.38e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 126 KTPKNGPPPCPRLGHSFSLVGNKCYLFGGlandSEDPKNNIPRYlndLYILELRPGSgvvaWDIPITYGVLPppresHTA 205
Cdd:PLN02193 155 KVEQKGEGPGLRCSHGIAQVGNKIYSFGG----EFTPNQPIDKH---LYVFDLETRT----WSISPATGDVP-----HLS 218
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462629226 206 VVYTEKDNKKSKLVIYGGMSGCR-LGDLWTLDIDTLTWNKPSLSGVAPLPRSLHSATTIGNKMYVFGG 272
Cdd:PLN02193 219 CLGVRMVSIGSTLYVFGGRDASRqYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGG 286
|
|
| PRK14131 | PRK14131 | N-acetylneuraminate epimerase; | 18-99 | 1.82e-05 |
|
N-acetylneuraminate epimerase;
Pssm-ID: 237617 [Multi-domain] Cd Length: 376 Bit Score: 49.24 E-value: 1.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 18 RWKRVVGWSGPvprPRHGHRAVAIKELIVVFGG----GNEG---IVDELHVYNTATNQWFIPAVRGdiPPGCA-AYGFVC 89
Cdd:PRK14131 63 GWTKIAAFPGG---PREQAVAAFIDGKLYVFGGigktNSEGspqVFDDVYKYDPKTNSWQKLDTRS--PVGLAgHVAVSL 137
|
90
....*....|
gi 2462629226 90 DGTRLLVFGG 99
Cdd:PRK14131 138 HNGKAYITGG 147
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 27-114 | 4.76e-05 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 47.46 E-value: 4.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 27 GPVPRPRHGHRAVAIKELIVVFGGGNeGIVDELHVYNTATNQWFipaVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKY 106
Cdd:COG3055 191 APLPTARAGHAAAVLGGKILVFGGES-GFSDEVEAYDPATNTWT---ALGELPTPRHGHAAVLTDGKVYVIGGETKPGVR 266
|
....*...
gi 2462629226 107 SNDLYELQ 114
Cdd:COG3055 267 TPLVTSAE 274
|
|
| PLN02193 | PLN02193 | nitrile-specifier protein | 242-393 | 7.72e-05 |
|
nitrile-specifier protein
Pssm-ID: 177844 [Multi-domain] Cd Length: 470 Bit Score: 47.64 E-value: 7.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 242 WNKPSLSGVAPLPRSLHSATTIGNKMYVFGG-WVPlvmdDVKVATHekewkctntLACLNLDTMAWEtilMDTLEDNIPR 320
Cdd:PLN02193 153 WIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGeFTP----NQPIDKH---------LYVFDLETRTWS---ISPATGDVPH 216
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 321 ARA-GHCAVAINTRLYIWSGRDGYRK--AWNNQVCCKDLWYLET--EKPPPPARVQLVRANTNSLEVSWGAVATA----- 390
Cdd:PLN02193 217 LSClGVRMVSIGSTLYVFGGRDASRQynGFYSFDTTTNEWKLLTpvEEGPTPRSFHSMAADEENVYVFGGVSATArlktl 296
|
...
gi 2462629226 391 DSY 393
Cdd:PLN02193 297 DSY 299
|
|
| Kelch_1 | pfam01344 | Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... | 32-69 | 7.91e-05 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 41.83 E-value: 7.91e-05
10 20 30
....*....|....*....|....*....|....*....
gi 2462629226 32 PRHGHRAVAIKELIVVFGGGNEG-IVDELHVYNTATNQW 69
Cdd:pfam01344 1 RRSGAGVVVVGGKIYVIGGFDGNqSLNSVEVYDPETNTW 39
|
|
| Herpes_BLLF1 | pfam05109 | Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... | 466-771 | 9.40e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.60 E-value: 9.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 466 TVPGSSISVPTAARTQGVPAVLKVTgPQATTGTPLVTMRPASQAGKAPVTVTSLPAgvrMVVPTQSAQGTVIGSSPQMSG 545
Cdd:pfam05109 507 TSPTSAVTTPTPNATSPTPAVTTPT-PNATSPTLGKTSPTSAVTTPTPNATSPTPA---VTTPTPNATIPTLGKTSPTSA 582
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 546 MaalaaaaaaTQKIPPSSAPTV-LSVPAGTTIVKTMAVTPGTTTLPATVKVASSPVM-----VSNPATRMLKTAAAQVGT 619
Cdd:pfam05109 583 V---------TTPTPNATSPTVgETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTtgqhnITSSSTSSMSLRPSSISE 653
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 620 SVSSAT---NTSTRPIIT-VHKSGTVTVAQQAQVVTTVVGGVTktitlvKSPISVPGGSALISNLGKvmSVVQTKPVQTS 695
Cdd:pfam05109 654 TLSPSTsdnSTSHMPLLTsAHPTGGENITQVTPASTSTHHVST------SSPAPRPGTTSQASGPGN--SSTSTKPGEVN 725
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462629226 696 AVTGQastgPVTQIIQTKGPLPAGTILKLVTSADGKPTTIITTTQASGAGTKptilgiSSVSPSTTKPGTTTIIKT 771
Cdd:pfam05109 726 VTKGT----PPKNATSPQAPSGQKTAVPTVTSTGGKANSTTGGKHTTGHGAR------TSTEPTTDYGGDSTTPRT 791
|
|
| Kelch_3 | pfam13415 | Galactose oxidase, central domain; | 264-330 | 1.20e-04 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 41.51 E-value: 1.20e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462629226 264 GNKMYVFGGWVPLVMDdvkvathekewkCTNTLACLNLDTMAWETIlmdtleDNIPRARAGHCAVAI 330
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT------------RLNDLYVYDLDTNTWTQI------GDLPPPRSGHSATYI 49
|
|
| NanM | COG3055 | N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis]; | 241-348 | 1.68e-04 |
|
N-acetylneuraminic acid mutarotase [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442289 [Multi-domain] Cd Length: 277 Bit Score: 45.92 E-value: 1.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 241 TWNK-PSLsgvaPLPRSLHSATTIGNKMYVFGGWvplvmddvkvatheKEWKCTNTLACLNLDTMAWETIlmdtleDNIP 319
Cdd:COG3055 2 TWSSlPDL----PTPRSEAAAALLDGKVYVAGGL--------------SGGSASNSFEVYDPATNTWSEL------APLP 57
|
90 100 110
....*....|....*....|....*....|
gi 2462629226 320 RARAGH-CAVAINTRLYIWSGRDGYRKAWN 348
Cdd:COG3055 58 GPPRHHaAAVAQDGKLYVFGGFTGANPSST 87
|
|
| Kelch_3 | pfam13415 | Galactose oxidase, central domain; | 215-263 | 4.01e-04 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 39.97 E-value: 4.01e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2462629226 215 KSKLVIYGG---MSGCRLGDLWTLDIDTLTWNKPslsGVAPLPRSLHSATTI 263
Cdd:pfam13415 1 GDKLYIFGGlgfDGQTRLNDLYVYDLDTNTWTQI---GDLPPPRSGHSATYI 49
|
|
| PHA03247 | PHA03247 | large tegument protein UL36; Provisional | 1128-1619 | 1.06e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 1.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1128 GANHQRDARRACAAGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLG----PSMAREPGG 1203
Cdd:PHA03247 2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSladpPPPPPTPEP 2710
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1204 RSPAFVQLAPL---SSKVRLSSPSIKDLPAGRHSHA-----VSTAAMTRSSVGAGEPRMAPVCESLQGGSPSTTVTVTAL 1275
Cdd:PHA03247 2711 APHALVSATPLppgPAAARQASPALPAAPAPPAVPAgpatpGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVAS 2790
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1276 EALLCPSATVTQVCSNPPCETHETGTTNTATTSNAG----SAQRVCSNPPCETHETGTTHTATTATSNGGTGQPEGGQQP 1351
Cdd:PHA03247 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGplppPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRS 2870
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1352 PAGRPcethqttstgttmsvsvgallpdATSSHRTVESglevAAAPSVTPQAGTALLAPFPTQRvcsnPPCETHETGTTH 1431
Cdd:PHA03247 2871 PAAKP-----------------------AAPARPPVRR----LARPAVSRSTESFALPPDQPER----PPQPQAPPPPQP 2919
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1432 TATTVTSNMSSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPISSMTETAPRAL 1511
Cdd:PHA03247 2920 QPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1512 TTevPIPAKITVTIA-NTETSDMPFSAVDILQPPEELQvspgprqqlpprqllQSASTALMGESAEVLSASQTPELPAAV 1590
Cdd:PHA03247 3000 SL--SRVSSWASSLAlHEETDPPPVSLKQTLWPPDDTE---------------DSDADSLFDSDSERSDLEALDPLPPEP 3062
|
490 500
....*....|....*....|....*....
gi 2462629226 1591 DLSSTGEPSSGQESAGSAVVATVVVQPPP 1619
Cdd:PHA03247 3063 HDPFAHEPDPATPEAGARESPSSQFGPPP 3091
|
|
| Kelch_4 |
pfam13418 |
Galactose oxidase, central domain; |
32-80 |
1.37e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433191 [Multi-domain] Cd Length: 49 Bit Score: 38.36 E-value: 1.37e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|..
gi 2462629226 32 PRHGHRAVAIKE-LIVVFGG--GNEGIVDELHVYNTATNQWfipAVRGDIPP 80
Cdd:pfam13418 1 PRAYHTSTSIPDdTIYLFGGegEDGTLLSDLWVFDLSTNEW---TRLGSLPS 49
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
1468-1641 |
2.05e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 43.54 E-value: 2.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1468 SSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPISSMT---ETAPRALTTEVPI-PAKITVTIANTETSDMPFSAVDILQP 1543
Cdd:PRK10263 321 AVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTvawQPVPGPQTGEPVIaPAPEGYPQQSQYAQPAVQYNEPLQQP 400
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1544 PEELQVSPGPRQQLPPRQLLQSASTAlmgESAEVLSASQTPELPAAVDLSSTGEPSSGQESAGSAVVATVVVQPPPPTQS 1623
Cdd:PRK10263 401 VQPQQPYYAPAAEQPAQQPYYAPAPE---QPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPL 477
|
170
....*....|....*...
gi 2462629226 1624 EVDQLSLPQELMAEAQAG 1641
Cdd:PRK10263 478 YQQPQPVEQQPVVEPEPV 495
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
30-67 |
2.65e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.54 E-value: 2.65e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2462629226 30 PRPRHGHRAVAIKELIVVFGG---GNEGIVDELHVYNTATN 67
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGytgGEGQPSDDVYVLSLPTF 41
|
|
| Kelch_5 |
pfam13854 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
319-351 |
2.84e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 433528 [Multi-domain] Cd Length: 41 Bit Score: 37.16 E-value: 2.84e-03
10 20 30
....*....|....*....|....*....|...
gi 2462629226 319 PRARAGHCAVAINTRLYIWSGRDGYRKAWNNQV 351
Cdd:pfam13854 1 PVPRYGHCAVTVGDYIYLYGGYTGGEGQPSDDV 33
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1900-1929 |
2.88e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 39.02 E-value: 2.88e-03
10 20 30
....*....|....*....|....*....|
gi 2462629226 1900 LQPGTAYKFRVAGINACGRGPFSEISAFKT 1929
Cdd:cd00063 64 LKPGTEYEFRVRAVNGGGESPPSESVTVTT 93
|
|
| FN3 |
cd00063 |
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ... |
1935-2040 |
7.71e-03 |
|
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Pssm-ID: 238020 [Multi-domain] Cd Length: 93 Bit Score: 37.86 E-value: 7.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462629226 1935 PGAPCAIKISK-SPDGAHLTWEPPSVTSGKIIEYSVylaiqssqaggELKSSTPAQLAFMRVYCGPSPSCLVqsSSLSnA 2013
Cdd:cd00063 1 PSPPTNLRVTDvTSTSVTLSWTPPEDDGGPITGYVV-----------EYREKGSGDWKEVEVTPGSETSYTL--TGLK-P 66
|
90 100
....*....|....*....|....*..
gi 2462629226 2014 HIDYTtkpaiiFRIAARNEKGYGPATQ 2040
Cdd:cd00063 67 GTEYE------FRVRAVNGGGESPPSE 87
|
|
| Kelch_3 |
pfam13415 |
Galactose oxidase, central domain; |
146-208 |
9.57e-03 |
|
Galactose oxidase, central domain;
Pssm-ID: 433188 [Multi-domain] Cd Length: 49 Bit Score: 36.11 E-value: 9.57e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462629226 146 GNKCYLFGGLANDSEDpknniprYLNDLYilELRPGSGVVAwdipiTYGVLPPPRESHTAVVY 208
Cdd:pfam13415 1 GDKLYIFGGLGFDGQT-------RLNDLY--VYDLDTNTWT-----QIGDLPPPRSGHSATYI 49
|
|
| Kelch_1 |
pfam01344 |
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six ... |
254-273 |
9.74e-03 |
|
Kelch motif; The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that Swiss:Q04652 is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Pssm-ID: 396078 [Multi-domain] Cd Length: 46 Bit Score: 36.05 E-value: 9.74e-03
|
|