| Name | Accession | Description | Interval | E-value |
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 883-1222 | 2.19e-23 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 108.17 E-value: 2.19e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 883 GTLSTSVSFGGPSSTSANCGGTLSTSIcfdgspSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFG 962
Cdd:NF033849 217 GQKSISFGVSLPMMYAANLGQSAGTGY------GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 963 GSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1042
Cdd:NF033849 291 TSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHS 370
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1043 GVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFG-GAHSTSLCFGGAPSTSLCFGSASNTnlcf 1121
Cdd:NF033849 371 TSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGTSSGHSDSS---- 446
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1122 ggppSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGF 1201
Cdd:NF033849 447 ----SHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGT 522
|
330 340
....*....|....*....|.
gi 967505707 1202 SGGPGTSTGFGGGLGTSAGFS 1222
Cdd:NF033849 523 SGGRTSGAGGSMGLGPSISLG 543
|
|
| MAGE | pfam01454 | MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... | 407-567 | 6.61e-23 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 98.11 E-value: 6.61e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 407 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYALEKMFRVNLKEID--------------------KQSS 465
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 466 LYILIST---RESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVRHSLFG 530
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 967505707 531 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 567
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 1016-1383 | 4.13e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.46 E-value: 4.13e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1016 NTNASFGcaISTSAGFSGAVGTSAGfsgvpstnPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAH 1095
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSAG--------TGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1096 STSLcfggapSTSLCFGSASNTnlcfggppSTSACFSGATSpsfgDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGL 1175
Cdd:NF033849 288 TQST------SESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1176 GTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGlgTSAGFSGGLGTGAgfggglVTSDGFGGGLGTNASFGSTL 1255
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIAGGG------VTSEGLGASQGGSEGWGSGD 421
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1256 GtGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSI 1335
Cdd:NF033849 422 S-VQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSE 500
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 967505707 1336 SgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGfgsGATSLGAcGFSYG 1383
Cdd:NF033849 501 S-VSQGDGRSTGRSESQGTSLGTSGGRTSGAG---GSMGLGP-SISLG 543
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 816-1182 | 5.80e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.08 E-value: 5.80e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 816 STSTSFG-SAPTtntVFSSAL--STSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGgspcTSTGFGGTLSTSVSFG 892
Cdd:NF033849 218 QKSISFGvSLPM---MYAANLgqSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQS----HTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 893 GPSSTSANCGGTLSTSIcfdgspstgagfggALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFG 972
Cdd:NF033849 291 TSESESTGQSSSVGTSE--------------SQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 973 SALNTSAGFGGAVSTSTDFGGTLSTSVCFggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVpSTNPGFG 1052
Cdd:NF033849 357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSV-SQSYGSS 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1053 GAFNTSAGfggaLSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSlcfgsasntnlcfgGPPSTSACFS 1132
Cdd:NF033849 434 SSTGTSSG----HSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS--------------QSETDSVGDS 495
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 967505707 1133 GATSPSFGDGPSTSTGFSfgNGLSTSAGFGGGLNTSAGFGGGLGTSAGFS 1182
Cdd:NF033849 496 TGTSESVSQGDGRSTGRS--ESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 724-1042 | 8.53e-19 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 93.15 E-value: 8.53e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 724 TASISFGGTLSTSSSFSSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGG 803
Cdd:NF033849 218 QKSISFGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 804 TLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 883
Cdd:NF033849 298 GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSS 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 884 TLSTSVSFGgpSSTSANCGGTLStsicfdGSPSTGAGFGGALNTSASFGSA-----LNTSAGFGGAMSTSADFGSTLSTS 958
Cdd:NF033849 378 SESSSRSSS--SGVSGGFSGGIA------GGGVTSEGLGASQGGSEGWGSGdsvqsVSQSYGSSSSTGTSSGHSDSSSHS 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 959 VCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1038
Cdd:NF033849 450 TSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
....
gi 967505707 1039 AGFS 1042
Cdd:NF033849 530 AGGS 533
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 665-1041 | 1.74e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 85.44 E-value: 1.74e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 665 ARAQENadasTSVNFSRGAGTRA---GFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSS 741
Cdd:NF033849 202 EAAAEE----TSNWASRQQGQKSisfGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTT 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 742 AASISFGGAPSTSTSFSSEASISfggtpctsASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSF 821
Cdd:NF033849 278 GHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 822 GSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFGgpSSTSANc 901
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWG--SGDSVQ- 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 902 ggtlSTSICFDGSPSTGAGFGGALNTSASFGSAlnTSAGFGGAMSTSADFGSTLSTSVcfggspGTSVSFGSALNTSAGF 981
Cdd:NF033849 425 ----SVSQSYGSSSSTGTSSGHSDSSSHSTSSG--QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 982 GGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFgcaistSAGFSGAVGTSAGF 1041
Cdd:NF033849 493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGG------SMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 669-1042 | 3.04e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 84.67 E-value: 3.04e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 669 ENADASTSVNFSRGAGTRAGFSDGASISFngapSSSGGPGITFGGAPSSSASFSNTASISfggtlstsSSFSSAASISFG 748
Cdd:NF033849 245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQ----SHTTGHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHG 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 749 GAPSTSTSFSSEASISFGGTPCTSASFsggvsssfsgplNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTN 828
Cdd:NF033849 313 TTEGTSTTDSSSHSQSSSYNVSSGTGV------------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 829 TVFSSALSTSTGFGGTLstsvcfggspsssgsfggtlstsicfGGSPCTSTGFGGTLSTSVSFGgpSStsancggtlsts 908
Cdd:NF033849 381 SSRSSSSGVSGGFSGGI--------------------------AGGGVTSEGLGASQGGSEGWG--SG------------ 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 909 icfDGSPSTGAGFGGALNTSASFGSALNTSagfggaMSTSADFGSTLSTSVcfGGSPGTSVSFGSALNTSAGFGGAVSTS 988
Cdd:NF033849 421 ---DSVQSVSQSYGSSSSTGTSSGHSDSSS------HSTSSGQADSVSQGT--SWSEGTGTSQGQSVGTSESWSTSQSET 489
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 967505707 989 TDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1042
Cdd:NF033849 490 DSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| FhaB | COG3210 | Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... | 660-1377 | 1.79e-15 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 82.51 E-value: 1.79e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 660 SFEIEARAQENADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSF 739
Cdd:COG3210 795 SIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATA 874
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 740 SSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTST 819
Cdd:COG3210 875 ASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGL 954
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 820 SFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSA 899
Cdd:COG3210 955 SAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGT 1034
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 900 NCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSA 979
Cdd:COG3210 1035 GTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTG 1114
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 980 GFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSA 1059
Cdd:COG3210 1115 GVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTA 1194
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1060 GFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSF 1139
Cdd:COG3210 1195 GTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVA 1274
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1140 GDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1219
Cdd:COG3210 1275 GNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNG 1354
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1220 GFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGF 1299
Cdd:COG3210 1355 GNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGT 1434
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 967505707 1300 IGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1377
Cdd:COG3210 1435 GGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTA 1512
|
|
| FhaB | COG3210 | Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... | 670-1297 | 7.69e-12 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 70.57 E-value: 7.69e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 670 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 749
Cdd:COG3210 119 TAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVG 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 750 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 829
Cdd:COG3210 199 GALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVT 278
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 830 VFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSI 909
Cdd:COG3210 279 GTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTG 358
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 910 CFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTST 989
Cdd:COG3210 359 AGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGN 438
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 990 DFGGTLSTSVCFGGSPSTSAGFSGALNTNASfgcaisTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTT 1069
Cdd:COG3210 439 GTVTGGTIGGLTGSGTTNGAGLSGNTDVSGT------GTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIAT 512
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1070 DFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGF 1149
Cdd:COG3210 513 GLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGT 592
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1150 SFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGA 1229
Cdd:COG3210 593 GTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGG 672
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 967505707 1230 GFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 1297
Cdd:COG3210 673 GTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
|
|
| PTZ00395 | PTZ00395 | Sec24-related protein; Provisional | 961-1183 | 7.10e-08 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 57.39 E-value: 7.10e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 961 FGGSPGTSVSFGSALNTSAGFGGAVStstdfGGTLSTSvcfggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAG 1040
Cdd:PTZ00395 339 YGGFHDGSPNAASAGAPFNGLGNQAD-----GGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1041 FSGVPSTNPGfggafNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLC 1120
Cdd:PTZ00395 409 FSNAGYSNPG-----NSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSA 483
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 967505707 1121 FggppSTSACFSGATSPSfGDGP----STSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSG 1183
Cdd:PTZ00395 484 Y----HAAYQHRAANQPA-ANLPtanqPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
|
|
| Nucleoporin_FG2 | pfam15967 | Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... | 1022-1219 | 3.01e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 51.59 E-value: 3.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1022 GCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFntsaGFGGALSTTTDFGGTPNNSIGFGAAPstsvsFGGAHSTSLCF 1101
Cdd:pfam15967 3 GFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTF 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1102 GGAPSTSlcfGSASNTNLCFGGPPSTSACFSG-----------ATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAG 1170
Cdd:pfam15967 74 GTPASST---AATGPTGLTLGTPAATTAASTGfslgfnkpaasATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLN 150
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 967505707 1171 FGGGLGTSAGFSGDL---STSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1219
Cdd:pfam15967 151 LGGTPATTTAVSTGLslgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTA 202
|
|
| Nucleoporin_FG2 | pfam15967 | Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... | 679-883 | 1.96e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 45.81 E-value: 1.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 679 FSRGAGTRAGFSDGASISFNGAPSS----SGGPGI-TFGGAPSSSASfSNTASISFGGTLSTSSSFSSAASisfgGAPST 753
Cdd:pfam15967 4 FSFGGGPGSTATAGGGFSFGAAAASnpgsTGGFSFgTLGAAPAATAT-TTTATLGLGGGLFGQKPATGFTF----GTPAS 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 754 STSFSSEASISfGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSttagFSSVLSTSTS----------FGS 823
Cdd:pfam15967 79 STAATGPTGLT-LGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLS----LGSVLTSTAAqqgatgftlnLGG 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 967505707 824 APTTNTVFSSAL---STSTGFGGTLSTSVcFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 883
Cdd:pfam15967 154 TPATTTAVSTGLslgSTLTSLGGSLFQNT-NSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| PPE | COG5651 | PPE-repeat protein [Function unknown]; | 1152-1370 | 7.47e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 43.73 E-value: 7.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1152 GNGLSTSAGFGGGLNTSAGFGgglgtSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGF 1231
Cdd:COG5651 178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1232 GGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHS 1311
Cdd:COG5651 253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGA 332
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 967505707 1312 GPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGfgggpnTGAGFGGGPSTSAGFGS 1370
Cdd:COG5651 333 AAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGG------GSAGAAAGAASGGGAAA 385
|
|
| dermokine | cd21118 | dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... | 1173-1376 | 6.01e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 40.75 E-value: 6.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1173 GGLGTSAGFSGDLSTSSGFDGGLGTSaGFSGGPGTSTGFGGGLGTsaGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFG 1252
Cdd:cd21118 125 GGHGAYGSQGGPGVQGHGIPGGTGGP-WASGGNYGTNSLGGSVGQ--GGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1253 STLGTGAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSivGFSGGPSTGVGFCSG 1332
Cdd:cd21118 202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 967505707 1333 PSISGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGATSLG 1376
Cdd:cd21118 279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
|
|
| Nucleoporin_FG2 | pfam15967 | Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... | 1147-1383 | 6.80e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 40.81 E-value: 6.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1147 TGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFsgdlstssGFDGGLGTSAGFSGGPGTSTGFGGGL---GTSAGFSg 1223
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF--------SFGTLGAAPAATATTTTATLGLGGGLfgqKPATGFT- 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1224 glgtgagfggglvtsdgFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFG---SRPNAS---FDRGLSTIIGFGSGSNTST 1297
Cdd:pfam15967 73 -----------------FGTPASSTAATGPTGLTLGTPAATTAASTGFSlgfNKPAASatpFSLPASSTSGGGLSLGSVL 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1298 GFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFcsGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1377
Cdd:pfam15967 136 TSTAAQQGATGFTLNLGGTPATTTAVSTGLSL--GSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGL 213
|
....*.
gi 967505707 1378 CGFSYG 1383
Cdd:pfam15967 214 GGLDFS 219
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
883-1222 |
2.19e-23 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 108.17 E-value: 2.19e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 883 GTLSTSVSFGGPSSTSANCGGTLSTSIcfdgspSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFG 962
Cdd:NF033849 217 GQKSISFGVSLPMMYAANLGQSAGTGY------GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 963 GSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1042
Cdd:NF033849 291 TSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHS 370
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1043 GVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFG-GAHSTSLCFGGAPSTSLCFGSASNTnlcf 1121
Cdd:NF033849 371 TSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGTSSGHSDSS---- 446
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1122 ggppSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGF 1201
Cdd:NF033849 447 ----SHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGT 522
|
330 340
....*....|....*....|.
gi 967505707 1202 SGGPGTSTGFGGGLGTSAGFS 1222
Cdd:NF033849 523 SGGRTSGAGGSMGLGPSISLG 543
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
407-567 |
6.61e-23 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 98.11 E-value: 6.61e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 407 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYALEKMFRVNLKEID--------------------KQSS 465
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 466 LYILIST---RESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVRHSLFG 530
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 967505707 531 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 567
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1016-1383 |
4.13e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.46 E-value: 4.13e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1016 NTNASFGcaISTSAGFSGAVGTSAGfsgvpstnPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAH 1095
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSAG--------TGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1096 STSLcfggapSTSLCFGSASNTnlcfggppSTSACFSGATSpsfgDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGL 1175
Cdd:NF033849 288 TQST------SESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1176 GTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGlgTSAGFSGGLGTGAgfggglVTSDGFGGGLGTNASFGSTL 1255
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIAGGG------VTSEGLGASQGGSEGWGSGD 421
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1256 GtGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSI 1335
Cdd:NF033849 422 S-VQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSE 500
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 967505707 1336 SgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGfgsGATSLGAcGFSYG 1383
Cdd:NF033849 501 S-VSQGDGRSTGRSESQGTSLGTSGGRTSGAG---GSMGLGP-SISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
816-1182 |
5.80e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.08 E-value: 5.80e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 816 STSTSFG-SAPTtntVFSSAL--STSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGgspcTSTGFGGTLSTSVSFG 892
Cdd:NF033849 218 QKSISFGvSLPM---MYAANLgqSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQS----HTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 893 GPSSTSANCGGTLSTSIcfdgspstgagfggALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFG 972
Cdd:NF033849 291 TSESESTGQSSSVGTSE--------------SQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 973 SALNTSAGFGGAVSTSTDFGGTLSTSVCFggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVpSTNPGFG 1052
Cdd:NF033849 357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSV-SQSYGSS 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1053 GAFNTSAGfggaLSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSlcfgsasntnlcfgGPPSTSACFS 1132
Cdd:NF033849 434 SSTGTSSG----HSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS--------------QSETDSVGDS 495
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 967505707 1133 GATSPSFGDGPSTSTGFSfgNGLSTSAGFGGGLNTSAGFGGGLGTSAGFS 1182
Cdd:NF033849 496 TGTSESVSQGDGRSTGRS--ESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
724-1042 |
8.53e-19 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 93.15 E-value: 8.53e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 724 TASISFGGTLSTSSSFSSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGG 803
Cdd:NF033849 218 QKSISFGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 804 TLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 883
Cdd:NF033849 298 GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSS 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 884 TLSTSVSFGgpSSTSANCGGTLStsicfdGSPSTGAGFGGALNTSASFGSA-----LNTSAGFGGAMSTSADFGSTLSTS 958
Cdd:NF033849 378 SESSSRSSS--SGVSGGFSGGIA------GGGVTSEGLGASQGGSEGWGSGdsvqsVSQSYGSSSSTGTSSGHSDSSSHS 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 959 VCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1038
Cdd:NF033849 450 TSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
....
gi 967505707 1039 AGFS 1042
Cdd:NF033849 530 AGGS 533
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
665-1041 |
1.74e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 85.44 E-value: 1.74e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 665 ARAQENadasTSVNFSRGAGTRA---GFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSS 741
Cdd:NF033849 202 EAAAEE----TSNWASRQQGQKSisfGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTT 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 742 AASISFGGAPSTSTSFSSEASISfggtpctsASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSF 821
Cdd:NF033849 278 GHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 822 GSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFGgpSSTSANc 901
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWG--SGDSVQ- 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 902 ggtlSTSICFDGSPSTGAGFGGALNTSASFGSAlnTSAGFGGAMSTSADFGSTLSTSVcfggspGTSVSFGSALNTSAGF 981
Cdd:NF033849 425 ----SVSQSYGSSSSTGTSSGHSDSSSHSTSSG--QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 982 GGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFgcaistSAGFSGAVGTSAGF 1041
Cdd:NF033849 493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGG------SMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
669-1042 |
3.04e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 84.67 E-value: 3.04e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 669 ENADASTSVNFSRGAGTRAGFSDGASISFngapSSSGGPGITFGGAPSSSASFSNTASISfggtlstsSSFSSAASISFG 748
Cdd:NF033849 245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQ----SHTTGHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHG 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 749 GAPSTSTSFSSEASISFGGTPCTSASFsggvsssfsgplNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTN 828
Cdd:NF033849 313 TTEGTSTTDSSSHSQSSSYNVSSGTGV------------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 829 TVFSSALSTSTGFGGTLstsvcfggspsssgsfggtlstsicfGGSPCTSTGFGGTLSTSVSFGgpSStsancggtlsts 908
Cdd:NF033849 381 SSRSSSSGVSGGFSGGI--------------------------AGGGVTSEGLGASQGGSEGWG--SG------------ 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 909 icfDGSPSTGAGFGGALNTSASFGSALNTSagfggaMSTSADFGSTLSTSVcfGGSPGTSVSFGSALNTSAGFGGAVSTS 988
Cdd:NF033849 421 ---DSVQSVSQSYGSSSSTGTSSGHSDSSS------HSTSSGQADSVSQGT--SWSEGTGTSQGQSVGTSESWSTSQSET 489
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 967505707 989 TDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1042
Cdd:NF033849 490 DSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
660-1377 |
1.79e-15 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 82.51 E-value: 1.79e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 660 SFEIEARAQENADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSF 739
Cdd:COG3210 795 SIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATA 874
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 740 SSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTST 819
Cdd:COG3210 875 ASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGL 954
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 820 SFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSA 899
Cdd:COG3210 955 SAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGT 1034
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 900 NCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSA 979
Cdd:COG3210 1035 GTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTG 1114
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 980 GFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSA 1059
Cdd:COG3210 1115 GVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTA 1194
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1060 GFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSF 1139
Cdd:COG3210 1195 GTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVA 1274
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1140 GDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1219
Cdd:COG3210 1275 GNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNG 1354
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1220 GFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGF 1299
Cdd:COG3210 1355 GNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGT 1434
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 967505707 1300 IGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1377
Cdd:COG3210 1435 GGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTA 1512
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
670-1297 |
7.69e-12 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 70.57 E-value: 7.69e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 670 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 749
Cdd:COG3210 119 TAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVG 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 750 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 829
Cdd:COG3210 199 GALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVT 278
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 830 VFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSI 909
Cdd:COG3210 279 GTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTG 358
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 910 CFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTST 989
Cdd:COG3210 359 AGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGN 438
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 990 DFGGTLSTSVCFGGSPSTSAGFSGALNTNASfgcaisTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTT 1069
Cdd:COG3210 439 GTVTGGTIGGLTGSGTTNGAGLSGNTDVSGT------GTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIAT 512
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1070 DFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGF 1149
Cdd:COG3210 513 GLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGT 592
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1150 SFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGA 1229
Cdd:COG3210 593 GTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGG 672
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 967505707 1230 GFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 1297
Cdd:COG3210 673 GTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
|
|
| AidA |
COG3468 |
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ... |
823-1263 |
9.73e-11 |
|
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442691 [Multi-domain] Cd Length: 846 Bit Score: 66.51 E-value: 9.73e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 823 SAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCG 902
Cdd:COG3468 1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 903 GTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGspGTSVSFGSALNTSAGFG 982
Cdd:COG3468 81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGG--GGGGGTGVGGTGAAAAG 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 983 GAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGcaISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFG 1062
Cdd:COG3468 159 GGTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGG--GGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1063 GALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGApstslcFGSASNTNLCFGGPPSTSACFSGATSPSFGDG 1142
Cdd:COG3468 237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGG------GANGGGSGGGGGASGTGGGGTASTGGGGGGGG 310
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1143 PSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFS 1222
Cdd:COG3468 311 GNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDG 390
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 967505707 1223 GGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSG 1263
Cdd:COG3468 391 VGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
872-1354 |
2.08e-10 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 65.57 E-value: 2.08e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 872 GGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADF 951
Cdd:COG4625 19 GGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGG 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 952 GSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGF 1031
Cdd:COG4625 99 GGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGG 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1032 SGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTslcfGGAPSTSLCF 1111
Cdd:COG4625 179 GGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG----GGGAGGGGGG 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1112 GSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGF 1191
Cdd:COG4625 255 GGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGA 334
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1192 DGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGF 1271
Cdd:COG4625 335 GGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAG 414
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1272 GSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGG 1351
Cdd:COG4625 415 GGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
|
...
gi 967505707 1352 PNT 1354
Cdd:COG4625 495 NYT 497
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
788-1264 |
3.29e-10 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 64.80 E-value: 3.29e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 788 NTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLST 867
Cdd:COG4625 18 GGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGG 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 868 SICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMST 947
Cdd:COG4625 98 GGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGG 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 948 SADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAIST 1027
Cdd:COG4625 178 GGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGG 257
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1028 SAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPST 1107
Cdd:COG4625 258 NGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1108 SLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLST 1187
Cdd:COG4625 338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 967505707 1188 SSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGG 1264
Cdd:COG4625 418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
719-1220 |
9.28e-10 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 63.26 E-value: 9.28e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 719 ASFSNTASISFGGTLSTSSSFSSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAAS 798
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 799 SGFGGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTS 878
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 879 TGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTS 958
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 959 VCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1038
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1039 AGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTN 1118
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1119 LCFGGPpSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTS 1198
Cdd:COG4625 401 GGGGAG-GTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLT 479
|
490 500
....*....|....*....|..
gi 967505707 1199 AGFSGGPGTSTGFGGGLGTSAG 1220
Cdd:COG4625 480 GNNTYTGTTTVNGGGNYTQSAG 501
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
748-1222 |
6.99e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 60.56 E-value: 6.99e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 748 GGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTT 827
Cdd:COG4625 10 GGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGG 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 828 NTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLST 907
Cdd:COG4625 90 TGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 908 SICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVcfGGSPGTSVSFGSALNTSAGFGGAVST 987
Cdd:COG4625 170 GGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGG 247
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 988 STDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALST 1067
Cdd:COG4625 248 AGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 327
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1068 TTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTST 1147
Cdd:COG4625 328 GGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 967505707 1148 GFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFS 1222
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNN 482
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
961-1183 |
7.10e-08 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 57.39 E-value: 7.10e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 961 FGGSPGTSVSFGSALNTSAGFGGAVStstdfGGTLSTSvcfggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAG 1040
Cdd:PTZ00395 339 YGGFHDGSPNAASAGAPFNGLGNQAD-----GGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1041 FSGVPSTNPGfggafNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLC 1120
Cdd:PTZ00395 409 FSNAGYSNPG-----NSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSA 483
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 967505707 1121 FggppSTSACFSGATSPSfGDGP----STSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSG 1183
Cdd:PTZ00395 484 Y----HAAYQHRAANQPA-ANLPtanqPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
802-1383 |
1.89e-07 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 55.93 E-value: 1.89e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 802 GGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGF 881
Cdd:COG5295 2 ASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAAS 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 882 GGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCF 961
Cdd:COG5295 82 SVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATG 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 962 GGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGF 1041
Cdd:COG5295 162 SSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASA 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1042 SGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFG---AAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTN 1118
Cdd:COG5295 242 GAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGtatAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAA 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1119 LCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTS 1198
Cdd:COG5295 322 ALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGS 401
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1199 AGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNAS 1278
Cdd:COG5295 402 STGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAA 481
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1279 FDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGGPNTGAGF 1358
Cdd:COG5295 482 TSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNN 561
|
570 580
....*....|....*....|....*
gi 967505707 1359 GGGPSTSAGFGSGATSLGACGFSYG 1383
Cdd:COG5295 562 TATGANSVALGAGSVASGANSVSVG 586
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
670-1208 |
2.07e-06 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 52.47 E-value: 2.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 670 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 749
Cdd:COG4625 6 GGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGG 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 750 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 829
Cdd:COG4625 86 GGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGG 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 830 VFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSI 909
Cdd:COG4625 166 GGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 245
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 910 CFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTST 989
Cdd:COG4625 246 GGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 325
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 990 DFGGTLSTSVCFGGSPSTSAGFSGALNTNASfgcAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTT 1069
Cdd:COG4625 326 GGGGGGGGAGGGGGSGGAGAGGGGAGGGGAG---GGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGG 402
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1070 DFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGF 1149
Cdd:COG4625 403 GGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNN 482
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*....
gi 967505707 1150 SFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTS 1208
Cdd:COG4625 483 TYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTT 541
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
934-1148 |
2.19e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.06 E-value: 2.19e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 934 ALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSG 1013
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1014 ALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTnpgfggafnTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGG 1093
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTT---------TSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTT 151
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 967505707 1094 AHSTSLCF-GGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTG 1148
Cdd:COG3469 152 TVSGTETAtGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGP 207
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1022-1219 |
3.01e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 51.59 E-value: 3.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1022 GCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFntsaGFGGALSTTTDFGGTPNNSIGFGAAPstsvsFGGAHSTSLCF 1101
Cdd:pfam15967 3 GFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTF 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1102 GGAPSTSlcfGSASNTNLCFGGPPSTSACFSG-----------ATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAG 1170
Cdd:pfam15967 74 GTPASST---AATGPTGLTLGTPAATTAASTGfslgfnkpaasATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLN 150
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 967505707 1171 FGGGLGTSAGFSGDL---STSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1219
Cdd:pfam15967 151 LGGTPATTTAVSTGLslgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTA 202
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
884-1078 |
6.61e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.52 E-value: 6.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 884 TLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGG 963
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 964 SPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSG 1043
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETAT 160
|
170 180 190
....*....|....*....|....*....|....*...
gi 967505707 1044 VPSTNP---GFGGAFNTSAGFGGALSTTTDFGGTPNNS 1078
Cdd:COG3469 161 GGTTTTsttTTTTSASTTPSATTTATATTASGATTPSA 198
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1048-1282 |
1.35e-05 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 49.67 E-value: 1.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1048 NPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPstslcFGSASNTNLCFGGPPST 1127
Cdd:pfam15967 5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1128 SAcfSGATSPSFGDGPSTSTgfsfgnglSTSAGFGGGLNTSAgfggglGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGT 1207
Cdd:pfam15967 80 TA--ATGPTGLTLGTPAATT--------AASTGFSLGFNKPA------ASATPFSLPASSTSGGGLSLGSVLTSTAAQQG 143
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 967505707 1208 STGFGGGLGTSAGFSGGLGTGAGFGGGLvtsDGFGGGLGTNASfGSTLGTGAGFSGGLSTSDGFGSRPNASFDRG 1282
Cdd:pfam15967 144 ATGFTLNLGGTPATTTAVSTGLSLGSTL---TSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1009-1222 |
2.87e-05 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 47.97 E-value: 2.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1009 AGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTS---AGFGGALSTTTDFGGTPNNSIGFGAAP 1085
Cdd:COG5651 155 AAASAAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGlnqVGIGGLNSGSGPIGLNSGPGNTGFAGT 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1086 STSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGL 1165
Cdd:COG5651 235 GAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGG 314
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 967505707 1166 NTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFS 1222
Cdd:COG5651 315 AAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSA 371
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
864-1077 |
1.31e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.28 E-value: 1.31e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 864 TLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGG 943
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 944 AMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVcfgGSPSTSAGFSGALNTNASFGC 1023
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTS---GASATSSAGSTTTTTTVSGTE 157
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 967505707 1024 AISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNN 1077
Cdd:COG3469 158 TATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
679-883 |
1.96e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 45.81 E-value: 1.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 679 FSRGAGTRAGFSDGASISFNGAPSS----SGGPGI-TFGGAPSSSASfSNTASISFGGTLSTSSSFSSAASisfgGAPST 753
Cdd:pfam15967 4 FSFGGGPGSTATAGGGFSFGAAAASnpgsTGGFSFgTLGAAPAATAT-TTTATLGLGGGLFGQKPATGFTF----GTPAS 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 754 STSFSSEASISfGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSttagFSSVLSTSTS----------FGS 823
Cdd:pfam15967 79 STAATGPTGLT-LGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLS----LGSVLTSTAAqqgatgftlnLGG 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 967505707 824 APTTNTVFSSAL---STSTGFGGTLSTSVcFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 883
Cdd:pfam15967 154 TPATTTAVSTGLslgSTLTSLGGSLFQNT-NSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
687-950 |
4.22e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 44.66 E-value: 4.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 687 AGFSDGASisfnGAPSSSGGPGITFGGAPSSSAsfSNTASISFGgtlstsssfssaasiSFGGAPSTSTSfSSEASISFG 766
Cdd:pfam15967 2 SGFSFGGG----PGSTATAGGGFSFGAAAASNP--GSTGGFSFG---------------TLGAAPAATAT-TTTATLGLG 59
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 767 GTPctsasfsGGVSSSFSGPLNTSAtfSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLS 846
Cdd:pfam15967 60 GGL-------FGQKPATGFTFGTPA--SSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLS 130
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 847 TSVCFGGSPSSSGSFGGTLStsicFGGSPCTSTgfggTLSTSVSFGgpsSTSANCGGTLSTSICFDGSPSTGAGfGGALN 926
Cdd:pfam15967 131 LGSVLTSTAAQQGATGFTLN----LGGTPATTT----AVSTGLSLG---STLTSLGGSLFQNTNSTGLGQTTLG-LTLLA 198
|
250 260
....*....|....*....|....*
gi 967505707 927 TSASFGSALNTSAGFGGA-MSTSAD 950
Cdd:pfam15967 199 TSTAPVSAPAASEGLGGLdFSTSSE 223
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1152-1370 |
7.47e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 43.73 E-value: 7.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1152 GNGLSTSAGFGGGLNTSAGFGgglgtSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGF 1231
Cdd:COG5651 178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1232 GGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHS 1311
Cdd:COG5651 253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGA 332
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 967505707 1312 GPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGfgggpnTGAGFGGGPSTSAGFGS 1370
Cdd:COG5651 333 AAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGG------GSAGAAAGAASGGGAAA 385
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
670-1060 |
1.19e-03 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 43.22 E-value: 1.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 670 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 749
Cdd:COG5295 200 AGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASG 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 750 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 829
Cdd:COG5295 280 AAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAAD 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 830 VFSSALSTSTGFGGTLSTSV------CFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGG 903
Cdd:COG5295 360 ATSGGGAGGGGAAATSSSGGsataagNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGA 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 904 TLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGG 983
Cdd:COG5295 440 SGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAA 519
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 967505707 984 AVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAG 1060
Cdd:COG5295 520 NAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAG 596
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
911-1103 |
1.40e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 43.12 E-value: 1.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 911 FDGSPSTGAGFGGALNTSASFGSALNTSAG--FGGAMSTSADFGSTLSTSVCFGGSPgtsvsFGSALNTSAGFGGAVSTS 988
Cdd:pfam15967 6 FGGGPGSTATAGGGFSFGAAAASNPGSTGGfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 989 TDFGGTLSTSVCFGGSPSTSAGFSGALNTNA------------------SFGCAISTSAGFSGAVGTSAGFSGVPSTNPG 1050
Cdd:pfam15967 81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAasatpfslpasstsggglSLGSVLTSTAAQQGATGFTLNLGGTPATTTA 160
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 967505707 1051 FGGAFN---TSAGFGGALSTTTDFGGTPNNSIGfGAAPSTSVSFGGAHSTSLCFGG 1103
Cdd:pfam15967 161 VSTGLSlgsTLTSLGGSLFQNTNSTGLGQTTLG-LTLLATSTAPVSAPAASEGLGG 215
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1121-1344 |
3.11e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.42 E-value: 3.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1121 FGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNglstsAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAG 1200
Cdd:COG5651 167 FTQPPPTITNPGGLLGAQNAGSGNTSSNPGFAN-----LGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAA 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1201 FSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFD 1280
Cdd:COG5651 242 AAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGA 321
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 967505707 1281 RGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFcSGPSISGFSGGPST 1344
Cdd:COG5651 322 TGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGS-AGAAAGAASGGGAA 384
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1173-1376 |
6.01e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 40.75 E-value: 6.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1173 GGLGTSAGFSGDLSTSSGFDGGLGTSaGFSGGPGTSTGFGGGLGTsaGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFG 1252
Cdd:cd21118 125 GGHGAYGSQGGPGVQGHGIPGGTGGP-WASGGNYGTNSLGGSVGQ--GGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1253 STLGTGAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSivGFSGGPSTGVGFCSG 1332
Cdd:cd21118 202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 967505707 1333 PSISGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGATSLG 1376
Cdd:cd21118 279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
806-1038 |
6.61e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 40.65 E-value: 6.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 806 STTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSvcfggsPSSSGSFGGTLSTSICFGGSPCTSTGFGGTl 885
Cdd:COG5651 162 VALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTG------LNQVGIGGLNSGSGPIGLNSGPGNTGFAGT- 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 886 STSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSvcFGGSP 965
Cdd:COG5651 235 GAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATG--LGLGA 312
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 967505707 966 GTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1038
Cdd:COG5651 313 GGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1147-1383 |
6.80e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 40.81 E-value: 6.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1147 TGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFsgdlstssGFDGGLGTSAGFSGGPGTSTGFGGGL---GTSAGFSg 1223
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF--------SFGTLGAAPAATATTTTATLGLGGGLfgqKPATGFT- 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1224 glgtgagfggglvtsdgFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFG---SRPNAS---FDRGLSTIIGFGSGSNTST 1297
Cdd:pfam15967 73 -----------------FGTPASSTAATGPTGLTLGTPAATTAASTGFSlgfNKPAASatpFSLPASSTSGGGLSLGSVL 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1298 GFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFcsGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1377
Cdd:pfam15967 136 TSTAAQQGATGFTLNLGGTPATTTAVSTGLSL--GSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGL 213
|
....*.
gi 967505707 1378 CGFSYG 1383
Cdd:pfam15967 214 GGLDFS 219
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1138-1350 |
8.64e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 40.42 E-value: 8.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1138 SFGDGPSTST----GFSFGNGLSTSAGFGGGLntsaGFGGGLGTSAGFSGDLSTSSGFDGGL---GTSAGFS-GGPGTST 1209
Cdd:pfam15967 5 SFGGGPGSTAtaggGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGLfgqKPATGFTfGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 967505707 1210 GFGGGLGTSAGFSGGLGTGAGFGgglvtSDGFGGGLGTNASFGSTLGTGAGfsGGLSTSDGFGSRPNASFDRGLSTIIGf 1289
Cdd:pfam15967 81 AATGPTGLTLGTPAATTAASTGF-----SLGFNKPAASATPFSLPASSTSG--GGLSLGSVLTSTAAQQGATGFTLNLG- 152
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 967505707 1290 GSGSNTSTGFIGEP--STSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGG 1350
Cdd:pfam15967 153 GTPATTTAVSTGLSlgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
|