| Name | Accession | Description | Interval | E-value |
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 927-1266 | 1.60e-23 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 108.55 E-value: 1.60e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 927 GTLSTSVSFGGPSSTSANCGGTLSTSIcfdgspSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFG 1006
Cdd:NF033849 217 GQKSISFGVSLPMMYAANLGQSAGTGY------GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1007 GSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1086
Cdd:NF033849 291 TSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHS 370
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1087 GVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFG-GAHSTSLCFGGAPSTSLCFGSASNTnlcf 1165
Cdd:NF033849 371 TSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGTSSGHSDSS---- 446
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1166 ggppSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGF 1245
Cdd:NF033849 447 ----SHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGT 522
|
330 340
....*....|....*....|.
gi 1622971899 1246 SGGPGTSTGFGGGLGTSAGFS 1266
Cdd:NF033849 523 SGGRTSGAGGSMGLGPSISLG 543
|
|
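Each hit carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. A minimal sketch of how such a line could be turned into typed fields is shown here. The function name `parse_summary` and the dictionary keys are hypothetical, chosen only for illustration, and the pattern assumes the layout seen in this listing, where the `[Multi-domain]` tag is optional.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The bracketed tag (e.g. "[Multi-domain]") is optional: the MAGE hit lacks it.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    match = SUMMARY_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Summary line copied from the first ser_rich_anae_1 hit.
first_hit = parse_summary(
    "Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 108.55 E-value: 1.60e-23"
)
print(first_hit)
```

Returning `None` for non-matching lines lets a caller run the function over every line of the listing and keep only the summaries.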
| MAGE | pfam01454 | MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... | 451-611 | 6.84e-23 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 98.11 E-value: 6.84e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYALEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 510 LYILIST---RESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVRHSLFG 574
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 1622971899 575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
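The aligned row pairs above can be summarised numerically. The sketch below counts aligned columns and identical residues for one query/profile row pair, under the assumption that lowercase letters mark unaligned residues and `-` marks a gap, so only columns with an uppercase letter in both rows are counted. The function name `alignment_stats` is hypothetical; the two strings are the final row pair of the MAGE (pfam01454) alignment.

```python
def alignment_stats(query, subject):
    """Count aligned columns and identical residues in one alignment row pair.

    Assumption: lowercase letters are unaligned residues and '-' is a gap,
    so a column counts as aligned only if both characters are uppercase.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return aligned, identical


# Last row pair of the MAGE alignment (query residues 575-611, profile 161-198).
query_row = "EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS"
profile_row = "NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA"

aligned, identical = alignment_stats(query_row, profile_row)
print(f"{identical} identical residues in {aligned} aligned columns")
```

To score a whole hit, the row pairs of its alignment blocks would be concatenated before calling the function.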
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 1060-1427 | 2.85e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 101.24 E-value: 2.85e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1060 NTNASFGcaISTSAGFSGAVGTSAGfsgvpstnPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAH 1139
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSAG--------TGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1140 STSLcfggapSTSLCFGSASNTnlcfggppSTSACFSGATSpsfgDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGL 1219
Cdd:NF033849 288 TQST------SESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1220 GTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGlgTSAGFSGGLGTGAgfggglVTSDGFGGGLGTNASFGSTL 1299
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIAGGG------VTSEGLGASQGGSEGWGSGD 421
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1300 GtGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSI 1379
Cdd:NF033849 422 S-VQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSE 500
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1622971899 1380 SgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGfgsGATSLGAcGFSYG 1427
Cdd:NF033849 501 S-VSQGDGRSTGRSESQGTSLGTSGGRTSGAG---GSMGLGP-SISLG 543
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 860-1226 | 4.39e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.46 E-value: 4.39e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 860 STSTSFG-SAPTtntVFSSAL--STSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGgspcTSTGFGGTLSTSVSFG 936
Cdd:NF033849 218 QKSISFGvSLPM---MYAANLgqSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQS----HTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 937 GPSSTSANCGGTLSTSIcfdgspstgagfggALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFG 1016
Cdd:NF033849 291 TSESESTGQSSSVGTSE--------------SQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1017 SALNTSAGFGGAVSTSTDFGGTLSTSVCFggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVpSTNPGFG 1096
Cdd:NF033849 357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSV-SQSYGSS 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1097 GAFNTSAGfggaLSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSlcfgsasntnlcfgGPPSTSACFS 1176
Cdd:NF033849 434 SSTGTSSG----HSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS--------------QSETDSVGDS 495
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1177 GATSPSFGDGPSTSTGFSfgNGLSTSAGFGGGLNTSAGFGGGLGTSAGFS 1226
Cdd:NF033849 496 TGTSESVSQGDGRSTGRS--ESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 768-1086 | 7.78e-19 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 93.15 E-value: 7.78e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 768 TASISFGGTLSTSSSFSSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGG 847
Cdd:NF033849 218 QKSISFGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 848 TLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 927
Cdd:NF033849 298 GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSS 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 928 TLSTSVSFGgpSSTSANCGGTLStsicfdGSPSTGAGFGGALNTSASFGSA-----LNTSAGFGGAMSTSADFGSTLSTS 1002
Cdd:NF033849 378 SESSSRSSS--SGVSGGFSGGIA------GGGVTSEGLGASQGGSEGWGSGdsvqsVSQSYGSSSSTGTSSGHSDSSSHS 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1003 VCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1082
Cdd:NF033849 450 TSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
....
gi 1622971899 1083 AGFS 1086
Cdd:NF033849 530 AGGS 533
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 709-1085 | 1.49e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 85.83 E-value: 1.49e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 709 ARAQENadasTSVNFSRGAGTRA---GFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSS 785
Cdd:NF033849 202 EAAAEE----TSNWASRQQGQKSisfGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTT 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 786 AASISFGGAPSTSTSFSSEASISfggtpctsASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSF 865
Cdd:NF033849 278 GHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 866 GSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFGgpSSTSANc 945
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWG--SGDSVQ- 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 946 ggtlSTSICFDGSPSTGAGFGGALNTSASFGSAlnTSAGFGGAMSTSADFGSTLSTSVcfggspGTSVSFGSALNTSAGF 1025
Cdd:NF033849 425 ----SVSQSYGSSSSTGTSSGHSDSSSHSTSSG--QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1026 GGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFgcaistSAGFSGAVGTSAGF 1085
Cdd:NF033849 493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGG------SMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 | NF033849 | serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... | 713-1086 | 2.48e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 85.06 E-value: 2.48e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 713 ENADASTSVNFSRGAGTRAGFSDGASISFngapSSSGGPGITFGGAPSSSASFSNTASISfggtlstsSSFSSAASISFG 792
Cdd:NF033849 245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQ----SHTTGHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHG 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 793 GAPSTSTSFSSEASISFGGTPCTSASFsggvsssfsgplNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTN 872
Cdd:NF033849 313 TTEGTSTTDSSSHSQSSSYNVSSGTGV------------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 873 TVFSSALSTSTGFGGTLstsvcfggspsssgsfggtlstsicfGGSPCTSTGFGGTLSTSVSFGgpSStsancggtlsts 952
Cdd:NF033849 381 SSRSSSSGVSGGFSGGI--------------------------AGGGVTSEGLGASQGGSEGWG--SG------------ 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 953 icfDGSPSTGAGFGGALNTSASFGSALNTSagfggaMSTSADFGSTLSTSVcfGGSPGTSVSFGSALNTSAGFGGAVSTS 1032
Cdd:NF033849 421 ---DSVQSVSQSYGSSSSTGTSSGHSDSSS------HSTSSGQADSVSQGT--SWSEGTGTSQGQSVGTSESWSTSQSET 489
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1622971899 1033 TDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1086
Cdd:NF033849 490 DSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| FhaB | COG3210 | Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... | 704-1421 | 1.90e-15 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 82.51 E-value: 1.90e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 704 SFEIEARAQENADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSF 783
Cdd:COG3210 795 SIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATA 874
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 784 SSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTST 863
Cdd:COG3210 875 ASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGL 954
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 864 SFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSA 943
Cdd:COG3210 955 SAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGT 1034
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 944 NCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSA 1023
Cdd:COG3210 1035 GTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTG 1114
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1024 GFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSA 1103
Cdd:COG3210 1115 GVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTA 1194
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1104 GFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSF 1183
Cdd:COG3210 1195 GTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVA 1274
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1184 GDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1263
Cdd:COG3210 1275 GNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNG 1354
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1264 GFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGF 1343
Cdd:COG3210 1355 GNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGT 1434
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622971899 1344 IGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1421
Cdd:COG3210 1435 GGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTA 1512
|
|
| FhaB | COG3210 | Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... | 714-1341 | 8.27e-12 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 70.57 E-value: 8.27e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 714 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 793
Cdd:COG3210 119 TAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVG 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 794 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 873
Cdd:COG3210 199 GALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVT 278
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 874 VFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSI 953
Cdd:COG3210 279 GTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTG 358
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 954 CFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTST 1033
Cdd:COG3210 359 AGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGN 438
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1034 DFGGTLSTSVCFGGSPSTSAGFSGALNTNASfgcaisTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTT 1113
Cdd:COG3210 439 GTVTGGTIGGLTGSGTTNGAGLSGNTDVSGT------GTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIAT 512
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1114 DFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGF 1193
Cdd:COG3210 513 GLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGT 592
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1194 SFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGA 1273
Cdd:COG3210 593 GTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGG 672
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622971899 1274 GFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 1341
Cdd:COG3210 673 GTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
|
|
| PTZ00395 | PTZ00395 | Sec24-related protein; Provisional | 1005-1227 | 6.61e-08 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 57.78 E-value: 6.61e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1005 FGGSPGTSVSFGSALNTSAGFGGAVStstdfGGTLSTSvcfggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAG 1084
Cdd:PTZ00395 339 YGGFHDGSPNAASAGAPFNGLGNQAD-----GGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1085 FSGVPSTNPGfggafNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLC 1164
Cdd:PTZ00395 409 FSNAGYSNPG-----NSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSA 483
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622971899 1165 FggppSTSACFSGATSPSfGDGP----STSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSG 1227
Cdd:PTZ00395 484 Y----HAAYQHRAANQPA-ANLPtanqPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
|
|
| Nucleoporin_FG2 | pfam15967 | Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... | 1066-1263 | 1.71e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 52.36 E-value: 1.71e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1066 GCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFntsaGFGGALSTTTDFGGTPNNSIGFGAAPstsvsFGGAHSTSLCF 1145
Cdd:pfam15967 3 GFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTF 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1146 GGAPSTSlcfGSASNTNLCFGGPPSTSACFSG-----------ATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAG 1214
Cdd:pfam15967 74 GTPASST---AATGPTGLTLGTPAATTAASTGfslgfnkpaasATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLN 150
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1622971899 1215 FGGGLGTSAGFSGDL---STSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1263
Cdd:pfam15967 151 LGGTPATTTAVSTGLslgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTA 202
|
|
| Nucleoporin_FG2 | pfam15967 | Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... | 723-927 | 1.19e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 46.58 E-value: 1.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 723 FSRGAGTRAGFSDGASISFNGAPSS----SGGPGI-TFGGAPSSSASfSNTASISFGGTLSTSSSFSSAASisfgGAPST 797
Cdd:pfam15967 4 FSFGGGPGSTATAGGGFSFGAAAASnpgsTGGFSFgTLGAAPAATAT-TTTATLGLGGGLFGQKPATGFTF----GTPAS 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 798 STSFSSEASISfGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSttagFSSVLSTSTS----------FGS 867
Cdd:pfam15967 79 STAATGPTGLT-LGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLS----LGSVLTSTAAqqgatgftlnLGG 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622971899 868 APTTNTVFSSAL---STSTGFGGTLSTSVcFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 927
Cdd:pfam15967 154 TPATTTAVSTGLslgSTLTSLGGSLFQNT-NSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| PPE | COG5651 | PPE-repeat protein [Function unknown]; | 1196-1414 | 5.29e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 44.11 E-value: 5.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1196 GNGLSTSAGFGGGLNTSAGFGgglgtSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGF 1275
Cdd:COG5651 178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1276 GGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHS 1355
Cdd:COG5651 253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGA 332
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1622971899 1356 GPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGfgggpnTGAGFGGGPSTSAGFGS 1414
Cdd:COG5651 333 AAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGG------GSAGAAAGAASGGGAAA 385
|
|
| dermokine | cd21118 | dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... | 1217-1420 | 3.79e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 41.52 E-value: 3.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1217 GGLGTSAGFSGDLSTSSGFDGGLGTSaGFSGGPGTSTGFGGGLGTsaGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFG 1296
Cdd:cd21118 125 GGHGAYGSQGGPGVQGHGIPGGTGGP-WASGGNYGTNSLGGSVGQ--GGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1297 STLGTGAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSivGFSGGPSTGVGFCSG 1376
Cdd:cd21118 202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1622971899 1377 PSISGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGATSLG 1420
Cdd:cd21118 279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
|
|
| Nucleoporin_FG2 | pfam15967 | Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... | 1191-1427 | 4.05e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 41.58 E-value: 4.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1191 TGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFsgdlstssGFDGGLGTSAGFSGGPGTSTGFGGGL---GTSAGFSg 1267
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF--------SFGTLGAAPAATATTTTATLGLGGGLfgqKPATGFT- 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1268 glgtgagfggglvtsdgFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFG---SRPNAS---FDRGLSTIIGFGSGSNTST 1341
Cdd:pfam15967 73 -----------------FGTPASSTAATGPTGLTLGTPAATTAASTGFSlgfNKPAASatpFSLPASSTSGGGLSLGSVL 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1342 GFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFcsGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1421
Cdd:pfam15967 136 TSTAAQQGATGFTLNLGGTPATTTAVSTGLSL--GSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGL 213
|
....*.
gi 1622971899 1422 CGFSYG 1427
Cdd:pfam15967 214 GGLDFS 219
|
|
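Many of the hit intervals listed above overlap on the query (gi 1622971899), so the hit count overstates how many distinct regions are annotated. As a rough sketch, the intervals from the Interval column can be merged into non-overlapping regions. The function name `merge_intervals` is hypothetical, and treating intervals as 1-based and inclusive is an assumption about the coordinate convention.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) intervals into sorted regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Query intervals of the 15 hits, in the order they are listed.
hits = [
    (927, 1266), (451, 611), (1060, 1427), (860, 1226), (768, 1086),
    (709, 1085), (713, 1086), (704, 1421), (714, 1341), (1005, 1227),
    (1066, 1263), (723, 927), (1196, 1414), (1217, 1420), (1191, 1427),
]

regions = merge_intervals(hits)
covered = sum(end - start + 1 for start, end in regions)
print(regions)   # [(451, 611), (704, 1427)]
print(covered)   # 885
```

Under these assumptions the 15 hits collapse into two regions: the MAGE homology domain at 451-611 and one long low-complexity stretch at 704-1427 that all the remaining hits fall within.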
|
|
Name |
Accession |
Description |
Interval |
E-value |
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
927-1266 |
1.60e-23 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 108.55 E-value: 1.60e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 927 GTLSTSVSFGGPSSTSANCGGTLSTSIcfdgspSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFG 1006
Cdd:NF033849 217 GQKSISFGVSLPMMYAANLGQSAGTGY------GESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1007 GSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1086
Cdd:NF033849 291 TSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHS 370
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1087 GVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFG-GAHSTSLCFGGAPSTSLCFGSASNTnlcf 1165
Cdd:NF033849 371 TSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGdSVQSVSQSYGSSSSTGTSSGHSDSS---- 446
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1166 ggppSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGF 1245
Cdd:NF033849 447 ----SHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGT 522
|
330 340
....*....|....*....|.
gi 1622971899 1246 SGGPGTSTGFGGGLGTSAGFS 1266
Cdd:NF033849 523 SGGRTSGAGGSMGLGPSISLG 543
|
|
| MAGE |
pfam01454 |
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide ... |
451-611 |
6.84e-23 |
|
MAGE homology domain; The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
Pssm-ID: 426270 Cd Length: 205 Bit Score: 98.11 E-value: 6.84e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 451 LVKYLLVKDQTKIPIKRSDMLRDVIQEYDE-YFPEIIERASYALEKMFRVNLKEID--------------------KQSS 509
Cdd:pfam01454 1 LVRYALACEYQRTPIRREDISKKVLGENRKrLFKKVFEEAQKILRDVFGMELVELPakeekkttvtsqqrraaaksSRSK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 510 LYILIST---RESSAGILGTTK---------DTPKLGLLMVILSVIFMNGNKASEAVIWEVLRKLGLH---PGVRHSLFG 574
Cdd:pfam01454 81 SYILVSTlppEYRVPAIIWPSKapsfvldqdEATYTGILTVILSLILLSGGSISEQELLRYLRRLGIDtdgTKEIPPLNG 160
|
170 180 190
....*....|....*....|....*....|....*....
gi 1622971899 575 EVRKLItDEFVKQKYLEYKRVPNSRP--PEYEFFWGLRS 611
Cdd:pfam01454 161 NTDDLL-KRLVKQGYLVRTKEGASDDgeEIIEYRVGPRA 198
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
1060-1427 |
2.85e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), which a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 101.24 E-value: 2.85e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1060 NTNASFGcaISTSAGFSGAVGTSAGfsgvpstnPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAH 1139
Cdd:NF033849 218 QKSISFG--VSLPMMYAANLGQSAG--------TGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSH 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1140 STSLcfggapSTSLCFGSASNTnlcfggppSTSACFSGATSpsfgDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGL 1219
Cdd:NF033849 288 TQST------SESESTGQSSSV--------GTSESQSHGTT----EGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1220 GTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGlgTSAGFSGGLGTGAgfggglVTSDGFGGGLGTNASFGSTL 1299
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSG--VSGGFSGGIAGGG------VTSEGLGASQGGSEGWGSGD 421
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1300 GtGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSI 1379
Cdd:NF033849 422 S-VQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSE 500
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1622971899 1380 SgFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGfgsGATSLGAcGFSYG 1427
Cdd:NF033849 501 S-VSQGDGRSTGRSESQGTSLGTSGGRTSGAG---GSMGLGP-SISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
860-1226 |
4.39e-21 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 100.46 E-value: 4.39e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 860 STSTSFG-SAPTtntVFSSAL--STSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGgspcTSTGFGGTLSTSVSFG 936
Cdd:NF033849 218 QKSISFGvSLPM---MYAANLgqSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQS----HTTGHGSTRGWSHTQS 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 937 GPSSTSANCGGTLSTSIcfdgspstgagfggALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFG 1016
Cdd:NF033849 291 TSESESTGQSSSVGTSE--------------SQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHS 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1017 SALNTSAGFGGAVSTSTDFGGTLSTSVCFggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVpSTNPGFG 1096
Cdd:NF033849 357 ESSSESTGTSVGHSTSSSVSSSESSSRSS--SSGVSGGFSGGIAGGGVTSEGLGASQGGSEGWGSGDSVQSV-SQSYGSS 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1097 GAFNTSAGfggaLSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSlcfgsasntnlcfgGPPSTSACFS 1176
Cdd:NF033849 434 SSTGTSSG----HSDSSSHSTSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTS--------------QSETDSVGDS 495
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1177 GATSPSFGDGPSTSTGFSfgNGLSTSAGFGGGLNTSAGFGGGLGTSAGFS 1226
Cdd:NF033849 496 TGTSESVSQGDGRSTGRS--ESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
768-1086 |
7.78e-19 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 93.15 E-value: 7.78e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 768 TASISFGGTLSTSSSFSSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGG 847
Cdd:NF033849 218 QKSISFGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESEST 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 848 TLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 927
Cdd:NF033849 298 GQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSS 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 928 TLSTSVSFGgpSSTSANCGGTLStsicfdGSPSTGAGFGGALNTSASFGSA-----LNTSAGFGGAMSTSADFGSTLSTS 1002
Cdd:NF033849 378 SESSSRSSS--SGVSGGFSGGIA------GGGVTSEGLGASQGGSEGWGSGdsvqsVSQSYGSSSSTGTSSGHSDSSSHS 449
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1003 VCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1082
Cdd:NF033849 450 TSSGQADSVSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSG 529
|
....
gi 1622971899 1083 AGFS 1086
Cdd:NF033849 530 AGGS 533
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
709-1085 |
1.49e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 85.83 E-value: 1.49e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 709 ARAQENadasTSVNFSRGAGTRA---GFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSS 785
Cdd:NF033849 202 EAAAEE----TSNWASRQQGQKSisfGVSLPMMYAANLGQSAGTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTT 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 786 AASISFGGAPSTSTSFSSEASISfggtpctsASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSF 865
Cdd:NF033849 278 GHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQ 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 866 GSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSIcfGGSPCTSTGFGGTLSTSVSFGgpSSTSANc 945
Cdd:NF033849 350 STSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGI--AGGGVTSEGLGASQGGSEGWG--SGDSVQ- 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 946 ggtlSTSICFDGSPSTGAGFGGALNTSASFGSAlnTSAGFGGAMSTSADFGSTLSTSVcfggspGTSVSFGSALNTSAGF 1025
Cdd:NF033849 425 ----SVSQSYGSSSSTGTSSGHSDSSSHSTSSG--QADSVSQGTSWSEGTGTSQGQSV------GTSESWSTSQSETDSV 492
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1026 GGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFgcaistSAGFSGAVGTSAGF 1085
Cdd:NF033849 493 GDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGG------SMGLGPSISLGKSY 546
|
|
| ser_rich_anae_1 |
NF033849 |
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ... |
713-1086 |
2.48e-16 |
|
serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.
Pssm-ID: 468206 [Multi-domain] Cd Length: 1122 Bit Score: 85.06 E-value: 2.48e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 713 ENADASTSVNFSRGAGTRAGFSDGASISFngapSSSGGPGITFGGAPSSSASFSNTASISfggtlstsSSFSSAASISFG 792
Cdd:NF033849 245 ESVGHSTSQGQSHSVGTSESHSVGTSQSQ----SHTTGHGSTRGWSHTQSTSESESTGQS--------SSVGTSESQSHG 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 793 GAPSTSTSFSSEASISFGGTPCTSASFsggvsssfsgplNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTN 872
Cdd:NF033849 313 TTEGTSTTDSSSHSQSSSYNVSSGTGV------------SSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSES 380
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 873 TVFSSALSTSTGFGGTLstsvcfggspsssgsfggtlstsicfGGSPCTSTGFGGTLSTSVSFGgpSStsancggtlsts 952
Cdd:NF033849 381 SSRSSSSGVSGGFSGGI--------------------------AGGGVTSEGLGASQGGSEGWG--SG------------ 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 953 icfDGSPSTGAGFGGALNTSASFGSALNTSagfggaMSTSADFGSTLSTSVcfGGSPGTSVSFGSALNTSAGFGGAVSTS 1032
Cdd:NF033849 421 ---DSVQSVSQSYGSSSSTGTSSGHSDSSS------HSTSSGQADSVSQGT--SWSEGTGTSQGQSVGTSESWSTSQSET 489
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 1622971899 1033 TDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFS 1086
Cdd:NF033849 490 DSVGDSTGTSESVSQGDGRSTGRSESQGTSLGTSGGRTSGAGGSMGLGPSISLG 543
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
704-1421 |
1.90e-15 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 82.51 E-value: 1.90e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 704 SFEIEARAQENADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSF 783
Cdd:COG3210 795 SIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATA 874
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 784 SSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTST 863
Cdd:COG3210 875 ASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGL 954
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 864 SFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSA 943
Cdd:COG3210 955 SAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGT 1034
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 944 NCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSA 1023
Cdd:COG3210 1035 GTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTG 1114
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1024 GFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSA 1103
Cdd:COG3210 1115 GVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTA 1194
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1104 GFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSF 1183
Cdd:COG3210 1195 GTDLKGGDSTGGSTTTIGTTNVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVA 1274
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1184 GDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1263
Cdd:COG3210 1275 GNAGATATGSTVDIGSTSATSAGGSLDTTGNTAGANGATVGTGIGGTTATGTAVAAVNSGGVNAGGGTINTTAANTGLNG 1354
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1264 GFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGF 1343
Cdd:COG3210 1355 GNGATDSAAGAGSGGAAGSLAATAGAGTVLTGAGNNTGAEGTNAGRDGGVTTSGTGVGNNGGVSGTTVAGTTGSSATTGT 1434
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622971899 1344 IGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1421
Cdd:COG3210 1435 GGTGNTTGTSVAGAGGGNADASAINTGNASSLGAGGSTAGNAVGGAVIGGTTTGGNGAGVAGATASNGGTSTGAGGTA 1512
|
|
| FhaB |
COG3210 |
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ... |
714-1341 |
8.27e-12 |
|
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442443 [Multi-domain] Cd Length: 1698 Bit Score: 70.57 E-value: 8.27e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 714 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 793
Cdd:COG3210 119 TAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVVAANPTGVTGVG 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 794 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 873
Cdd:COG3210 199 GALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVT 278
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 874 VFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSI 953
Cdd:COG3210 279 GTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTG 358
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 954 CFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTST 1033
Cdd:COG3210 359 AGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGN 438
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1034 DFGGTLSTSVCFGGSPSTSAGFSGALNTNASfgcaisTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTT 1113
Cdd:COG3210 439 GTVTGGTIGGLTGSGTTNGAGLSGNTDVSGT------GTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIAT 512
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1114 DFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGF 1193
Cdd:COG3210 513 GLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGT 592
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1194 SFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGA 1273
Cdd:COG3210 593 GTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGG 672
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622971899 1274 GFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTST 1341
Cdd:COG3210 673 GTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGNLGTGAT 740
|
|
| AidA |
COG3468 |
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ... |
867-1307 |
5.04e-11 |
|
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442691 [Multi-domain] Cd Length: 846 Bit Score: 67.66 E-value: 5.04e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 867 SAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCG 946
Cdd:COG3468 1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 947 GTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGspGTSVSFGSALNTSAGFG 1026
Cdd:COG3468 81 SGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGG--GGGGGTGVGGTGAAAAG 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1027 GAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGcaISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFG 1106
Cdd:COG3468 159 GGTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGG--GGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGG 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1107 GALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGApstslcFGSASNTNLCFGGPPSTSACFSGATSPSFGDG 1186
Cdd:COG3468 237 GVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGG------GANGGGSGGGGGASGTGGGGTASTGGGGGGGG 310
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1187 PSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFS 1266
Cdd:COG3468 311 GNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDG 390
|
410 420 430 440
....*....|....*....|....*....|....*....|.
gi 1622971899 1267 GGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSG 1307
Cdd:COG3468 391 VGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTG 431
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
916-1398 |
7.18e-11 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 67.11 E-value: 7.18e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 916 GGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADF 995
Cdd:COG4625 19 GGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGG 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 996 GSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGF 1075
Cdd:COG4625 99 GGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGG 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1076 SGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTslcfGGAPSTSLCF 1155
Cdd:COG4625 179 GGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG----GGGAGGGGGG 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1156 GSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGF 1235
Cdd:COG4625 255 GGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGA 334
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1236 DGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGF 1315
Cdd:COG4625 335 GGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAG 414
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1316 GSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGG 1395
Cdd:COG4625 415 GGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
|
...
gi 1622971899 1396 PNT 1398
Cdd:COG4625 495 NYT 497
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
832-1308 |
1.12e-10 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 66.34 E-value: 1.12e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 832 NTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLST 911
Cdd:COG4625 18 GGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGG 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 912 SICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMST 991
Cdd:COG4625 98 GGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGG 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 992 SADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAIST 1071
Cdd:COG4625 178 GGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGG 257
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1072 SAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPST 1151
Cdd:COG4625 258 NGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1152 SLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLST 1231
Cdd:COG4625 338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622971899 1232 SSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGG 1308
Cdd:COG4625 418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGG 494
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
763-1264 |
3.28e-10 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 64.80 E-value: 3.28e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 763 ASFSNTASISFGGTLSTSSSFSSAASISFGGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAAS 842
Cdd:COG4625 1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 843 SGFGGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTS 922
Cdd:COG4625 81 GGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 923 TGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTS 1002
Cdd:COG4625 161 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1003 VCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1082
Cdd:COG4625 241 GGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1083 AGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTN 1162
Cdd:COG4625 321 GGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGG 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1163 LCFGGPpSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTS 1242
Cdd:COG4625 401 GGGGAG-GTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLT 479
|
490 500
....*....|....*....|..
gi 1622971899 1243 AGFSGGPGTSTGFGGGLGTSAG 1264
Cdd:COG4625 480 GNNTYTGTTTVNGGGNYTQSAG 501
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
792-1266 |
2.65e-09 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 62.10 E-value: 2.65e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 792 GGAPSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTT 871
Cdd:COG4625 10 GGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGG 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 872 NTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLST 951
Cdd:COG4625 90 TGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 952 SICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVcfGGSPGTSVSFGSALNTSAGFGGAVST 1031
Cdd:COG4625 170 GGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGG--GGGGGGGGGGGGGGGGGGGGGGGGGG 247
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1032 STDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALST 1111
Cdd:COG4625 248 AGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 327
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1112 TTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTST 1191
Cdd:COG4625 328 GGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGG 407
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622971899 1192 GFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFS 1266
Cdd:COG4625 408 TGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNN 482
|
|
| PTZ00395 |
PTZ00395 |
Sec24-related protein; Provisional |
1005-1227 |
6.61e-08 |
|
Sec24-related protein; Provisional
Pssm-ID: 185594 [Multi-domain] Cd Length: 1560 Bit Score: 57.78 E-value: 6.61e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1005 FGGSPGTSVSFGSALNTSAGFGGAVStstdfGGTLSTSvcfggSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAG 1084
Cdd:PTZ00395 339 YGGFHDGSPNAASAGAPFNGLGNQAD-----GGHINQV-----HPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAG 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1085 FSGVPSTNPGfggafNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLC 1164
Cdd:PTZ00395 409 FSNAGYSNPG-----NSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSA 483
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622971899 1165 FggppSTSACFSGATSPSfGDGP----STSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSG 1227
Cdd:PTZ00395 484 Y----HAAYQHRAANQPA-ANLPtanqPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNG 545
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
846-1427 |
2.08e-07 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 55.55 E-value: 2.08e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 846 GGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGF 925
Cdd:COG5295 2 ASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAAS 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 926 GGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCF 1005
Cdd:COG5295 82 SVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATG 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1006 GGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGF 1085
Cdd:COG5295 162 SSTANAATAAAGATSTSASGSSSGASGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASA 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1086 SGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFG---AAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTN 1162
Cdd:COG5295 242 GAASGNATTASASSVSGSAVAAGTASTATTASTTAASGAAGtatAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAA 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1163 LCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTS 1242
Cdd:COG5295 322 ALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGS 401
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1243 AGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNAS 1322
Cdd:COG5295 402 STGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAA 481
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1323 FDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGGGPNTGAGF 1402
Cdd:COG5295 482 TSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNN 561
|
570 580
....*....|....*....|....*
gi 1622971899 1403 GGGPSTSAGFGSGATSLGACGFSYG 1427
Cdd:COG5295 562 TATGANSVALGAGSVASGANSVSVG 586
|
|
| COG4625 |
COG4625 |
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ... |
714-1252 |
8.89e-07 |
|
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];
Pssm-ID: 443664 [Multi-domain] Cd Length: 900 Bit Score: 53.63 E-value: 8.89e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 714 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 793
Cdd:COG4625 6 GGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGG 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 794 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 873
Cdd:COG4625 86 GGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGG 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 874 VFSSALSTSTGFGGTLSTSVCFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSI 953
Cdd:COG4625 166 GGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 245
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 954 CFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTST 1033
Cdd:COG4625 246 GGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 325
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1034 DFGGTLSTSVCFGGSPSTSAGFSGALNTNASfgcAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTT 1113
Cdd:COG4625 326 GGGGGGGGAGGGGGSGGAGAGGGGAGGGGAG---GGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGG 402
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1114 DFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGF 1193
Cdd:COG4625 403 GGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNN 482
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*....
gi 1622971899 1194 SFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTS 1252
Cdd:COG4625 483 TYTGTTTVNGGGNYTQSAGSTLAVEVDAANSDRLVVTGTATLNGGTVVVLAGGYAPGTT 541
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
978-1192 |
1.70e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 52.45 E-value: 1.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 978 ALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSG 1057
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1058 ALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTnpgfggafnTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGG 1137
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTT---------TSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTT 151
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1622971899 1138 AHSTSLCF-GGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTG 1192
Cdd:COG3469 152 TVSGTETAtGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGP 207
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1066-1263 |
1.71e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 52.36 E-value: 1.71e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1066 GCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFntsaGFGGALSTTTDFGGTPNNSIGFGAAPstsvsFGGAHSTSLCF 1145
Cdd:pfam15967 3 GFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTF 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1146 GGAPSTSlcfGSASNTNLCFGGPPSTSACFSG-----------ATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGLNTSAG 1214
Cdd:pfam15967 74 GTPASST---AATGPTGLTLGTPAATTAASTGfslgfnkpaasATPFSLPASSTSGGGLSLGSVLTSTAAQQGATGFTLN 150
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1622971899 1215 FGGGLGTSAGFSGDL---STSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSA 1263
Cdd:pfam15967 151 LGGTPATTTAVSTGLslgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTA 202
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
928-1122 |
5.63e-06 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 50.91 E-value: 5.63e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 928 TLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGG 1007
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1008 SPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSG 1087
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETAT 160
|
170 180 190
....*....|....*....|....*....|....*...
gi 1622971899 1088 VPSTNP---GFGGAFNTSAGFGGALSTTTDFGGTPNNS 1122
Cdd:COG3469 161 GGTTTTsttTTTTSASTTPSATTTATATTASGATTPSA 198
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1092-1326 |
8.12e-06 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 50.44 E-value: 8.12e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1092 NPGFGGAFNTSAGFGGALSTTTDFGGTPNNSIGFGAAPSTSVSFGGAHSTSLCFGGAPstslcFGSASNTNLCFGGPPST 1171
Cdd:pfam15967 5 SFGGGPGSTATAGGGFSFGAAAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASS 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1172 SAcfSGATSPSFGDGPSTSTgfsfgnglSTSAGFGGGLNTSAgfggglGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGT 1251
Cdd:pfam15967 80 TA--ATGPTGLTLGTPAATT--------AASTGFSLGFNKPA------ASATPFSLPASSTSGGGLSLGSVLTSTAAQQG 143
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1622971899 1252 STGFGGGLGTSAGFSGGLGTGAGFGGGLvtsDGFGGGLGTNASfGSTLGTGAGFSGGLSTSDGFGSRPNASFDRG 1326
Cdd:pfam15967 144 ATGFTLNLGGTPATTTAVSTGLSLGSTL---TSLGGSLFQNTN-STGLGQTTLGLTLLATSTAPVSAPAASEGLG 214
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1053-1266 |
1.96e-05 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 48.74 E-value: 1.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1053 AGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTS---AGFGGALSTTTDFGGTPNNSIGFGAAP 1129
Cdd:COG5651 155 AAASAAAVALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTGlnqVGIGGLNSGSGPIGLNSGPGNTGFAGT 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1130 STSVSFGGAHSTSLCFGGAPSTSLCFGSASNTNLCFGGPPSTSACFSGATSPSFGDGPSTSTGFSFGNGLSTSAGFGGGL 1209
Cdd:COG5651 235 GAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGG 314
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1622971899 1210 NTSAGFGGGLGTSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFS 1266
Cdd:COG5651 315 AAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSA 371
|
|
| Chi1 |
COG3469 |
Chitinase [Carbohydrate transport and metabolism]; |
908-1121 |
1.12e-04 |
|
Chitinase [Carbohydrate transport and metabolism];
Pssm-ID: 442692 [Multi-domain] Cd Length: 534 Bit Score: 46.67 E-value: 1.12e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 908 TLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGG 987
Cdd:COG3469 1 SSSVSTAASPTAGGASATAVTLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSGTGTTAASSTAATSSTTSTTA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 988 AMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVcfgGSPSTSAGFSGALNTNASFGC 1067
Cdd:COG3469 81 TATAAAAAATSTSATLVATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTS---GASATSSAGSTTTTTTVSGTE 157
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1622971899 1068 AISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAGFGGALSTTTDFGGTPNN 1121
Cdd:COG3469 158 TATGGTTTTSTTTTTTSASTTPSATTTATATTASGATTPSATTTATTTGPPTPG 211
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
723-927 |
1.19e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 46.58 E-value: 1.19e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 723 FSRGAGTRAGFSDGASISFNGAPSS----SGGPGI-TFGGAPSSSASfSNTASISFGGTLSTSSSFSSAASisfgGAPST 797
Cdd:pfam15967 4 FSFGGGPGSTATAGGGFSFGAAAASnpgsTGGFSFgTLGAAPAATAT-TTTATLGLGGGLFGQKPATGFTF----GTPAS 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 798 STSFSSEASISfGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSttagFSSVLSTSTS----------FGS 867
Cdd:pfam15967 79 STAATGPTGLT-LGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLS----LGSVLTSTAAqqgatgftlnLGG 153
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622971899 868 APTTNTVFSSAL---STSTGFGGTLSTSVcFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGG 927
Cdd:pfam15967 154 TPATTTAVSTGLslgSTLTSLGGSLFQNT-NSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
731-994 |
2.27e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 45.81 E-value: 2.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 731 AGFSDGASisfnGAPSSSGGPGITFGGAPSSSAsfSNTASISFGgtlstsssfssaasiSFGGAPSTSTSfSSEASISFG 810
Cdd:pfam15967 2 SGFSFGGG----PGSTATAGGGFSFGAAAASNP--GSTGGFSFG---------------TLGAAPAATAT-TTTATLGLG 59
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 811 GTPctsasfsGGVSSSFSGPLNTSAtfSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLS 890
Cdd:pfam15967 60 GGL-------FGQKPATGFTFGTPA--SSTAATGPTGLTLGTPAATTAASTGFSLGFNKPAASATPFSLPASSTSGGGLS 130
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 891 TSVCFGGSPSSSGSFGGTLStsicFGGSPCTSTgfggTLSTSVSFGgpsSTSANCGGTLSTSICFDGSPSTGAGfGGALN 970
Cdd:pfam15967 131 LGSVLTSTAAQQGATGFTLN----LGGTPATTT----AVSTGLSLG---STLTSLGGSLFQNTNSTGLGQTTLG-LTLLA 198
|
250 260
....*....|....*....|....*
gi 1622971899 971 TSASFGSALNTSAGFGGA-MSTSAD 994
Cdd:pfam15967 199 TSTAPVSAPAASEGLGGLdFSTSSE 223
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1196-1414 |
5.29e-04 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 44.11 E-value: 5.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1196 GNGLSTSAGFGGGLNTSAGFGgglgtSAGFSGDLSTSSGFDGGLGTSAGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGF 1275
Cdd:COG5651 178 GGLLGAQNAGSGNTSSNPGFA-----NLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAAAGAAAAAAAAAAAAG 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1276 GGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHS 1355
Cdd:COG5651 253 AGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGA 332
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1622971899 1356 GPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGfgggpnTGAGFGGGPSTSAGFGS 1414
Cdd:COG5651 333 AAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGG------GSAGAAAGAASGGGAAA 385
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
955-1147 |
9.26e-04 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 43.50 E-value: 9.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 955 FDGSPSTGAGFGGALNTSASFGSALNTSAG--FGGAMSTSADFGSTLSTSVCFGGSPgtsvsFGSALNTSAGFGGAVSTS 1032
Cdd:pfam15967 6 FGGGPGSTATAGGGFSFGAAAASNPGSTGGfsFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1033 TDFGGTLSTSVCFGGSPSTSAGFSGALNTNA------------------SFGCAISTSAGFSGAVGTSAGFSGVPSTNPG 1094
Cdd:pfam15967 81 AATGPTGLTLGTPAATTAASTGFSLGFNKPAasatpfslpasstsggglSLGSVLTSTAAQQGATGFTLNLGGTPATTTA 160
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*.
gi 1622971899 1095 FGGAFN---TSAGFGGALSTTTDFGGTPNNSIGfGAAPSTSVSFGGAHSTSLCFGG 1147
Cdd:pfam15967 161 VSTGLSlgsTLTSLGGSLFQNTNSTGLGQTTLG-LTLLATSTAPVSAPAASEGLGG 215
|
|
| Hia |
COG5295 |
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ... |
714-1104 |
1.29e-03 |
|
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];
Pssm-ID: 444098 [Multi-domain] Cd Length: 785 Bit Score: 43.22 E-value: 1.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 714 NADASTSVNFSRGAGTRAGFSDGASISFNGAPSSSGGPGITFGGAPSSSASFSNTASISFGGTLSTSSSFSSAASISFGG 793
Cdd:COG5295 200 AGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATTASTTAASG 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 794 APSTSTSFSSEASISFGGTPCTSASFSGGVSSSFSGPLNTSATFSGAASSGFGGTLSTTAGFSSVLSTSTSFGSAPTTNT 873
Cdd:COG5295 280 AAGTATAAAGGDAAAAGSASSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAATNDGTANGAGTSAAAD 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 874 VFSSALSTSTGFGGTLSTSV------CFGGSPSSSGSFGGTLSTSICFGGSPCTSTGFGGTLSTSVSFGGPSSTSANCGG 947
Cdd:COG5295 360 ATSGGGAGGGGAAATSSSGGsataagNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTSAVGASNGA 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 948 TLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSVCFGGSPGTSVSFGSALNTSAGFGG 1027
Cdd:COG5295 440 SGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAGSAAAGGAA 519
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622971899 1028 AVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAG 1104
Cdd:COG5295 520 NAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAENVAAG 596
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
1165-1388 |
2.00e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 42.19 E-value: 2.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1165 FGGPPSTSACFSGATSPSFGDGPSTSTGFSFGN-GLSTSAGFG-GGLNTSAGfggglgtSAGFSGDLSTSSGFDGGLGTS 1242
Cdd:COG5651 167 FTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANlGLTGLNQVGiGGLNSGSG-------PIGLNSGPGNTGFAGTGAAAG 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1243 AGFSGGPGTSTGFGGGLGTSAGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFGSRPNAS 1322
Cdd:COG5651 240 AAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAGAA 319
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1622971899 1323 FDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFcSGPSISGFSGGPST 1388
Cdd:COG5651 320 GATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGS-AGAAAGAASGGGAA 384
|
|
| dermokine |
cd21118 |
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ... |
1217-1420 |
3.79e-03 |
|
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.
Pssm-ID: 411053 [Multi-domain] Cd Length: 495 Bit Score: 41.52 E-value: 3.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1217 GGLGTSAGFSGDLSTSSGFDGGLGTSaGFSGGPGTSTGFGGGLGTsaGFSGGLGTGAGFGGGLVTSDGFGGGLGTNASFG 1296
Cdd:cd21118 125 GGHGAYGSQGGPGVQGHGIPGGTGGP-WASGGNYGTNSLGGSVGQ--GGNGGPLNYGTNSQGAVAQPGYGTVRGNNQNSG 201
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1297 STLGTGAGFSGGLSTSDGfGSRPNASFDRGLSTIIGFGSGSNTSTGFIGEPSTSTGFHSGPSSivGFSGGPSTGVGFCSG 1376
Cdd:cd21118 202 CTNPPPSGSHESFSNSGG-SSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSG--GSNGGSSGNSGSGSG 278
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1622971899 1377 PSISGFSGGPSTGAGFGGGPNTGAGFG---GGPSTSAGFGSGATSLG 1420
Cdd:cd21118 279 GSSSGGSNGWGGSSSSGGSGGSGGGNKpecNNPGNDVRMAGGGGSQG 325
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1191-1427 |
4.05e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 41.58 E-value: 4.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1191 TGFSFGNGLSTSAGFGGGLNTSAGFGGGLGTSAGFsgdlstssGFDGGLGTSAGFSGGPGTSTGFGGGL---GTSAGFSg 1267
Cdd:pfam15967 2 SGFSFGGGPGSTATAGGGFSFGAAAASNPGSTGGF--------SFGTLGAAPAATATTTTATLGLGGGLfgqKPATGFT- 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1268 glgtgagfggglvtsdgFGGGLGTNASFGSTLGTGAGFSGGLSTSDGFG---SRPNAS---FDRGLSTIIGFGSGSNTST 1341
Cdd:pfam15967 73 -----------------FGTPASSTAATGPTGLTLGTPAATTAASTGFSlgfNKPAASatpFSLPASSTSGGGLSLGSVL 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1342 GFIGEPSTSTGFHSGPSSIVGFSGGPSTGVGFcsGPSISGFSGGPSTGAGFGGGPNTGAGFGGGPSTSAGFGSGATSLGA 1421
Cdd:pfam15967 136 TSTAAQQGATGFTLNLGGTPATTTAVSTGLSL--GSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGL 213
|
....*.
gi 1622971899 1422 CGFSYG 1427
Cdd:pfam15967 214 GGLDFS 219
|
|
| PPE |
COG5651 |
PPE-repeat protein [Function unknown]; |
850-1082 |
4.14e-03 |
|
PPE-repeat protein [Function unknown];
Pssm-ID: 444372 [Multi-domain] Cd Length: 385 Bit Score: 41.42 E-value: 4.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 850 STTAGFSSVLSTSTSFGSAPTTNTVFSSALSTSTGFGGTLSTSvcfggsPSSSGSFGGTLSTSICFGGSPCTSTGFGGTl 929
Cdd:COG5651 162 VALTPFTQPPPTITNPGGLLGAQNAGSGNTSSNPGFANLGLTG------LNQVGIGGLNSGSGPIGLNSGPGNTGFAGT- 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 930 STSVSFGGPSSTSANCGGTLSTSICFDGSPSTGAGFGGALNTSASFGSALNTSAGFGGAMSTSADFGSTLSTSvcFGGSP 1009
Cdd:COG5651 235 GAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATG--LGLGA 312
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622971899 1010 GTSVSFGSALNTSAGFGGAVSTSTDFGGTLSTSVCFGGSPSTSAGFSGALNTNASFGCAISTSAGFSGAVGTS 1082
Cdd:COG5651 313 GGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAAA 385
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
1182-1394 |
5.10e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 41.19 E-value: 5.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1182 SFGDGPSTST----GFSFGNGLSTSAGFGGGLntsaGFGGGLGTSAGFSGDLSTSSGFDGGL---GTSAGFS-GGPGTST 1253
Cdd:pfam15967 5 SFGGGPGSTAtaggGFSFGAAAASNPGSTGGF----SFGTLGAAPAATATTTTATLGLGGGLfgqKPATGFTfGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1254 GFGGGLGTSAGFSGGLGTGAGFGgglvtSDGFGGGLGTNASFGSTLGTGAGfsGGLSTSDGFGSRPNASFDRGLSTIIGf 1333
Cdd:pfam15967 81 AATGPTGLTLGTPAATTAASTGF-----SLGFNKPAASATPFSLPASSTSG--GGLSLGSVLTSTAAQQGATGFTLNLG- 152
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1622971899 1334 GSGSNTSTGFIGEP--STSTGFHSGPSSIVGFSGGPSTGVGFCSGPSISGFSGGPSTGAGFGG 1394
Cdd:pfam15967 153 GTPATTTAVSTGLSlgSTLTSLGGSLFQNTNSTGLGQTTLGLTLLATSTAPVSAPAASEGLGG 215
|
|
| Nucleoporin_FG2 |
pfam15967 |
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ... |
985-1186 |
9.58e-03 |
|
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.
Pssm-ID: 435043 [Multi-domain] Cd Length: 586 Bit Score: 40.42 E-value: 9.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 985 FGGAMSTSADFGSTLSTSVCFGGSPGTS--VSFGSALNTSAGFGGAVSTSTDFGGTLstsvcFGGSPSTSAGFSGALNTN 1062
Cdd:pfam15967 6 FGGGPGSTATAGGGFSFGAAAASNPGSTggFSFGTLGAAPAATATTTTATLGLGGGL-----FGQKPATGFTFGTPASST 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622971899 1063 ASFGCAIST----SAGFSGAVGTSAGFSGVPSTNPGFGGAFNTSAG----FGGALSTTTDFGGTPNNSIGFGAAPSTSVs 1134
Cdd:pfam15967 81 AATGPTGLTlgtpAATTAASTGFSLGFNKPAASATPFSLPASSTSGgglsLGSVLTSTAAQQGATGFTLNLGGTPATTT- 159
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 1622971899 1135 fggAHSTSLCFGGAPST--SLCFGSASNTNL----CFGGPPSTSACFSGATSPSFGDG 1186
Cdd:pfam15967 160 ---AVSTGLSLGSTLTSlgGSLFQNTNSTGLgqttLGLTLLATSTAPVSAPAASEGLG 214
|
|
|