Conserved domains on  [gi|1914657870|ref|XP_036358708|]

uncharacterized protein LOC118763360 isoform X24 [Octopus sinensis]

Protein Classification

chitin binding peritrophin-A domain-containing protein; glycoside hydrolase family 18 protein (domain architecture ID 10649518)

chitin binding peritrophin-A domain-containing protein similar to Blomia tropicalis major allergen Blo t 12 and Caenorhabditis elegans chondroitin proteoglycan 1; glycoside hydrolase family 18 protein similar to chitinase, which catalyzes the random endo-hydrolysis of the 1,4-beta-linkages of N-acetylglucosamine in chitin and chitodextrins

List of domain hits

Name Accession Description Interval E-value
dermokine super family cl42387
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
421-520 1.02e-10

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


The actual alignment was detected with superfamily member cd21118:

Pssm-ID: 455732 [Multi-domain]  Cd Length: 495  Bit Score: 66.56  E-value: 1.02e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  421 NAKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATG 500
Cdd:cd21118    204 NPPPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSG 283
                           90       100
                   ....*....|....*....|..
gi 1914657870  501 GSQGTGG--ATGGSQGAGGATG 520
Cdd:cd21118    284 GSNGWGGssSSGGSGGSGGGNK 305
COG4625 super family cl34793
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
419-802 2.08e-07

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


The actual alignment was detected with superfamily member COG4625:

Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 56.33  E-value: 2.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  419 GINAKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGA 498
Cdd:COG4625     35 GGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGG 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  499 TGGSQGTGGATGGSQGAGGATGGSQATGSSQGTGGATGGSEGGSQIPGGIAGGSQGTGGEGGASGSQGKGGVTGGSQGTG 578
Cdd:COG4625    115 GGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNG 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  579 GATGGTGGTQGSQIIGGGAGGSQATGGGTGGVESTGTGTGGSQGTGGEGGVSGSQGTGGVAGGSQGTGGGAGGGQVTGGG 658
Cdd:COG4625    195 GGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGG 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  659 ASGNQGTGGGTGGNQETGGVTGGIQGSREGTGGSQGTGEVAGGSQGTSGGTGGNQATGGGTSGNQGTEDGTGSSQGTGGV 738
Cdd:COG4625    275 SGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGG 354
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1914657870  739 AGGKQGTGSGTGSNQGTGGVANGNQAKGGGKDGKQGSGGETGGDQGAGGGTGSNQGTGGVANGN 802
Cdd:COG4625    355 AGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGG 418
ChtBD2 smart00494
Chitin-binding domain type 2;
1780-1825 1.95e-06

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 46.28  E-value: 1.95e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1780 IPCQ--PNSYYPKFKSISQFYQCSHGLLFLMQCPDKTVWNEASIKCVY 1825
Cdd:smart00494    1 NECPgrGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ChtBD2 smart00494
Chitin-binding domain type 2;
103-150 6.47e-06

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 44.74  E-value: 6.47e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870   103 ECQPcVTGGFYSHPNDQGRYYQCVYGVLLPKYCQSGTIWYQHTRTCIF 150
Cdd:smart00494    2 ECPG-RGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ChtBD2 smart00494
Chitin-binding domain type 2;
1495-1542 1.52e-05

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 43.97  E-value: 1.52e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1495 DCEPcQQGGYYPIIGSLSEFFQCSHGQLVPMKCPARTIWNNKIIRCVY 1542
Cdd:smart00494    2 ECPG-RGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ChtBD2 smart00494
Chitin-binding domain type 2;
1413-1458 2.18e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.50  E-value: 2.18e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1413 EACEKGS--YYPKAESIAEFYQCSHGILFLMQCPESTIWHGESLRCIY 1458
Cdd:smart00494    1 NECPGRGdgLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ChtBD2 smart00494
Chitin-binding domain type 2;
258-297 2.63e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.50  E-value: 2.63e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1914657870   258 DTYYSKPGSVRQFYQCVHGWLFVRSCPTGTVWAGLLKECV 297
Cdd:smart00494    8 DGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCD 47
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
194-237 3.28e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 40.09  E-value: 3.28e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1914657870  194 CEN--DSYHSKTASLTHFYHCKNGWLYLMYCPSGTIWNSTLSACVY 237
Cdd:pfam01607    1 CAGkeDGYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDY 46
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1187-1229 2.86e-03

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 37.39  E-value: 2.86e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1914657870 1187 YYSKLGSNKQFYHCNYGVLYVLECPTQTVWNRRLGSCVYESNP 1229
Cdd:pfam01607    8 YYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNV 50
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
382-416 3.55e-03

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 37.39  E-value: 3.55e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1914657870  382 FYQCLSGWLFIMECPLNTVWDGESTKCIYDELASA 416
Cdd:pfam01607   18 YYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNVVD 52
ChtBD2 smart00494
Chitin-binding domain type 2;
1598-1642 8.10e-03

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 36.27  E-value: 8.10e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1914657870  1598 PCQKGS--YYPKYQSEAHFYQCSQGLLFLMNCPDNTIWHGKSIRCIY 1642
Cdd:smart00494    2 ECPGRGdgLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
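Each hit above carries an interval line (for example "1598-1642 8.10e-03") and a summary line beginning with "Pssm-ID:". The following is a minimal Python sketch for turning those two lines into numbers, assuming the plain-text layout shown on this page. The names HitSummary, parse_summary and parse_interval are hypothetical helpers introduced for illustration; they are not part of any NCBI tool or API.

```python
import re
from typing import NamedTuple, Tuple


class HitSummary(NamedTuple):
    """Numeric fields of a 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    cd_length: int
    bit_score: float
    e_value: float


# Assumes the field order seen in this listing:
# Pssm-ID, optional bracketed tag, Cd Length, Bit Score, E-value.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
    r"Bit Score:\s*([\d.]+)\s+E-value:\s*([\d.eE+-]+)"
)


def parse_summary(line: str) -> HitSummary:
    """Extract Pssm-ID, Cd Length, Bit Score and E-value from one summary line."""
    match = _SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return HitSummary(
        pssm_id=int(match.group(1)),
        cd_length=int(match.group(2)),
        bit_score=float(match.group(3)),
        e_value=float(match.group(4)),
    )


def parse_interval(line: str) -> Tuple[int, int, float]:
    """Split an 'start-end e-value' line into (start, end, e_value)."""
    span, e_value = line.split()
    start, end = span.split("-")
    return int(start), int(end), float(e_value)


if __name__ == "__main__":
    summary = parse_summary(
        "Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  "
        "Bit Score: 36.27  E-value: 8.10e-03"
    )
    start, end, e_value = parse_interval("1598-1642 8.10e-03")
    # Number of query residues covered by the hit (interval is inclusive).
    print(summary, end - start + 1)
```

The interval is treated as inclusive on both ends, which matches the residue numbering printed at the start and end of each alignment row.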
 
Name Accession Description Interval E-value
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
421-520 1.02e-10

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 66.56  E-value: 1.02e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  421 NAKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATG 500
Cdd:cd21118    204 NPPPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSG 283
                           90       100
                   ....*....|....*....|..
gi 1914657870  501 GSQGTGG--ATGGSQGAGGATG 520
Cdd:cd21118    284 GSNGWGGssSSGGSGGSGGGNK 305
Keratin_2_tail pfam16210
Keratin type II cytoskeletal 1 tail;
450-528 3.54e-10

Keratin type II cytoskeletal 1 tail;


Pssm-ID: 406591 [Multi-domain]  Cd Length: 135  Bit Score: 59.60  E-value: 3.54e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1914657870  450 GGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGGSQATGSS 528
Cdd:pfam16210   23 GSSRGGGGGGGGSYGSGGGSYGSGGGGGSGSGSYGSGGGSYGSGGGGGSGSGGGSSGGHRGGSGGGGGSSGGRSGGGSS 101
III PHA00370
attachment protein
397-511 1.30e-09

attachment protein


Pssm-ID: 164795 [Multi-domain]  Cd Length: 297  Bit Score: 61.47  E-value: 1.30e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  397 LNTVWDGESTK-------CIYDelasaLRGINAKPENQTGegGNGRETAIGGSQGTGGaTGGNQGTGGATGGSQGaGGAT 469
Cdd:PHA00370    35 FNNVWKGDEGGryanyegCEYE-----ATGVTVCQNDGTV--CNGSWKPTGSADKDGD-GGGTGEGGSDTGGDTG-GGNT 105
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1914657870  470 GGSQGvGGATGGSQGTGGVTGGSqgTGGATGGSQGTGGATGG 511
Cdd:PHA00370   106 GGGSG-GGDTGGSGGGGSDGGGS--EGGSTGKSLTKEGVGAG 144
TrbL COG3846
Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and ...
428-529 8.50e-08

Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443056 [Multi-domain]  Cd Length: 443  Bit Score: 56.87  E-value: 8.50e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  428 TGEGGNGRETAIGGsqGTGGATGGNQGTGGAT---GGSQGAGGATG----GSQGVG-GATGGSQGTGGVTGGSQGTGGAT 499
Cdd:COG3846    277 TGAAAGGAAVAAGA--AAAAAAGGAAAAGGAAaarGGASAAGGAKAayslGSAGSGsGAAGVAAGMGGVGRAGGSAAASP 354
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1914657870  500 GGSQGTGGATG-------GSQGAGGATGGSQATGSSQ 529
Cdd:COG3846    355 AGKAAFAQAAGfadsyraGSRAAWAATGGAAARGAGL 391
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
419-802 2.08e-07

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 56.33  E-value: 2.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  419 GINAKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGA 498
Cdd:COG4625     35 GGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGG 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  499 TGGSQGTGGATGGSQGAGGATGGSQATGSSQGTGGATGGSEGGSQIPGGIAGGSQGTGGEGGASGSQGKGGVTGGSQGTG 578
Cdd:COG4625    115 GGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNG 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  579 GATGGTGGTQGSQIIGGGAGGSQATGGGTGGVESTGTGTGGSQGTGGEGGVSGSQGTGGVAGGSQGTGGGAGGGQVTGGG 658
Cdd:COG4625    195 GGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGG 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  659 ASGNQGTGGGTGGNQETGGVTGGIQGSREGTGGSQGTGEVAGGSQGTSGGTGGNQATGGGTSGNQGTEDGTGSSQGTGGV 738
Cdd:COG4625    275 SGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGG 354
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1914657870  739 AGGKQGTGSGTGSNQGTGGVANGNQAKGGGKDGKQGSGGETGGDQGAGGGTGSNQGTGGVANGN 802
Cdd:COG4625    355 AGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGG 418
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
425-529 9.36e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.24  E-value: 9.36e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  425 ENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGAT----GGSQGTG---GVTGGSQGTGG 497
Cdd:NF033849   349 QSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTseglGASQGGSegwGSGDSVQSVSQ 428
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1914657870  498 ATGGSQGTGGATGGSQGAGGATGGSQATGSSQ 529
Cdd:NF033849   429 SYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQ 460
ChtBD2 smart00494
Chitin-binding domain type 2;
1780-1825 1.95e-06

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 46.28  E-value: 1.95e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1780 IPCQ--PNSYYPKFKSISQFYQCSHGLLFLMQCPDKTVWNEASIKCVY 1825
Cdd:smart00494    1 NECPgrGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
442-529 2.23e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.09  E-value: 2.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  442 SQGTGGATGGNQGTGGATGGSQGAGGATGGSQGV--GGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGAT 519
Cdd:NF033849   272 SQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQST 351
                           90
                   ....*....|
gi 1914657870  520 GGSQATGSSQ 529
Cdd:NF033849   352 SISHSESSSE 361
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
437-529 3.61e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 3.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  437 TAIGGSQGT--GGATGGNQGTGGATGGSQGAGGATGGSQG--VGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGS 512
Cdd:NF033849   317 TSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSesSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGG 396
                           90
                   ....*....|....*..
gi 1914657870  513 QGAGGATggSQATGSSQ 529
Cdd:NF033849   397 IAGGGVT--SEGLGASQ 411
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
428-531 5.56e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.54  E-value: 5.56e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  428 TGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGgaTGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQG--T 505
Cdd:NF033849   419 SGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADS--VSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGdsT 496
                           90       100
                   ....*....|....*....|....*.
gi 1914657870  506 GGATGGSQGAGGATGGSQATGSSQGT 531
Cdd:NF033849   497 GTSESVSQGDGRSTGRSESQGTSLGT 522
ChtBD2 smart00494
Chitin-binding domain type 2;
103-150 6.47e-06

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 44.74  E-value: 6.47e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870   103 ECQPcVTGGFYSHPNDQGRYYQCVYGVLLPKYCQSGTIWYQHTRTCIF 150
Cdd:smart00494    2 ECPG-RGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ChtBD2 smart00494
Chitin-binding domain type 2;
1495-1542 1.52e-05

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 43.97  E-value: 1.52e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1495 DCEPcQQGGYYPIIGSLSEFFQCSHGQLVPMKCPARTIWNNKIIRCVY 1542
Cdd:smart00494    2 ECPG-RGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1783-1825 3.42e-05

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 42.79  E-value: 3.42e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1914657870 1783 QPNSYYPKFKSISQFYQCSHGLLFLMQCPDKTVWNEASIKCVY 1825
Cdd:pfam01607    4 KEDGYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDY 46
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
437-529 9.17e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 47.69  E-value: 9.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  437 TAIGGSQGTGGATggNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAG 516
Cdd:NF033849   233 ANLGQSAGTGYGE--SVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQS 310
                           90
                   ....*....|...
gi 1914657870  517 GATGGSQATGSSQ 529
Cdd:NF033849   311 HGTTEGTSTTDSS 323
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
450-529 1.41e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.92  E-value: 1.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  450 GGNQGTGGatGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGGSQATGSSQ 529
Cdd:NF033849   232 AANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQ 309
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
111-151 1.97e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 40.86  E-value: 1.97e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1914657870  111 GFYSHPNDQGRYYQCVYGVLLPKYCQSGTIWYQHTRTCIFD 151
Cdd:pfam01607    7 GYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDYP 47
ChtBD2 smart00494
Chitin-binding domain type 2;
1413-1458 2.18e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.50  E-value: 2.18e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1413 EACEKGS--YYPKAESIAEFYQCSHGILFLMQCPESTIWHGESLRCIY 1458
Cdd:smart00494    1 NECPGRGdgLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1503-1547 2.42e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 40.47  E-value: 2.42e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1914657870 1503 GYYPIIGSLSEFFQCSHGQLVPMKCPARTIWNNKIIRCVYDRSQV 1547
Cdd:pfam01607    7 GYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNVV 51
ChtBD2 smart00494
Chitin-binding domain type 2;
258-297 2.63e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.50  E-value: 2.63e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1914657870   258 DTYYSKPGSVRQFYQCVHGWLFVRSCPTGTVWAGLLKECV 297
Cdd:smart00494    8 DGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCD 47
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
194-237 3.28e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 40.09  E-value: 3.28e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1914657870  194 CEN--DSYHSKTASLTHFYHCKNGWLYLMYCPSGTIWNSTLSACVY 237
Cdd:pfam01607    1 CAGkeDGYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDY 46
ChtBD2 smart00494
Chitin-binding domain type 2;
193-237 3.36e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.12  E-value: 3.36e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1914657870   193 PCEN--DSYHSKTASLTHFYHCKNGWLYLMYCPSGTIWNSTLSACVY 237
Cdd:smart00494    2 ECPGrgDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1412-1458 8.10e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 38.93  E-value: 8.10e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1914657870 1412 CEACEKGsYYPKAESIAEFYQCSHGILFLMQCPESTIWHGESLRCIY 1458
Cdd:pfam01607    1 CAGKEDG-YYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDY 46
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
256-297 8.60e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin binding proteins particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. Relevant references that describe proteins with this domain include. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 38.93  E-value: 8.60e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1914657870  256 LQDTYYSKPGSVRQFYQCVHGWLFVRSCPTGTVWAGLLKECV 297
Cdd:pfam01607    4 KEDGYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICD 45
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1187-1229 2.86e-03

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 37.39  E-value: 2.86e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1914657870 1187 YYSKLGSNKQFYHCNYGVLYVLECPTQTVWNRRLGSCVYESNP 1229
Cdd:pfam01607    8 YYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNV 50
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
382-416 3.55e-03

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 37.39  E-value: 3.55e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1914657870  382 FYQCLSGWLFIMECPLNTVWDGESTKCIYDELASA 416
Cdd:pfam01607   18 YYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNVVD 52
ChtBD2 smart00494
Chitin-binding domain type 2;
1185-1225 4.99e-03

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 36.65  E-value: 4.99e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1914657870  1185 YTYYSKLGSNKQFYHCNYGVLYVLECPTQTVWNRRLGSCVY 1225
Cdd:smart00494    8 DGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ChtBD2 smart00494
Chitin-binding domain type 2;
1598-1642 8.10e-03

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 36.27  E-value: 8.10e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1914657870  1598 PCQKGS--YYPKYQSEAHFYQCSQGLLFLMNCPDNTIWHGKSIRCIY 1642
Cdd:smart00494    2 ECPGRGdgLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
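Each hit above carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch for readers who want to tabulate these values from saved page text, the following Python snippet parses one such line. The helper name `parse_stat_line` and the returned field names are illustrative assumptions, not part of any NCBI tool or API; the pattern is inferred only from the lines shown on this page.

```python
import re

# Hypothetical helper: the regular expression below is inferred from the
# statistics lines on this page, not from an official format specification.
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<note>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the numeric fields of one statistics line, or None if no match."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("note") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


# Example input copied from the smart00494 hit at interval 1598-1642.
hit = parse_stat_line(
    "Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 36.27  E-value: 8.10e-03"
)
```

Sorting the parsed records by `e_value` would reproduce the ordering used in the hit list, under the assumption that every hit has a statistics line in this shape.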
 
Name Accession Description Interval E-value
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
421-520 1.02e-10

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 66.56  E-value: 1.02e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  421 NAKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATG 500
Cdd:cd21118    204 NPPPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSGNSGSGSGGSSSG 283
                           90       100
                   ....*....|....*....|..
gi 1914657870  501 GSQGTGG--ATGGSQGAGGATG 520
Cdd:cd21118    284 GSNGWGGssSSGGSGGSGGGNK 305
Keratin_2_tail pfam16210
Keratin type II cytoskeletal 1 tail;
450-528 3.54e-10

Keratin type II cytoskeletal 1 tail;


Pssm-ID: 406591 [Multi-domain]  Cd Length: 135  Bit Score: 59.60  E-value: 3.54e-10
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1914657870  450 GGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGGSQATGSS 528
Cdd:pfam16210   23 GSSRGGGGGGGGSYGSGGGSYGSGGGGGSGSGSYGSGGGSYGSGGGGGSGSGGGSSGGHRGGSGGGGGSSGGRSGGGSS 101
Keratin_2_tail pfam16210
Keratin type II cytoskeletal 1 tail;
429-528 4.37e-10

Keratin type II cytoskeletal 1 tail;


Pssm-ID: 406591 [Multi-domain]  Cd Length: 135  Bit Score: 59.60  E-value: 4.37e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  429 GEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGAtGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGA 508
Cdd:pfam16210   29 GGGGGGSYGSGGGSYGSGGGGGSGSGSYGSGGGSYGSGGG-GGSGSGGGSSGGHRGGSGGGGGSSGGRSGGGSSGGSFGS 107
                           90       100
                   ....*....|....*....|
gi 1914657870  509 TGGSqgaGGATGGSQATGSS 528
Cdd:pfam16210  108 SGGR---GSSSGGVKSSGGS 124
III PHA00370
attachment protein
397-511 1.30e-09

attachment protein


Pssm-ID: 164795 [Multi-domain]  Cd Length: 297  Bit Score: 61.47  E-value: 1.30e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  397 LNTVWDGESTK-------CIYDelasaLRGINAKPENQTGegGNGRETAIGGSQGTGGaTGGNQGTGGATGGSQGaGGAT 469
Cdd:PHA00370    35 FNNVWKGDEGGryanyegCEYE-----ATGVTVCQNDGTV--CNGSWKPTGSADKDGD-GGGTGEGGSDTGGDTG-GGNT 105
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1914657870  470 GGSQGvGGATGGSQGTGGVTGGSqgTGGATGGSQGTGGATGG 511
Cdd:PHA00370   106 GGGSG-GGDTGGSGGGGSDGGGS--EGGSTGKSLTKEGVGAG 144
Keratin_2_tail pfam16210
Keratin type II cytoskeletal 1 tail;
440-528 1.84e-09

Keratin type II cytoskeletal 1 tail;


Pssm-ID: 406591 [Multi-domain]  Cd Length: 135  Bit Score: 57.68  E-value: 1.84e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  440 GGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGatgGSQGTGGvTGGSQGTGGATGGSQGTGGATGGSQGAGGAT 519
Cdd:pfam16210   23 GSSRGGGGGGGGSYGSGGGSYGSGGGGGSGSGSYGSGG---GSYGSGG-GGGSGSGGGSSGGHRGGSGGGGGSSGGRSGG 98

                   ....*....
gi 1914657870  520 GGSQATGSS 528
Cdd:pfam16210   99 GSSGGSFGS 107
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
401-528 6.75e-09

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 60.40  E-value: 6.75e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  401 WDGESTKCIYDELASALRGINAKPENQTGE---GGNGRETAIGGSQGTGGATGG-NQGT-----------GGATGGSQ-- 463
Cdd:cd21118    121 WQGSGGHGAYGSQGGPGVQGHGIPGGTGGPwasGGNYGTNSLGGSVGQGGNGGPlNYGTnsqgavaqpgyGTVRGNNQns 200
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1914657870  464 ------GAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGGSQATGSS 528
Cdd:cd21118    201 gctnppPSGSHESFSNSGGSSSSGSSGSQGSHGSNGQGSSGSSGGQGNGGNNGSSSSNSGNSGGSNGGSSG 271
TrbL COG3846
Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and ...
428-529 8.50e-08

Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443056 [Multi-domain]  Cd Length: 443  Bit Score: 56.87  E-value: 8.50e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  428 TGEGGNGRETAIGGsqGTGGATGGNQGTGGAT---GGSQGAGGATG----GSQGVG-GATGGSQGTGGVTGGSQGTGGAT 499
Cdd:COG3846    277 TGAAAGGAAVAAGA--AAAAAAGGAAAAGGAAaarGGASAAGGAKAayslGSAGSGsGAAGVAAGMGGVGRAGGSAAASP 354
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1914657870  500 GGSQGTGGATG-------GSQGAGGATGGSQATGSSQ 529
Cdd:COG3846    355 AGKAAFAQAAGfadsyraGSRAAWAATGGAAARGAGL 391
TrbL COG3846
Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and ...
440-526 1.79e-07

Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443056 [Multi-domain]  Cd Length: 443  Bit Score: 55.71  E-value: 1.79e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  440 GGSQGTGGATGGnqgTGGATGGSQGAGGATGGsqGVGGATGGSQGTGGVTGGSQGTGGATG----GSQGTGgaTGGSQGA 515
Cdd:COG3846    265 GGPQLGAGAAAG---TGAAAGGAAVAAGAAAA--AAAGGAAAAGGAAAARGGASAAGGAKAayslGSAGSG--SGAAGVA 337
                           90
                   ....*....|.
gi 1914657870  516 GGATGGSQATG 526
Cdd:COG3846    338 AGMGGVGRAGG 348
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
419-802 2.08e-07

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 56.33  E-value: 2.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  419 GINAKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGA 498
Cdd:COG4625     35 GGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGG 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  499 TGGSQGTGGATGGSQGAGGATGGSQATGSSQGTGGATGGSEGGSQIPGGIAGGSQGTGGEGGASGSQGKGGVTGGSQGTG 578
Cdd:COG4625    115 GGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNG 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  579 GATGGTGGTQGSQIIGGGAGGSQATGGGTGGVESTGTGTGGSQGTGGEGGVSGSQGTGGVAGGSQGTGGGAGGGQVTGGG 658
Cdd:COG4625    195 GGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGG 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  659 ASGNQGTGGGTGGNQETGGVTGGIQGSREGTGGSQGTGEVAGGSQGTSGGTGGNQATGGGTSGNQGTEDGTGSSQGTGGV 738
Cdd:COG4625    275 SGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGG 354
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1914657870  739 AGGKQGTGSGTGSNQGTGGVANGNQAKGGGKDGKQGSGGETGGDQGAGGGTGSNQGTGGVANGN 802
Cdd:COG4625    355 AGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGG 418
PRK13875 PRK13875
conjugal transfer protein TrbL; Provisional
440-528 4.20e-07

conjugal transfer protein TrbL; Provisional


Pssm-ID: 237537  Cd Length: 440  Bit Score: 54.53  E-value: 4.20e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  440 GGSQ-GTGGATGgnqgTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGA-TGGSQGTGGATGGSQGAGG 517
Cdd:PRK13875   265 GAPQlGAGAAVG----TGLAAGGAAVAAAAGAGLAAGGGAAAAGGAAAAARGGAAAAGGAsSAYSAGAAGGSGAAGVAAG 340
                           90
                   ....*....|.
gi 1914657870  518 ATGGSQATGSS 528
Cdd:PRK13875   341 LGGVARAGASA 351
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
425-529 9.36e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 54.24  E-value: 9.36e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  425 ENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGAT----GGSQGTG---GVTGGSQGTGG 497
Cdd:NF033849   349 QSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTseglGASQGGSegwGSGDSVQSVSQ 428
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1914657870  498 ATGGSQGTGGATGGSQGAGGATGGSQATGSSQ 529
Cdd:NF033849   429 SYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQ 460
ChtBD2 smart00494
Chitin-binding domain type 2;
1780-1825 1.95e-06

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 46.28  E-value: 1.95e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1780 IPCQ--PNSYYPKFKSISQFYQCSHGLLFLMQCPDKTVWNEASIKCVY 1825
Cdd:smart00494    1 NECPgrGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
442-529 2.23e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.09  E-value: 2.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  442 SQGTGGATGGNQGTGGATGGSQGAGGATGGSQGV--GGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGAT 519
Cdd:NF033849   272 SQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTseSQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQST 351
                           90
                   ....*....|
gi 1914657870  520 GGSQATGSSQ 529
Cdd:NF033849   352 SISHSESSSE 361
III PHA00370
attachment protein
461-528 3.61e-06

attachment protein


Pssm-ID: 164795 [Multi-domain]  Cd Length: 297  Bit Score: 51.07  E-value: 3.61e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1914657870  461 GSQGAGGATGGSQGVGGATGGSQGTGGVTGGSqgTGGATGGSQGtggatGGSQGaGGATGGSQATGSS 528
Cdd:PHA00370    78 GSADKDGDGGGTGEGGSDTGGDTGGGNTGGGS--GGGDTGGSGG-----GGSDG-GGSEGGSTGKSLT 137
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
437-529 3.61e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 3.61e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  437 TAIGGSQGT--GGATGGNQGTGGATGGSQGAGGATGGSQG--VGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGS 512
Cdd:NF033849   317 TSTTDSSSHsqSSSYNVSSGTGVSSSHSDGTSQSTSISHSesSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGG 396
                           90
                   ....*....|....*..
gi 1914657870  513 QGAGGATggSQATGSSQ 529
Cdd:NF033849   397 IAGGGVT--SEGLGASQ 411
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
428-531 5.56e-06

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 51.54  E-value: 5.56e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  428 TGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGgaTGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQG--T 505
Cdd:NF033849   419 SGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADS--VSQGTSWSEGTGTSQGQSVGTSESWSTSQSETDSVGdsT 496
                           90       100
                   ....*....|....*....|....*.
gi 1914657870  506 GGATGGSQGAGGATGGSQATGSSQGT 531
Cdd:NF033849   497 GTSESVSQGDGRSTGRSESQGTSLGT 522
ChtBD2 smart00494
Chitin-binding domain type 2;
103-150 6.47e-06

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 44.74  E-value: 6.47e-06
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870   103 ECQPcVTGGFYSHPNDQGRYYQCVYGVLLPKYCQSGTIWYQHTRTCIF 150
Cdd:smart00494    2 ECPG-RGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
TrbL COG3846
Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and ...
451-527 1.38e-05

Type IV secretory pathway, TrbL components [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443056 [Multi-domain]  Cd Length: 443  Bit Score: 49.94  E-value: 1.38e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  451 GNQGTGGA----TGGSQ-GAGGATGGSQGVGGATGGsqGTGGVTGGsqgtGGATGGSQGTGGATGGSQGAGGATG----G 521
Cdd:COG3846    252 GIFGPGIAaglvSGGPQlGAGAAAGTGAAAGGAAVA--AGAAAAAA----AGGAAAAGGAAAARGGASAAGGAKAayslG 325

                   ....*.
gi 1914657870  522 SQATGS 527
Cdd:COG3846    326 SAGSGS 331
ChtBD2 smart00494
Chitin-binding domain type 2;
1495-1542 1.52e-05

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 43.97  E-value: 1.52e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1495 DCEPcQQGGYYPIIGSLSEFFQCSHGQLVPMKCPARTIWNNKIIRCVY 1542
Cdd:smart00494    2 ECPG-RGDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
PTZ00473 PTZ00473
Plasmodium Vir superfamily; Provisional
440-527 1.79e-05

Plasmodium Vir superfamily; Provisional


Pssm-ID: 240430 [Multi-domain]  Cd Length: 420  Bit Score: 49.46  E-value: 1.79e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  440 GGSQGTGGATGGNQ--GTGGATGGSQGAGGAT-GGSQGVGGAT-GGSQGTGGVT-GGSQGTGGATGGSQGTGGA--TGGS 512
Cdd:PTZ00473   312 HDSRGPYNANYGGQfnSRSGRTGSSESIRGFTyDSSTTYGGSSyGTSQTDSTSTyGSRSTFDSSTGGGSQSGGGstYGGS 391
                           90
                   ....*....|....*
gi 1914657870  513 QGAGGATGGSQATGS 527
Cdd:PTZ00473   392 STFDGSSRGSSDSFG 406
PTZ00473 PTZ00473
Plasmodium Vir superfamily; Provisional
429-525 2.80e-05

Plasmodium Vir superfamily; Provisional


Pssm-ID: 240430 [Multi-domain]  Cd Length: 420  Bit Score: 48.69  E-value: 2.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  429 GEGGNGRETAIGGSQGTGGATGGNQGTGGAT--GGSQGAGGATGGSQGVGGAT-GGSQGTGGVTGGSQGTGGA--TGGSQ 503
Cdd:PTZ00473   313 DSRGPYNANYGGQFNSRSGRTGSSESIRGFTydSSTTYGGSSYGTSQTDSTSTyGSRSTFDSSTGGGSQSGGGstYGGSS 392
                           90       100
                   ....*....|....*....|...
gi 1914657870  504 GTGGATGGS-QGAGGATGGSQAT 525
Cdd:PTZ00473   393 TFDGSSRGSsDSFGVSYFGPQQT 415
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1783-1825 3.42e-05

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 42.79  E-value: 3.42e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1914657870 1783 QPNSYYPKFKSISQFYQCSHGLLFLMQCPDKTVWNEASIKCVY 1825
Cdd:pfam01607    4 KEDGYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDY 46
PRK13875 PRK13875
conjugal transfer protein TrbL; Provisional
428-529 5.27e-05

conjugal transfer protein TrbL; Provisional


Pssm-ID: 237537  Cd Length: 440  Bit Score: 47.98  E-value: 5.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  428 TGEGGNGRETAIGGSQGTGGATGGNQGTGGAT-------GGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATG 500
Cdd:PRK13875   290 AGAGLAAGGGAAAAGGAAAAARGGAAAAGGASsaysagaAGGSGAAGVAAGLGGVARAGASAAASPLRRAASRAAESMKS 369
                           90       100
                   ....*....|....*....|....*....
gi 1914657870  501 GSQGTGGATGGSQGAGGATGGSQATGSSQ 529
Cdd:PRK13875   370 SFRAGARSTGGGAGGAAAAAAAGAAAAGP 398
PRK13875 PRK13875
conjugal transfer protein TrbL; Provisional
440-528 5.60e-05

conjugal transfer protein TrbL; Provisional


Pssm-ID: 237537  Cd Length: 440  Bit Score: 47.60  E-value: 5.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  440 GGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTG----GSQGTGGATGGSQGTGG-ATGGSQG 514
Cdd:PRK13875   272 GAAVGTGLAAGGAAVAAAAGAGLAAGGGAAAAGGAAAAARGGAAAAGGASSaysaGAAGGSGAAGVAAGLGGvARAGASA 351
                           90
                   ....*....|....*....
gi 1914657870  515 AG-----GATGGSQATGSS 528
Cdd:PRK13875   352 AAsplrrAASRAAESMKSS 370
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
431-528 6.47e-05

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 47.74  E-value: 6.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  431 GGNGRETAIGGSQGTGGATGGNQG-TGGATGGSQGAGGA-----TGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQG 504
Cdd:pfam15967    8 GGPGSTATAGGGFSFGAAAASNPGsTGGFSFGTLGAAPAatattTTATLGLGGGLFGQKPATGFTFGTPASSTAATGPTG 87
                           90       100
                   ....*....|....*....|....*..
gi 1914657870  505 -TGGATGGSQGA--GGATGGSQATGSS 528
Cdd:pfam15967   88 lTLGTPAATTAAstGFSLGFNKPAASA 114
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
437-529 9.17e-05

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 47.69  E-value: 9.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  437 TAIGGSQGTGGATggNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAG 516
Cdd:NF033849   233 ANLGQSAGTGYGE--SVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQS 310
                           90
                   ....*....|...
gi 1914657870  517 GATGGSQATGSSQ 529
Cdd:NF033849   311 HGTTEGTSTTDSS 323
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
432-802 1.13e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 47.08  E-value: 1.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  432 GNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGG 511
Cdd:COG4625      1 GGGGGGGGGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  512 SQGAGGATGGSQATGSSqgtggatggseggSQIPGGIAGGSQGTGGEGGASGSQGKGGVTGGSQGTGGATGGTGGTQGSQ 591
Cdd:COG4625     81 GGGGGGGGGTGGVGGGG-------------GGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGG 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  592 IIGGGAGGSQATGGGTGGVESTGTGTGGSQGTGGEGGVSGSQGTGGVAGGSQGTGGGAGGGQVTGGGASGNQGTGGGTGG 671
Cdd:COG4625    148 GAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGG 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  672 NQETGGVTGGIQGSREGTGGSQGTGEVAGGSQGTSGGTGGNQATGGGTSGNQGTEDGTGSSQGTGGVAGGKQGTGSGTGS 751
Cdd:COG4625    228 GGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGG 307
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1914657870  752 NQGTGGVANGNQAKGGGKDGKQGSGGETGGDQGAGGGTGSNQGTGGVANGN 802
Cdd:COG4625    308 GGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGG 358
PRK13875 PRK13875
conjugal transfer protein TrbL; Provisional
440-526 1.35e-04

conjugal transfer protein TrbL; Provisional


Pssm-ID: 237537  Cd Length: 440  Bit Score: 46.44  E-value: 1.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  440 GGSQGTGGATGGNQ-GTGGATGGSQGAGG-ATGGSQGVGGATGGsqgtgGVTGGSQGTGGATGGSQGTGGA-TGGSQGAG 516
Cdd:PRK13875   255 GPGIANGLVSGAPQlGAGAAVGTGLAAGGaAVAAAAGAGLAAGG-----GAAAAGGAAAAARGGAAAAGGAsSAYSAGAA 329
                           90
                   ....*....|
gi 1914657870  517 GATGGSQATG 526
Cdd:PRK13875   330 GGSGAAGVAA 339
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
450-529 1.41e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.92  E-value: 1.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  450 GGNQGTGGatGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGGSQATGSSQ 529
Cdd:NF033849   232 AANLGQSA--GTGYGESVGHSTSQGQSHSVGTSESHSVGTSQSQSHTTGHGSTRGWSHTQSTSESESTGQSSSVGTSESQ 309
PTZ00146 PTZ00146
fibrillarin; Provisional
470-521 1.65e-04

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 45.88  E-value: 1.65e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1914657870  470 GGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGG 521
Cdd:PTZ00146     4 GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPG 55
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
428-913 1.66e-04

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 46.70  E-value: 1.66e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  428 TGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGG 507
Cdd:COG4625      8 GGGGGGGGGTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGG 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  508 ATGGSQGAGGATGGSQATGSSQGTGGATGGSEGGSQIPGGIAGGSQGTGGEGGASGSQGKGGVTGGSQGTGGATGGTGGT 587
Cdd:COG4625     88 GGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGG 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  588 QGSQIIGGGAGGSQATGGGTGGVESTGTGTGGSQGTGGEGGVSGSQGTGGVAGGSQGTGGGAGGGQVTGGGASGNQGTGG 667
Cdd:COG4625    168 GGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 247
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  668 GTGGNQETGGVTGGIQGSREGTGGSQGTGevagGSQGTSGGTGGNQATGGGTSGNQGTEDGTGSSQGTGGVAGGKQGTGS 747
Cdd:COG4625    248 AGGGGGGGGGNGGGGGAGGGGGGGGGGSG----GGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG 323
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  748 GTGSNQGTGGVANGNQAKGGGKDGKQGSGGETGGDQGAGGGTGSNQGTGGVANGNQVSGGGAGGNQGTIGGIGGNQGTGG 827
Cdd:COG4625    324 GGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGG 403
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  828 GTGSIQGTGGIASGSQSTGGGTGGNQGTGGGIGSNQGTGGGISGNQGAGGGTGSSQGTGGVASGSQSTGGGTGGKQGTGG 907
Cdd:COG4625    404 GAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNT 483

                   ....*.
gi 1914657870  908 GTGGNQ 913
Cdd:COG4625    484 YTGTTT 489
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
111-151 1.97e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 40.86  E-value: 1.97e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1914657870  111 GFYSHPNDQGRYYQCVYGVLLPKYCQSGTIWYQHTRTCIFD 151
Cdd:pfam01607    7 GYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDYP 47
ChtBD2 smart00494
Chitin-binding domain type 2;
1413-1458 2.18e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.50  E-value: 2.18e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*...
gi 1914657870  1413 EACEKGS--YYPKAESIAEFYQCSHGILFLMQCPESTIWHGESLRCIY 1458
Cdd:smart00494    1 NECPGRGdgLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
421-549 2.36e-04

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 46.30  E-value: 2.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  421 NAKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQG---TGGVTGGSQGTGG 497
Cdd:COG5295      1 SASNAGAVAAGTALTTVASGASTTASGSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAaatAGAGSGGTSATAA 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1914657870  498 ATGGSQGTGGATGGSQGAGGATGGSQATGSSQGTGGATGGSEGGSQIPGGIA 549
Cdd:COG5295     81 SSVASGGASAATAASTGTGNTAGTAATVAGAASSGSATNAGASAGASAAAAA 132
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1503-1547 2.42e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 40.47  E-value: 2.42e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1914657870 1503 GYYPIIGSLSEFFQCSHGQLVPMKCPARTIWNNKIIRCVYDRSQV 1547
Cdd:pfam01607    7 GYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNVV 51
ChtBD2 smart00494
Chitin-binding domain type 2;
258-297 2.63e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.50  E-value: 2.63e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|
gi 1914657870   258 DTYYSKPGSVRQFYQCVHGWLFVRSCPTGTVWAGLLKECV 297
Cdd:smart00494    8 DGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCD 47
PTZ00146 PTZ00146
fibrillarin; Provisional
460-511 2.81e-04

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 45.11  E-value: 2.81e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1914657870  460 GGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGG 511
Cdd:PTZ00146     4 GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPG 55
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
194-237 3.28e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 40.09  E-value: 3.28e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 1914657870  194 CEN--DSYHSKTASLTHFYHCKNGWLYLMYCPSGTIWNSTLSACVY 237
Cdd:pfam01607    1 CAGkeDGYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDY 46
ChtBD2 smart00494
Chitin-binding domain type 2;
193-237 3.36e-04

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 40.12  E-value: 3.36e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1914657870   193 PCEN--DSYHSKTASLTHFYHCKNGWLYLMYCPSGTIWNSTLSACVY 237
Cdd:smart00494    2 ECPGrgDGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
425-528 3.43e-04

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 45.38  E-value: 3.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  425 ENQTGEGGNGRETAIGgsQGTGGATGGNQGTGG-ATGGSQGagGATGGSQGVGGATGGSQGTGGV-----TGGSQGTGGA 498
Cdd:cd21118     96 GNAGNEIGRQAEDIIR--HGVDAVHNSWQGSGGhGAYGSQG--GPGVQGHGIPGGTGGPWASGGNygtnsLGGSVGQGGN 171
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1914657870  499 TG------GSQGTGGATG-------------------GSQGAGGATGGSQATGSS 528
Cdd:cd21118    172 GGplnygtNSQGAVAQPGygtvrgnnqnsgctnpppsGSHESFSNSGGSSSSGSS 226
PTZ00146 PTZ00146
fibrillarin; Provisional
450-501 3.57e-04

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 44.72  E-value: 3.57e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1914657870  450 GGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGG 501
Cdd:PTZ00146     4 GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPG 55
PTZ00146 PTZ00146
fibrillarin; Provisional
440-495 4.61e-04

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 44.34  E-value: 4.61e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1914657870  440 GGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGvtGGSQGT 495
Cdd:PTZ00146     4 GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGG--GGPGKV 57
Nucleoporin_FG pfam13634
Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in ...
452-529 6.46e-04

Nucleoporin FG repeat region; This family includes a number of FG repeats that are found in nucleoporin proteins. This family includes the yeast nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.


Pssm-ID: 463941 [Multi-domain]  Cd Length: 90  Bit Score: 40.68  E-value: 6.46e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1914657870  452 NQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQgTGGATGGSQGTGGATGGSQGAGGATGGSQATGSSQ 529
Cdd:pfam13634    7 TSTSGGLFGNTSTTAASGGGLFGAASTATATTSGGGLFGNSS-SNAPSGGLFGATNTTTQTATGGGLFGNNAATTTST 83
PRK07772 PRK07772
single-stranded DNA-binding protein; Provisional
411-502 7.68e-04

single-stranded DNA-binding protein; Provisional


Pssm-ID: 236092 [Multi-domain]  Cd Length: 186  Bit Score: 42.33  E-value: 7.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  411 DELASALRGINAKPENQTGEGGNGretaiggsqgtggatGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTG 490
Cdd:PRK07772   104 DEIGPSLRYATAKVTRASRGGGGG---------------GGGGGFGGGGGGSGGGGGGGGGGGAPGGGGAQASAPADDPW 168
                           90
                   ....*....|..
gi 1914657870  491 GSQGTGGATGGS 502
Cdd:PRK07772   169 SSAPASGGFGGG 180
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1412-1458 8.10e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 38.93  E-value: 8.10e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1914657870 1412 CEACEKGsYYPKAESIAEFYQCSHGILFLMQCPESTIWHGESLRCIY 1458
Cdd:pfam01607    1 CAGKEDG-YYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDY 46
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
256-297 8.60e-04

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 38.93  E-value: 8.60e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1914657870  256 LQDTYYSKPGSVRQFYQCVHGWLFVRSCPTGTVWAGLLKECV 297
Cdd:pfam01607    4 KEDGYYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICD 45
PTZ00146 PTZ00146
fibrillarin; Provisional
470-524 1.30e-03

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 42.80  E-value: 1.30e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1914657870  470 GGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGGSQA 524
Cdd:PTZ00146     1 GMGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPG 55
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
422-528 2.42e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 42.73  E-value: 2.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  422 AKPENQTGEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQG-TGGVTGGSQG--TGGA 498
Cdd:pfam15967   25 AAASNPGSTGGFSFGTLGAAPAATATTTTATLGLGGGLFGQKPATGFTFGTPASSTAATGPTGlTLGTPAATTAasTGFS 104
                           90       100       110
                   ....*....|....*....|....*....|
gi 1914657870  499 TGGSQGTGGATGGSQGAGGATGGSQATGSS 528
Cdd:pfam15967  105 LGFNKPAASATPFSLPASSTSGGGLSLGSV 134
PRK07772 PRK07772
single-stranded DNA-binding protein; Provisional
460-520 2.67e-03

single-stranded DNA-binding protein; Provisional


Pssm-ID: 236092 [Multi-domain]  Cd Length: 186  Bit Score: 40.79  E-value: 2.67e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1914657870  460 GGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQgtGGATGGSQGAGGATG 520
Cdd:PRK07772   124 GGGGGGGGGFGGGGGGSGGGGGGGGGGGAPGGGGAQASAPADDP--WSSAPASGGFGGGDD 182
Gly_rich pfam12810
Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. ...
427-522 2.80e-03

Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. The proteins are composed of several glycine-rich motifs interspersed through the sequence. Although many proteins have been annotated by similarity to this family, the biased composition of the sequences means these annotations are unlikely to be functionally relevant.


Pssm-ID: 403882 [Multi-domain]  Cd Length: 257  Bit Score: 41.49  E-value: 2.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  427 QTGEGGNGRETAIGGSQGtGGATGGNQGTGGATGGSQ------------------GAGGATGGSQGVGGATGGSQGTGGV 488
Cdd:pfam12810   55 GKGEYNNSTNMNPGGFNG-GGNYKGSSGDGSGGGGGAtdirfdenslksriivagGGGGSGEGDDGSGGYGGGLTGGGGG 133
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1914657870  489 TGGSQGTGGAT---GGSQGTGGatGGSQGAGGATGGS 522
Cdd:pfam12810  134 SGCYEGSYGATqtsGGIGGYGI--NGSFGQGGNGRNS 168
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
1187-1229 2.86e-03

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 37.39  E-value: 2.86e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 1914657870 1187 YYSKLGSNKQFYHCNYGVLYVLECPTQTVWNRRLGSCVYESNP 1229
Cdd:pfam01607    8 YYADPGDCSKYYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNV 50
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
454-524 2.96e-03

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 42.63  E-value: 2.96e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1914657870  454 GTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATGGSQGAGGATGGSQA 524
Cdd:COG3468      1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAG 71
CBM_14 pfam01607
Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is ...
382-416 3.55e-03

Chitin binding Peritrophin-A domain; This domain is called the Peritrophin-A domain and is found in chitin-binding proteins, particularly peritrophic matrix proteins of insects and animal chitinases. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains.


Pssm-ID: 426342 [Multi-domain]  Cd Length: 53  Bit Score: 37.39  E-value: 3.55e-03
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 1914657870  382 FYQCLSGWLFIMECPLNTVWDGESTKCIYDELASA 416
Cdd:pfam01607   18 YYVCSNGEAVEFTCPNGLVFDPTLGICDYPDNVVD 52
PRK07772 PRK07772
single-stranded DNA-binding protein; Provisional
450-510 3.97e-03

single-stranded DNA-binding protein; Provisional


Pssm-ID: 236092 [Multi-domain]  Cd Length: 186  Bit Score: 40.40  E-value: 3.97e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1914657870  450 GGNQGTGGATGGsqGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGATG 510
Cdd:PRK07772   124 GGGGGGGGGFGG--GGGGSGGGGGGGGGGGAPGGGGAQASAPADDPWSSAPASGGFGGGDD 182
ChtBD2 smart00494
Chitin-binding domain type 2;
1185-1225 4.99e-03

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 36.65  E-value: 4.99e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1914657870  1185 YTYYSKLGSNKQFYHCNYGVLYVLECPTQTVWNRRLGSCVY 1225
Cdd:smart00494    8 DGLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
PTZ00146 PTZ00146
fibrillarin; Provisional
431-484 5.03e-03

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 40.87  E-value: 5.03e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1914657870  431 GGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQG 484
Cdd:PTZ00146     2 MGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPG 55
Gly_rich pfam12810
Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. ...
431-512 5.04e-03

Glycine rich protein; This family of proteins is greatly expanded in Trichomonas vaginalis. The proteins are composed of several glycine-rich motifs interspersed through the sequence. Although many proteins have been annotated by similarity to this family, the biased composition of the sequences means these annotations are unlikely to be functionally relevant.


Pssm-ID: 403882 [Multi-domain]  Cd Length: 257  Bit Score: 40.72  E-value: 5.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1914657870  431 GGNGRETAIGGSQGtGGATGGNQGTGGA---TGGSQGAGG----ATGGSQGVGGATGGSQGTGGVTGGSqGTGGATGGSQ 503
Cdd:pfam12810  111 GGSGEGDDGSGGYG-GGLTGGGGGSGCYegsYGATQTSGGiggyGINGSFGQGGNGRNSGGGGGGGGGG-GYYGGFGGGS 188

                   ....*....
gi 1914657870  504 GTGGATGGS 512
Cdd:pfam12810  189 YGGGGGGGS 197
PTZ00146 PTZ00146
fibrillarin; Provisional
454-508 5.35e-03

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 40.87  E-value: 5.35e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1914657870  454 GTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGVTGGSQGTGGATGGSQGTGGA 508
Cdd:PTZ00146     1 GMGGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPG 55
PTZ00146 PTZ00146
fibrillarin; Provisional
446-498 5.69e-03

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 40.87  E-value: 5.69e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1914657870  446 GGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGGSQGTGGvTGGSQGTGGA 498
Cdd:PTZ00146     4 GGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGG-RGGGGGGGPG 55
PTZ00146 PTZ00146
fibrillarin; Provisional
429-481 6.32e-03

fibrillarin; Provisional


Pssm-ID: 240291  Cd Length: 293  Bit Score: 40.87  E-value: 6.32e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1914657870  429 GEGGNGRETAIGGSQGTGGATGGNQGTGGATGGSQGAGGATGGSQGVGGATGG 481
Cdd:PTZ00146     3 GGGFGGGRGGGRGGGGGGGRGGGGRGGGRGGGRGRGRGGGGGGRGGGGGGGPG 55
ChtBD2 smart00494
Chitin-binding domain type 2;
1598-1642 8.10e-03

Chitin-binding domain type 2;


Pssm-ID: 214696 [Multi-domain]  Cd Length: 49  Bit Score: 36.27  E-value: 8.10e-03
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1914657870  1598 PCQKGS--YYPKYQSEAHFYQCSQGLLFLMNCPDNTIWHGKSIRCIY 1642
Cdd:smart00494    2 ECPGRGdgLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW 48
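
Each hit above pairs a query row (`gi 1914657870`) with a domain-model row (`Cdd:...`) of equal width. As a rough way to compare hits, the sketch below counts identical residues over the aligned columns of one such pair. It assumes that `-` marks a gap and that lowercase letters mark residues outside the aligned block; that reading of the display, and the function name `aligned_identity`, are assumptions made for illustration and are not part of the CD-Search output.

```python
def aligned_identity(query_row, subject_row):
    """Count identical residues over aligned columns of one alignment pair.

    Columns containing a gap ('-') or a lowercase residue in either row are
    skipped (assumed here to be unaligned positions).
    Returns (identical_columns, aligned_columns).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same width")

    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the last ChtBD2 (smart00494) hit, residues 1598-1642.
query = "PCQKGS--YYPKYQSEAHFYQCSQGLLFLMNCPDNTIWHGKSIRCIY"
subject = "ECPGRGdgLYPHPTDCSKYYQCSNGRPIVGSCPAGLVFNPATQTCDW"
identical, aligned = aligned_identity(query, subject)
print(f"{identical}/{aligned} aligned columns identical")
```

Raw identity ignores conservative substitutions and composition bias, so for the glycine-rich repeat region (roughly residues 420-913) it should not be read as evidence of homology.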
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
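
Every hit in the list carries a summary line of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, and the search parameters give an E-value threshold of 0.01. The following minimal sketch parses those summary lines and keeps hits at or below a threshold. The regular expression is inferred from the lines shown on this page only, and the helper names `parse_hit_line` and `hits_below_threshold` are hypothetical; whether CD-Search treats the threshold as inclusive is also an assumption.

```python
import re

# Pattern inferred from the summary lines on this page, e.g.
# "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.92  E-value: 1.41e-04"
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return the fields of one hit summary line as a dict, or None if no match."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def hits_below_threshold(lines, threshold=0.01):
    """Parse all summary lines and keep hits with E-value <= threshold."""
    hits = (parse_hit_line(line) for line in lines)
    return [h for h in hits if h is not None and h["e_value"] <= threshold]


sample = [
    "Pssm-ID: 237537  Cd Length: 440  Bit Score: 46.44  E-value: 1.35e-04",
    "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 46.92  E-value: 1.41e-04",
]
for hit in hits_below_threshold(sample):
    print(hit)
```

Passing a stricter value, for example `threshold=1e-3`, would drop the weaker hits near the end of the list while keeping the parsing logic unchanged.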
