|
Name |
Accession |
Description |
Interval |
E-value |
| MJ1470 super family |
cl34978 |
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type ... |
352-460 |
4.84e-40 |
|
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only]; The actual alignment was detected with superfamily member COG5306:
Pssm-ID: 444105 [Multi-domain] Cd Length: 529 Bit Score: 156.99 E-value: 4.84e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 352 PGWDYRKAITI-TNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQTATIWVKV-NI 429
Cdd:COG5306 26 PDWSYRKPITIdTTAIGGDLTDYPVLVRLHTGNFDFSSAKEDGSDIRFVAGDDKTPLKYWIEKFDPLNEMALVWVKVpSI 105
|
90 100 110
....*....|....*....|....*....|.
gi 505403041 430 PASTSKTIYMYYGNSSATSMSNGDSTFVFFD 460
Cdd:COG5306 106 PAGAGTTIWLYYGNPKAPSASDGKGTFDFFD 136
|
|
| MJ1470 super family |
cl34978 |
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type ... |
757-860 |
5.66e-36 |
|
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only]; The actual alignment was detected with superfamily member COG5306:
Pssm-ID: 444105 [Multi-domain] Cd Length: 529 Bit Score: 144.66 E-value: 5.66e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 757 PGWDYRKAITI-TNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQTATIWVKV-NI 834
Cdd:COG5306 26 PDWSYRKPITIdTTAIGGDLTDYPVLVRLHTGNFDFSSAKEDGSDIRFVAGDDKTPLKYWIEKFDPLNEMALVWVKVpSI 105
|
90 100
....*....|....*....|....*.
gi 505403041 835 PASTSKTIYMYYGNSSATSMSNPEKT 860
Cdd:COG5306 106 PAGAGTTIWLYYGNPKAPSASDGKGT 131
|
|
| PKD_4 |
pfam18911 |
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins. |
259-344 |
2.12e-22 |
|
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins. :
Pssm-ID: 436824 [Multi-domain] Cd Length: 85 Bit Score: 92.72 E-value: 2.12e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 259 NNPPVAAFTyTPATPLVGDIITFNASSSYDPDGDkITDYIWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDERGGVNST 338
Cdd:pfam18911 1 NAAPVADAG-GDRIVAEGETVTFDASASDDPDGD-ILSYRWDFGDGTTATGANVSHTYAAPGTYTVTLTVTDDSGASNST 78
|
....*..
gi 505403041 339 -TVAIEV 344
Cdd:pfam18911 79 aTDTVTV 85
|
|
| CARDB |
pfam07705 |
CARDB; Cell adhesion related domain found in bacteria. |
151-250 |
2.84e-17 |
|
CARDB; Cell adhesion related domain found in bacteria. :
Pssm-ID: 400172 [Multi-domain] Cd Length: 101 Bit Score: 78.47 E-value: 2.84e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 151 DLTISLKVSNANPEVNEEITIYADVMNVGSEDiNESFTVRFYYDSIEIYNEIINGLTSQSVEHISFSYTPTSTGQHVIKG 230
Cdd:pfam07705 3 DLIVQSISPPSEAYVGEENTITVTVKNQGTAA-AGAFNVALYVDGTSVGTITVPGLAAGESTTVSFSWTPPTEGSYTLTV 81
|
90 100
....*....|....*....|
gi 505403041 231 VVDADGAIVEDNENNNVSSK 250
Cdd:pfam07705 82 VVDPDNTVAESNETNNELTK 101
|
|
| PKD |
pfam00801 |
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ... |
670-738 |
2.43e-16 |
|
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold. :
Pssm-ID: 395646 [Multi-domain] Cd Length: 70 Bit Score: 74.73 E-value: 2.43e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 670 TPSTPQVNEEITFNASSSTpsepgGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGR 738
Cdd:pfam00801 4 SGTVVAAGQPVTFTATLAD-----GSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGS 67
|
|
| PKD |
pfam00801 |
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ... |
1083-1151 |
2.43e-16 |
|
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold. :
Pssm-ID: 395646 [Multi-domain] Cd Length: 70 Bit Score: 74.73 E-value: 2.43e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 1083 TPSTPQVNEEITFNASSSTpsepgGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGR 1151
Cdd:pfam00801 4 SGTVVAAGQPVTFTATLAD-----GSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGS 67
|
|
| COG3430 |
COG3430 |
Archaeal flagellin (archaellin), FlaG/FlaF family [Cell motility]; |
5-78 |
1.71e-11 |
|
Archaeal flagellin (archaellin), FlaG/FlaF family [Cell motility]; :
Pssm-ID: 442656 Cd Length: 91 Bit Score: 61.96 E-value: 1.71e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 5 MKFSGDDRAVSELISIILMIAITVgafsVIAVSIYSFL--------QTPPSKHADFQAEKIGDELVIYHTGGEELSGDDI 76
Cdd:COG3430 4 KKGMSDERAVSPVIGVILMVAITV----ILAAVIGVFVfglgddvsEPAPQASLSVEFVGDGDSVTITHEGGDPVNVDDL 79
|
..
gi 505403041 77 II 78
Cdd:COG3430 80 KV 81
|
|
| COG3291 |
COG3291 |
Uncharacterized conserved protein, PKD repeat domain [Function unknown]; |
1080-1386 |
1.17e-08 |
|
Uncharacterized conserved protein, PKD repeat domain [Function unknown]; :
Pssm-ID: 442520 [Multi-domain] Cd Length: 333 Bit Score: 58.53 E-value: 1.17e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1080 FTFTPSTPQVNEEITFNASSStpsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRQDSVTKQV 1159
Cdd:COG3291 1 FTATPTSGCAPLTVQFTDTSS------GNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGCSDTTTKTI 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1160 EVSETLFDPGFAYEDVNGNLMYDPGVDVQILASEIQDGVYDAGSNGLVIPPSVGDITASSIYFKGRDVVVSVDLTASKGV 1239
Cdd:COG3291 73 TVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDT 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1240 EIIGSDSVDITGVSVSSTNYNRDVVIQAGKILANGTDITAHGEVFLKATNIYISDSTIDTSSEYNMKISIDATNYVFANN 1319
Cdd:COG3291 153 ATVTTSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTD 232
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 505403041 1320 ATLKSQAKIDLGGNSLSGDGMSIDNSKAYDMKVNIVFLEDISLNNANIVSQGVVTINAGTQLTASDI 1386
Cdd:COG3291 233 VTLTGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNV 299
|
|
| PHA01755 super family |
cl39126 |
hypothetical protein |
359-635 |
6.68e-04 |
|
hypothetical protein; The actual alignment was detected with superfamily member PHA01755:
Pssm-ID: 222834 Cd Length: 562 Bit Score: 44.21 E-value: 6.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 359 AITITN-QNSFSLTDYQIKIELNSSNFdfTKANSDGSDIRF-TESDGSTFLNYWIESWDPSNQTATIWVK--VNIPASTS 434
Cdd:PHA01755 227 TITITNsQPDPTPSPFQQLLILNLSNI--ISSPSQLLNLQFcLDSQCSTPLYAWIESYNSNLSTVYIWINlpISIPANSS 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 435 KTIYMYYGNSSA---TSM-SNGDSTFVFFDD--------FEGTSLNTTKWATNTDTYQVEN--------GAIRLWGSWND 494
Cdd:PHA01755 305 ITIYMFVRNSIQypyTGMrPDLTSTYAQYDNgknvfliyFNGNEPLSNFNQEGNTIQQISTfgplgntiNAIYLSGYENN 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 495 GAYLNTRDSFSGSFVVegrwrlsTTSKDVDLAVVF----AEYSNSYMWESTSITCTYDSQST---SRPYYQKDLNVKGTh 567
Cdd:PHA01755 385 VGFVYTGKSETNQPVI-------SEASSQRMPNQTgglgAYNGTAGIADSTNTAFINDIGVTmgeDTSYFSQYYYVNGG- 456
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 505403041 568 vdwgpeiESSDWQKFRIIFTQSYINYWdswsaensakpslEYSGSTFSTFYLGIAADSDSTSrYGYID 635
Cdd:PHA01755 457 -------ETGGSNYQGSAVSQWVYAWV-------------QYQGSSASSWFGCIAPQLYSSP-GGYCG 503
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MJ1470 |
COG5306 |
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type ... |
352-460 |
4.84e-40 |
|
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only];
Pssm-ID: 444105 [Multi-domain] Cd Length: 529 Bit Score: 156.99 E-value: 4.84e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 352 PGWDYRKAITI-TNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQTATIWVKV-NI 429
Cdd:COG5306 26 PDWSYRKPITIdTTAIGGDLTDYPVLVRLHTGNFDFSSAKEDGSDIRFVAGDDKTPLKYWIEKFDPLNEMALVWVKVpSI 105
|
90 100 110
....*....|....*....|....*....|.
gi 505403041 430 PASTSKTIYMYYGNSSATSMSNGDSTFVFFD 460
Cdd:COG5306 106 PAGAGTTIWLYYGNPKAPSASDGKGTFDFFD 136
|
|
| MJ1470 |
COG5306 |
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type ... |
757-860 |
5.66e-36 |
|
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only];
Pssm-ID: 444105 [Multi-domain] Cd Length: 529 Bit Score: 144.66 E-value: 5.66e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 757 PGWDYRKAITI-TNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQTATIWVKV-NI 834
Cdd:COG5306 26 PDWSYRKPITIdTTAIGGDLTDYPVLVRLHTGNFDFSSAKEDGSDIRFVAGDDKTPLKYWIEKFDPLNEMALVWVKVpSI 105
|
90 100
....*....|....*....|....*.
gi 505403041 835 PASTSKTIYMYYGNSSATSMSNPEKT 860
Cdd:COG5306 106 PAGAGTTIWLYYGNPKAPSASDGKGT 131
|
|
| DUF2341 |
pfam10102 |
Domain of unknown function (DUF2341); Members of this family are found in various bacterial ... |
392-475 |
4.33e-33 |
|
Domain of unknown function (DUF2341); Members of this family are found in various bacterial proteins, including MotA/TolQ/ExbB proton channels and other transport proteins. The exact function of this set of domains has not, as yet, been determined.
Pssm-ID: 431055 [Multi-domain] Cd Length: 84 Bit Score: 123.24 E-value: 4.33e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 392 DGSDIRFTESDGSTfLNYWIESWDPSNQTATIWVKV-NIPASTSKTIYMYYGNSSATSMSNGDSTFVFFDDFEGTSLNTT 470
Cdd:pfam10102 1 DGSDIRFTDSDGTT-LPYWIEPWDPTTGKALIWVKVpSIPANGNGTIYIYYGNPTATSTSNGDATFEFFDDFSGTALDTT 79
|
....*
gi 505403041 471 KWATN 475
Cdd:pfam10102 80 KWTVV 84
|
|
| DUF2341 |
pfam10102 |
Domain of unknown function (DUF2341); Members of this family are found in various bacterial ... |
797-870 |
8.88e-24 |
|
Domain of unknown function (DUF2341); Members of this family are found in various bacterial proteins, including MotA/TolQ/ExbB proton channels and other transport proteins. The exact function of this set of domains has not, as yet, been determined.
Pssm-ID: 431055 [Multi-domain] Cd Length: 84 Bit Score: 96.66 E-value: 8.88e-24
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 505403041 797 DGSDIRFTESDGSTfLNYWIESWDPSNQTATIWVKV-NIPASTSKTIYMYYGNSSATSMSNPEKTMFLYENFESD 870
Cdd:pfam10102 1 DGSDIRFTDSDGTT-LPYWIEPWDPTTGKALIWVKVpSIPANGNGTIYIYYGNPTATSTSNGDATFEFFDDFSGT 74
|
|
| PKD_4 |
pfam18911 |
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins. |
259-344 |
2.12e-22 |
|
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
Pssm-ID: 436824 [Multi-domain] Cd Length: 85 Bit Score: 92.72 E-value: 2.12e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 259 NNPPVAAFTyTPATPLVGDIITFNASSSYDPDGDkITDYIWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDERGGVNST 338
Cdd:pfam18911 1 NAAPVADAG-GDRIVAEGETVTFDASASDDPDGD-ILSYRWDFGDGTTATGANVSHTYAAPGTYTVTLTVTDDSGASNST 78
|
....*..
gi 505403041 339 -TVAIEV 344
Cdd:pfam18911 79 aTDTVTV 85
|
|
| CARDB |
pfam07705 |
CARDB; Cell adhesion related domain found in bacteria. |
151-250 |
2.84e-17 |
|
CARDB; Cell adhesion related domain found in bacteria.
Pssm-ID: 400172 [Multi-domain] Cd Length: 101 Bit Score: 78.47 E-value: 2.84e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 151 DLTISLKVSNANPEVNEEITIYADVMNVGSEDiNESFTVRFYYDSIEIYNEIINGLTSQSVEHISFSYTPTSTGQHVIKG 230
Cdd:pfam07705 3 DLIVQSISPPSEAYVGEENTITVTVKNQGTAA-AGAFNVALYVDGTSVGTITVPGLAAGESTTVSFSWTPPTEGSYTLTV 81
|
90 100
....*....|....*....|
gi 505403041 231 VVDADGAIVEDNENNNVSSK 250
Cdd:pfam07705 82 VVDPDNTVAESNETNNELTK 101
|
|
| PKD |
pfam00801 |
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ... |
670-738 |
2.43e-16 |
|
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.
Pssm-ID: 395646 [Multi-domain] Cd Length: 70 Bit Score: 74.73 E-value: 2.43e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 670 TPSTPQVNEEITFNASSSTpsepgGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGR 738
Cdd:pfam00801 4 SGTVVAAGQPVTFTATLAD-----GSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGS 67
|
|
| PKD |
pfam00801 |
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ... |
1083-1151 |
2.43e-16 |
|
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.
Pssm-ID: 395646 [Multi-domain] Cd Length: 70 Bit Score: 74.73 E-value: 2.43e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 1083 TPSTPQVNEEITFNASSSTpsepgGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGR 1151
Cdd:pfam00801 4 SGTVVAAGQPVTFTATLAD-----GSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGS 67
|
|
| PKD |
smart00089 |
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ... |
665-748 |
3.30e-16 |
|
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Pssm-ID: 214510 [Multi-domain] Cd Length: 79 Bit Score: 74.79 E-value: 3.30e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 665 ADFTFTPSTPQVNEEITFNASSSTPsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRqDSVTK 744
Cdd:smart00089 2 ADVSASPTVGVAGESVTFTATSSDD----GSIVSYTWDFGDGTS--STGPTVTHTYTKPGTYTVTLTVTNAVGS-ASATV 74
|
....
gi 505403041 745 QVEV 748
Cdd:smart00089 75 TVVV 78
|
|
| PKD |
smart00089 |
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ... |
1078-1161 |
3.30e-16 |
|
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Pssm-ID: 214510 [Multi-domain] Cd Length: 79 Bit Score: 74.79 E-value: 3.30e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1078 ADFTFTPSTPQVNEEITFNASSSTPsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRqDSVTK 1157
Cdd:smart00089 2 ADVSASPTVGVAGESVTFTATSSDD----GSIVSYTWDFGDGTS--STGPTVTHTYTKPGTYTVTLTVTNAVGS-ASATV 74
|
....
gi 505403041 1158 QVEV 1161
Cdd:smart00089 75 TVVV 78
|
|
| PKD |
cd00146 |
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ... |
663-748 |
5.45e-16 |
|
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Pssm-ID: 238084 [Multi-domain] Cd Length: 81 Bit Score: 74.07 E-value: 5.45e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 663 PTADFTFTPsTPQVNEEITFNASSStpsePGGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGRQDSV 742
Cdd:cd00146 1 PTASVSAPP-VAELGASVTFSASDS----SGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
|
....*.
gi 505403041 743 TKQVEV 748
Cdd:cd00146 76 TTTVVV 81
|
|
| PKD |
cd00146 |
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ... |
1076-1161 |
5.45e-16 |
|
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Pssm-ID: 238084 [Multi-domain] Cd Length: 81 Bit Score: 74.07 E-value: 5.45e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1076 PTADFTFTPsTPQVNEEITFNASSStpsePGGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGRQDSV 1155
Cdd:cd00146 1 PTASVSAPP-VAELGASVTFSASDS----SGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
|
....*.
gi 505403041 1156 TKQVEV 1161
Cdd:cd00146 76 TTTVVV 81
|
|
| PKD |
smart00089 |
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ... |
263-344 |
8.33e-16 |
|
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
Pssm-ID: 214510 [Multi-domain] Cd Length: 79 Bit Score: 73.64 E-value: 8.33e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 263 VAAFTYTPATPLVGDIITFNASSSYDPDgdkITDYIWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDErGGVNSTTVAI 342
Cdd:smart00089 1 VADVSASPTVGVAGESVTFTATSSDDGS---IVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNA-VGSASATVTV 76
|
..
gi 505403041 343 EV 344
Cdd:smart00089 77 VV 78
|
|
| PKD |
cd00146 |
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ... |
262-344 |
1.40e-12 |
|
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Pssm-ID: 238084 [Multi-domain] Cd Length: 81 Bit Score: 64.44 E-value: 1.40e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 262 PVAAFTYTPATPLVGDIiTFNASSSYDPDgdkITDYIWDFGDG--DTATGVVTTHSYSSPGTYDVTLTVYDERGGVNSTT 339
Cdd:cd00146 1 PTASVSAPPVAELGASV-TFSASDSSGGS---IVSYKWDFGDGevSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKT 76
|
....*
gi 505403041 340 VAIEV 344
Cdd:cd00146 77 TTVVV 81
|
|
| COG3430 |
COG3430 |
Archaeal flagellin (archaellin), FlaG/FlaF family [Cell motility]; |
5-78 |
1.71e-11 |
|
Archaeal flagellin (archaellin), FlaG/FlaF family [Cell motility];
Pssm-ID: 442656 Cd Length: 91 Bit Score: 61.96 E-value: 1.71e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 5 MKFSGDDRAVSELISIILMIAITVgafsVIAVSIYSFL--------QTPPSKHADFQAEKIGDELVIYHTGGEELSGDDI 76
Cdd:COG3430 4 KKGMSDERAVSPVIGVILMVAITV----ILAAVIGVFVfglgddvsEPAPQASLSVEFVGDGDSVTITHEGGDPVNVDDL 79
|
..
gi 505403041 77 II 78
Cdd:COG3430 80 KV 81
|
|
| COG1572 |
COG1572 |
Serine protease, subtilase family [Posttranslational modification, protein turnover, ... |
151-282 |
1.73e-10 |
|
Serine protease, subtilase family [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 441180 [Multi-domain] Cd Length: 459 Bit Score: 65.37 E-value: 1.73e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 151 DLTISLKVSNANPEVNEEITIYADVMNVGSEDInESFTVRFYYDSIEIYNEIINGLTSQSVEHISFSYT-PTSTGQHVIK 229
Cdd:COG1572 246 DLTVTSVTAPSTVVEGDTITVSATVKNQGTAAA-GATTVAFYLSGDPVGTASVGALAAGASYTVTVTITlPANAGTYYLL 324
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 505403041 230 GVVDADGAIVEDNENNNVSSKTISVSEPANNPPVAAFTYTPATPLVGDIITFN 282
Cdd:COG1572 325 AVVDPDNQVAESNETNNVASSAITVVGPPPPDLVVTSVSAPSTATAGSSVTVS 377
|
|
| Pilin_N |
pfam07790 |
Archaeal Type IV pilin, N-terminal; This entry represents the N-terminal domain of archaeal ... |
12-78 |
3.27e-09 |
|
Archaeal Type IV pilin, N-terminal; This entry represents the N-terminal domain of archaeal pilins, which play important roles in surface adhesion and twitching motility. This domain contains a conserved N-terminal hydrophobic motif.
Pssm-ID: 400235 Cd Length: 78 Bit Score: 54.89 E-value: 3.27e-09
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 505403041 12 RAVSELISIILMIAITVGAFSVIAVSIYSFLQTPPSK-HADFQAEKI---GDELVIYHTGGEELSGDDIII 78
Cdd:pfam07790 1 DAVSPVIGVVLMLAITVILAAVIAVFVFGLASPPEKApQASIQVKYDssaDTGVTFEHKGGDPIDTKDLKI 71
|
|
| COG3291 |
COG3291 |
Uncharacterized conserved protein, PKD repeat domain [Function unknown]; |
1080-1386 |
1.17e-08 |
|
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
Pssm-ID: 442520 [Multi-domain] Cd Length: 333 Bit Score: 58.53 E-value: 1.17e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1080 FTFTPSTPQVNEEITFNASSStpsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRQDSVTKQV 1159
Cdd:COG3291 1 FTATPTSGCAPLTVQFTDTSS------GNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGCSDTTTKTI 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1160 EVSETLFDPGFAYEDVNGNLMYDPGVDVQILASEIQDGVYDAGSNGLVIPPSVGDITASSIYFKGRDVVVSVDLTASKGV 1239
Cdd:COG3291 73 TVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDT 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1240 EIIGSDSVDITGVSVSSTNYNRDVVIQAGKILANGTDITAHGEVFLKATNIYISDSTIDTSSEYNMKISIDATNYVFANN 1319
Cdd:COG3291 153 ATVTTSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTD 232
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 505403041 1320 ATLKSQAKIDLGGNSLSGDGMSIDNSKAYDMKVNIVFLEDISLNNANIVSQGVVTINAGTQLTASDI 1386
Cdd:COG3291 233 VTLTGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNV 299
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
224-750 |
3.36e-08 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 58.94 E-value: 3.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 224 GQHVIKGVVDADGAIVEDNENNNVSSKTISVSEPANNPPVAAFTYTPATplVGDIIT------FNASSSYDPdgdkiTDY 297
Cdd:TIGR00864 1389 GAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGS--HGNNLElgqpylFSAFGRARN-----ASY 1461
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 298 IWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDERGGvNSTTVAIEVIESIcelpgwdyrKAITItnqNSfSLTDyqikI 377
Cdd:TIGR00864 1462 LWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGK-NEATLNVAVKARV---------RGLTI---NA-SLTN----V 1523
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 378 ELNSSnFDFTKANSDGSDIRFTesdgstflnyWI--ESWDPSNQTATIWVKVNiPASTSKTIYMYYGNSSATSmsngDST 455
Cdd:TIGR00864 1524 PLNGS-VHFEAHLDAGDDVRFS----------WIlcDHCTPIFGGNTIFYTFR-SVGTFNIIVTAENDVGAAQ----ASI 1587
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 456 FVFFddfegtslnttkwatntdtyQVENGAIRLWGSWNDGAylntrdsfsGSFVVEGRWRLSTTSKDVDLAVVFAEysns 535
Cdd:TIGR00864 1588 FLFV--------------------LQEIEGLQILGETAEGG---------GGGVQELDGCYFETNHTVQFHAGFKD---- 1634
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 536 ymwestsitctydsqstsrpyyqkdlnvkGTHVDWGpeiessdwqkfriiftqsyinyWDSWSAENSAKPSLEYSGSTFS 615
Cdd:TIGR00864 1635 -----------------------------GTNLSFS----------------------WNAILDNEPDGPAFAGSGKGAK 1663
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 616 TfylgiaadsdSTSRYGYIDyIFMRKyvenepSVVISATETDCTV----PSPTADFTFTPSTPQVNEEITFNASSStpse 691
Cdd:TIGR00864 1664 L----------NPLEAGPCD-IFLQA------ANLLGQATADCTIdflePAGNLMLAASDNPAAVNALINLSAELA---- 1722
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 692 pGGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGRQDSvTKQVEVLE 750
Cdd:TIGR00864 1723 -EGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELGSANA-SEEVDVQE 1779
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
1076-1231 |
3.47e-05 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 48.93 E-value: 3.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1076 PTADFTFTPSTPQVNEEITFNASSsTPSePGGsiTNYHWDFGDGNVIDTTS-PTITHTYSSANTYSVTLTVTDSLgrqDS 1154
Cdd:TIGR00864 1088 PRVAIGTEDGLLLAGKPADFEAHP-LPS-PGG--IHYEWDFGDGSALLQGRqPAAAHTFAKRGPFHVCLEVNNTI---SG 1160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1155 VTKQVEVsetlfdpgFAYEDVNGnLMYDPGVDVQILASEIQDGVYDAGSNgLVIPPSVGD--------ITASSIYFKGRD 1226
Cdd:TIGR00864 1161 AAACADM--------FAFEEIEG-LSADMSLATELGAATTVRAALQSGDN-ITWTFDMGDgkslsgpeATVEHKYAKAGN 1230
|
....*
gi 505403041 1227 VVVSV 1231
Cdd:TIGR00864 1231 CTVNI 1235
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
663-736 |
2.94e-04 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 45.84 E-value: 2.94e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 505403041 663 PTADFTFTPSTPQVNEEITFNASSsTPSePGGsiTNYHWDFGDGNVIDTTS-PTITHTYSSANTYSVTLTVTDSL 736
Cdd:TIGR00864 1088 PRVAIGTEDGLLLAGKPADFEAHP-LPS-PGG--IHYEWDFGDGSALLQGRqPAAAHTFAKRGPFHVCLEVNNTI 1158
|
|
| PHA01755 |
PHA01755 |
hypothetical protein |
359-635 |
6.68e-04 |
|
hypothetical protein
Pssm-ID: 222834 Cd Length: 562 Bit Score: 44.21 E-value: 6.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 359 AITITN-QNSFSLTDYQIKIELNSSNFdfTKANSDGSDIRF-TESDGSTFLNYWIESWDPSNQTATIWVK--VNIPASTS 434
Cdd:PHA01755 227 TITITNsQPDPTPSPFQQLLILNLSNI--ISSPSQLLNLQFcLDSQCSTPLYAWIESYNSNLSTVYIWINlpISIPANSS 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 435 KTIYMYYGNSSA---TSM-SNGDSTFVFFDD--------FEGTSLNTTKWATNTDTYQVEN--------GAIRLWGSWND 494
Cdd:PHA01755 305 ITIYMFVRNSIQypyTGMrPDLTSTYAQYDNgknvfliyFNGNEPLSNFNQEGNTIQQISTfgplgntiNAIYLSGYENN 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 495 GAYLNTRDSFSGSFVVegrwrlsTTSKDVDLAVVF----AEYSNSYMWESTSITCTYDSQST---SRPYYQKDLNVKGTh 567
Cdd:PHA01755 385 VGFVYTGKSETNQPVI-------SEASSQRMPNQTgglgAYNGTAGIADSTNTAFINDIGVTmgeDTSYFSQYYYVNGG- 456
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 505403041 568 vdwgpeiESSDWQKFRIIFTQSYINYWdswsaensakpslEYSGSTFSTFYLGIAADSDSTSrYGYID 635
Cdd:PHA01755 457 -------ETGGSNYQGSAVSQWVYAWV-------------QYQGSSASSWFGCIAPQLYSSP-GGYCG 503
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
1107-1186 |
1.07e-03 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) Polycystin is a huge protein of 4303aas. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 43.92 E-value: 1.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1107 GSITNYHWDFGDGNVIDttSPTITHTYSSANTYSVTLTVTDSLGRQDS---VTKQVEVSETLFDPGFAYEDVNGNLMYDP 1183
Cdd:TIGR00864 1456 ARNASYLWDFGDGGLLE--GPEILHAFNSPGDFNIRLAAANEVGKNEAtlnVAVKARVRGLTINASLTNVPLNGSVHFEA 1533
|
...
gi 505403041 1184 GVD 1186
Cdd:TIGR00864 1534 HLD 1536
|
|
| PHA01755 |
PHA01755 |
hypothetical protein |
747-850 |
2.19e-03 |
|
hypothetical protein
Pssm-ID: 222834 Cd Length: 562 Bit Score: 42.67 E-value: 2.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 747 EVLEASVCELPGWdyrkAITITN-QNSFSLTDYQIKIELNSSNFdfTKANSDGSDIRF-TESDGSTFLNYWIESWDPSNQ 824
Cdd:PHA01755 214 ELIEFYVIPITAY----TITITNsQPDPTPSPFQQLLILNLSNI--ISSPSQLLNLQFcLDSQCSTPLYAWIESYNSNLS 287
|
90 100
....*....|....*....|....*...
gi 505403041 825 TATIWVK--VNIPASTSKTIYMYYGNSS 850
Cdd:PHA01755 288 TVYIWINlpISIPANSSITIYMFVRNSI 315
|
|
| BglS |
COG2273 |
Beta-glucanase, GH16 family [Carbohydrate transport and metabolism]; |
455-514 |
9.97e-03 |
|
Beta-glucanase, GH16 family [Carbohydrate transport and metabolism];
Pssm-ID: 441874 [Multi-domain] Cd Length: 259 Bit Score: 39.59 E-value: 9.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 455 TFVFFDDFEGTSLNTTKWATNTD----------TYQ-----VENGAIRLWGSWND---------GAYLNTRDSFSGSFvv 510
Cdd:COG2273 30 TLVFSDEFDGTSLDTSKWTYDTGgpgwgngelqYYTdenvsVENGNLVITARKEPyggggrpytSGRITTKGKFSFTY-- 107
|
....
gi 505403041 511 eGRW 514
Cdd:COG2273 108 -GRF 110
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MJ1470 |
COG5306 |
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type ... |
352-460 |
4.84e-40 |
|
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only];
Pssm-ID: 444105 [Multi-domain] Cd Length: 529 Bit Score: 156.99 E-value: 4.84e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 352 PGWDYRKAITI-TNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQTATIWVKV-NI 429
Cdd:COG5306 26 PDWSYRKPITIdTTAIGGDLTDYPVLVRLHTGNFDFSSAKEDGSDIRFVAGDDKTPLKYWIEKFDPLNEMALVWVKVpSI 105
|
90 100 110
....*....|....*....|....*....|.
gi 505403041 430 PASTSKTIYMYYGNSSATSMSNGDSTFVFFD 460
Cdd:COG5306 106 PAGAGTTIWLYYGNPKAPSASDGKGTFDFFD 136
|
|
| MJ1470 |
COG5306 |
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type ... |
757-860 |
5.66e-36 |
|
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only];
Pssm-ID: 444105 [Multi-domain] Cd Length: 529 Bit Score: 144.66 E-value: 5.66e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 757 PGWDYRKAITI-TNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQTATIWVKV-NI 834
Cdd:COG5306 26 PDWSYRKPITIdTTAIGGDLTDYPVLVRLHTGNFDFSSAKEDGSDIRFVAGDDKTPLKYWIEKFDPLNEMALVWVKVpSI 105
|
90 100
....*....|....*....|....*.
gi 505403041 835 PASTSKTIYMYYGNSSATSMSNPEKT 860
Cdd:COG5306 106 PAGAGTTIWLYYGNPKAPSASDGKGT 131
|
|
| DUF2341 |
pfam10102 |
Domain of unknown function (DUF2341); Members of this family are found in various bacterial ... |
392-475 |
4.33e-33 |
|
Domain of unknown function (DUF2341); Members of this family are found in various bacterial proteins, including MotA/TolQ/ExbB proton channels and other transport proteins. The exact function of this set of domains has not, as yet, been determined.
Pssm-ID: 431055 [Multi-domain] Cd Length: 84 Bit Score: 123.24 E-value: 4.33e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 392 DGSDIRFTESDGSTfLNYWIESWDPSNQTATIWVKV-NIPASTSKTIYMYYGNSSATSMSNGDSTFVFFDDFEGTSLNTT 470
Cdd:pfam10102 1 DGSDIRFTDSDGTT-LPYWIEPWDPTTGKALIWVKVpSIPANGNGTIYIYYGNPTATSTSNGDATFEFFDDFSGTALDTT 79
|
....*
gi 505403041 471 KWATN 475
Cdd:pfam10102 80 KWTVV 84
|
|
| DUF2341 |
pfam10102 |
Domain of unknown function (DUF2341); Members of this family are found in various bacterial ... |
797-870 |
8.88e-24 |
|
Domain of unknown function (DUF2341); Members of this family are found in various bacterial proteins, including MotA/TolQ/ExbB proton channels and other transport proteins. The exact function of this set of domains has not, as yet, been determined.
Pssm-ID: 431055 [Multi-domain] Cd Length: 84 Bit Score: 96.66 E-value: 8.88e-24
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 505403041 797 DGSDIRFTESDGSTfLNYWIESWDPSNQTATIWVKV-NIPASTSKTIYMYYGNSSATSMSNPEKTMFLYENFESD 870
Cdd:pfam10102 1 DGSDIRFTDSDGTT-LPYWIEPWDPTTGKALIWVKVpSIPANGNGTIYIYYGNPTATSTSNGDATFEFFDDFSGT 74
|
|
| PKD_4 |
pfam18911 |
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins. |
259-344 |
2.12e-22 |
|
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
Pssm-ID: 436824 [Multi-domain] Cd Length: 85 Bit Score: 92.72 E-value: 2.12e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 259 NNPPVAAFTyTPATPLVGDIITFNASSSYDPDGDkITDYIWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDERGGVNST 338
Cdd:pfam18911 1 NAAPVADAG-GDRIVAEGETVTFDASASDDPDGD-ILSYRWDFGDGTTATGANVSHTYAAPGTYTVTLTVTDDSGASNST 78
|
....*..
gi 505403041 339 -TVAIEV 344
Cdd:pfam18911 79 aTDTVTV 85
|
|
| CARDB |
pfam07705 |
CARDB; Cell adhesion related domain found in bacteria. |
151-250 |
2.84e-17 |
|
CARDB; Cell adhesion related domain found in bacteria.
Pssm-ID: 400172 [Multi-domain] Cd Length: 101 Bit Score: 78.47 E-value: 2.84e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 151 DLTISLKVSNANPEVNEEITIYADVMNVGSEDiNESFTVRFYYDSIEIYNEIINGLTSQSVEHISFSYTPTSTGQHVIKG 230
Cdd:pfam07705 3 DLIVQSISPPSEAYVGEENTITVTVKNQGTAA-AGAFNVALYVDGTSVGTITVPGLAAGESTTVSFSWTPPTEGSYTLTV 81
|
90 100
....*....|....*....|
gi 505403041 231 VVDADGAIVEDNENNNVSSK 250
Cdd:pfam07705 82 VVDPDNTVAESNETNNELTK 101
|
|
| PKD |
pfam00801 |
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ... |
670-738 |
2.43e-16 |
|
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.
Pssm-ID: 395646 [Multi-domain] Cd Length: 70 Bit Score: 74.73 E-value: 2.43e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 670 TPSTPQVNEEITFNASSSTpsepgGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGR 738
Cdd:pfam00801 4 SGTVVAAGQPVTFTATLAD-----GSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGS 67
|
|
| PKD |
pfam00801 |
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ... |
1083-1151 |
2.43e-16 |
|
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.
Pssm-ID: 395646 [Multi-domain] Cd Length: 70 Bit Score: 74.73 E-value: 2.43e-16
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 1083 TPSTPQVNEEITFNASSSTpsepgGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGR 1151
Cdd:pfam00801 4 SGTVVAAGQPVTFTATLAD-----GSNVTYTWDFGDSPGTSGSGPTVTHTYLSPGTYTVTLTASNAVGS 67
|
|
| PKD |
smart00089 |
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ... |
665-748 |
3.30e-16 |
|
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, which are also present elsewhere, such as in microbial collagenases.
Pssm-ID: 214510 [Multi-domain] Cd Length: 79 Bit Score: 74.79 E-value: 3.30e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 665 ADFTFTPSTPQVNEEITFNASSSTPsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRqDSVTK 744
Cdd:smart00089 2 ADVSASPTVGVAGESVTFTATSSDD----GSIVSYTWDFGDGTS--STGPTVTHTYTKPGTYTVTLTVTNAVGS-ASATV 74
|
....
gi 505403041 745 QVEV 748
Cdd:smart00089 75 TVVV 78
|
|
| PKD |
smart00089 |
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ... |
1078-1161 |
3.30e-16 |
|
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, which are also present elsewhere, such as in microbial collagenases.
Pssm-ID: 214510 [Multi-domain] Cd Length: 79 Bit Score: 74.79 E-value: 3.30e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1078 ADFTFTPSTPQVNEEITFNASSSTPsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRqDSVTK 1157
Cdd:smart00089 2 ADVSASPTVGVAGESVTFTATSSDD----GSIVSYTWDFGDGTS--STGPTVTHTYTKPGTYTVTLTVTNAVGS-ASATV 74
|
....
gi 505403041 1158 QVEV 1161
Cdd:smart00089 75 TVVV 78
|
|
| PKD |
cd00146 |
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ... |
663-748 |
5.45e-16 |
|
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Pssm-ID: 238084 [Multi-domain] Cd Length: 81 Bit Score: 74.07 E-value: 5.45e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 663 PTADFTFTPsTPQVNEEITFNASSStpsePGGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGRQDSV 742
Cdd:cd00146 1 PTASVSAPP-VAELGASVTFSASDS----SGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
|
....*.
gi 505403041 743 TKQVEV 748
Cdd:cd00146 76 TTTVVV 81
|
|
| PKD |
cd00146 |
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ... |
1076-1161 |
5.45e-16 |
|
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Pssm-ID: 238084 [Multi-domain] Cd Length: 81 Bit Score: 74.07 E-value: 5.45e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1076 PTADFTFTPsTPQVNEEITFNASSStpsePGGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGRQDSV 1155
Cdd:cd00146 1 PTASVSAPP-VAELGASVTFSASDS----SGGSIVSYKWDFGDGEVSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTK 75
|
....*.
gi 505403041 1156 TKQVEV 1161
Cdd:cd00146 76 TTTVVV 81
|
|
| PKD |
smart00089 |
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 ... |
263-344 |
8.33e-16 |
|
Repeats in polycystic kidney disease 1 (PKD1) and other proteins; Polycystic kidney disease 1 protein contains 14 repeats, which are also present elsewhere, such as in microbial collagenases.
Pssm-ID: 214510 [Multi-domain] Cd Length: 79 Bit Score: 73.64 E-value: 8.33e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 263 VAAFTYTPATPLVGDIITFNASSSYDPDgdkITDYIWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDErGGVNSTTVAI 342
Cdd:smart00089 1 VADVSASPTVGVAGESVTFTATSSDDGS---IVSYTWDFGDGTSSTGPTVTHTYTKPGTYTVTLTVTNA-VGSASATVTV 76
|
..
gi 505403041 343 EV 344
Cdd:smart00089 77 VV 78
|
|
| PKD_4 |
pfam18911 |
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins. |
1076-1155 |
7.86e-15 |
|
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
Pssm-ID: 436824 [Multi-domain] Cd Length: 85 Bit Score: 71.15 E-value: 7.86e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1076 PTADFTFtPSTPQVNEEITFNASSSTPsePGGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRQDSV 1155
Cdd:pfam18911 4 PVADAGG-DRIVAEGETVTFDASASDD--PDGDILSYRWDFGDGTT--ATGANVSHTYAAPGTYTVTLTVTDDSGASNST 78
|
|
| PKD_4 |
pfam18911 |
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins. |
663-742 |
7.86e-15 |
|
PKD domain; This entry is composed of PKD domains found in bacterial surface proteins.
Pssm-ID: 436824 [Multi-domain] Cd Length: 85 Bit Score: 71.15 E-value: 7.86e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 663 PTADFTFtPSTPQVNEEITFNASSSTPsePGGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRQDSV 742
Cdd:pfam18911 4 PVADAGG-DRIVAEGETVTFDASASDD--PDGDILSYRWDFGDGTT--ATGANVSHTYAAPGTYTVTLTVTDDSGASNST 78
|
|
| PKD |
cd00146 |
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an ... |
262-344 |
1.40e-12 |
|
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
Pssm-ID: 238084 [Multi-domain] Cd Length: 81 Bit Score: 64.44 E-value: 1.40e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 262 PVAAFTYTPATPLVGDIiTFNASSSYDPDgdkITDYIWDFGDG--DTATGVVTTHSYSSPGTYDVTLTVYDERGGVNSTT 339
Cdd:cd00146 1 PTASVSAPPVAELGASV-TFSASDSSGGS---IVSYKWDFGDGevSSSGEPTVTHTYTKPGTYTVTLTVTNAVGSSSTKT 76
|
....*
gi 505403041 340 VAIEV 344
Cdd:cd00146 77 TTVVV 81
|
|
| COG3430 |
COG3430 |
Archaeal flagellin (archaellin), FlaG/FlaF family [Cell motility]; |
5-78 |
1.71e-11 |
|
Archaeal flagellin (archaellin), FlaG/FlaF family [Cell motility];
Pssm-ID: 442656 Cd Length: 91 Bit Score: 61.96 E-value: 1.71e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 5 MKFSGDDRAVSELISIILMIAITVgafsVIAVSIYSFL--------QTPPSKHADFQAEKIGDELVIYHTGGEELSGDDI 76
Cdd:COG3430 4 KKGMSDERAVSPVIGVILMVAITV----ILAAVIGVFVfglgddvsEPAPQASLSVEFVGDGDSVTITHEGGDPVNVDDL 79
|
..
gi 505403041 77 II 78
Cdd:COG3430 80 KV 81
|
|
| PKD |
pfam00801 |
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. ... |
266-337 |
8.75e-11 |
|
PKD domain; This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.
Pssm-ID: 395646 [Multi-domain] Cd Length: 70 Bit Score: 58.94 E-value: 8.75e-11
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 505403041 266 FTYTPATPLVGDIITFNASSSydpDGDkITDYIWDFGDGD--TATGVVTTHSYSSPGTYDVTLTVYDERGGVNS 337
Cdd:pfam00801 1 VSASGTVVAAGQPVTFTATLA---DGS-NVTYTWDFGDSPgtSGSGPTVTHTYLSPGTYTVTLTASNAVGSANA 70
|
|
| COG1572 |
COG1572 |
Serine protease, subtilase family [Posttranslational modification, protein turnover, ... |
151-282 |
1.73e-10 |
|
Serine protease, subtilase family [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 441180 [Multi-domain] Cd Length: 459 Bit Score: 65.37 E-value: 1.73e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 151 DLTISLKVSNANPEVNEEITIYADVMNVGSEDInESFTVRFYYDSIEIYNEIINGLTSQSVEHISFSYT-PTSTGQHVIK 229
Cdd:COG1572 246 DLTVTSVTAPSTVVEGDTITVSATVKNQGTAAA-GATTVAFYLSGDPVGTASVGALAAGASYTVTVTITlPANAGTYYLL 324
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 505403041 230 GVVDADGAIVEDNENNNVSSKTISVSEPANNPPVAAFTYTPATPLVGDIITFN 282
Cdd:COG1572 325 AVVDPDNQVAESNETNNVASSAITVVGPPPPDLVVTSVSAPSTATAGSSVTVS 377
|
|
| Pilin_N |
pfam07790 |
Archaeal Type IV pilin, N-terminal; This entry represents the N-terminal domain of archaeal ... |
12-78 |
3.27e-09 |
|
Archaeal Type IV pilin, N-terminal; This entry represents the N-terminal domain of archaeal pilins, which play important roles in surface adhesion and twitching motility. This domain contains a conserved N-terminal hydrophobic motif.
Pssm-ID: 400235 Cd Length: 78 Bit Score: 54.89 E-value: 3.27e-09
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 505403041 12 RAVSELISIILMIAITVGAFSVIAVSIYSFLQTPPSK-HADFQAEKI---GDELVIYHTGGEELSGDDIII 78
Cdd:pfam07790 1 DAVSPVIGVVLMLAITVILAAVIAVFVFGLASPPEKApQASIQVKYDssaDTGVTFEHKGGDPIDTKDLKI 71
|
|
| COG3291 |
COG3291 |
Uncharacterized conserved protein, PKD repeat domain [Function unknown]; |
1080-1386 |
1.17e-08 |
|
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
Pssm-ID: 442520 [Multi-domain] Cd Length: 333 Bit Score: 58.53 E-value: 1.17e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1080 FTFTPSTPQVNEEITFNASSStpsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRQDSVTKQV 1159
Cdd:COG3291 1 FTATPTSGCAPLTVQFTDTSS------GNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGCSDTTTKTI 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1160 EVSETLFDPGFAYEDVNGNLMYDPGVDVQILASEIQDGVYDAGSNGLVIPPSVGDITASSIYFKGRDVVVSVDLTASKGV 1239
Cdd:COG3291 73 TVGAPNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDT 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1240 EIIGSDSVDITGVSVSSTNYNRDVVIQAGKILANGTDITAHGEVFLKATNIYISDSTIDTSSEYNMKISIDATNYVFANN 1319
Cdd:COG3291 153 ATVTTSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTD 232
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 505403041 1320 ATLKSQAKIDLGGNSLSGDGMSIDNSKAYDMKVNIVFLEDISLNNANIVSQGVVTINAGTQLTASDI 1386
Cdd:COG3291 233 VTLTGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNV 299
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
224-750 |
3.36e-08 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 58.94 E-value: 3.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 224 GQHVIKGVVDADGAIVEDNENNNVSSKTISVSEPANNPPVAAFTYTPATplVGDIIT------FNASSSYDPdgdkiTDY 297
Cdd:TIGR00864 1389 GAEVTFIYNDPGCYLVTVAASNNISAANDSALIEVLEPVGATSFKHNGS--HGNNLElgqpylFSAFGRARN-----ASY 1461
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 298 IWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDERGGvNSTTVAIEVIESIcelpgwdyrKAITItnqNSfSLTDyqikI 377
Cdd:TIGR00864 1462 LWDFGDGGLLEGPEILHAFNSPGDFNIRLAAANEVGK-NEATLNVAVKARV---------RGLTI---NA-SLTN----V 1523
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 378 ELNSSnFDFTKANSDGSDIRFTesdgstflnyWI--ESWDPSNQTATIWVKVNiPASTSKTIYMYYGNSSATSmsngDST 455
Cdd:TIGR00864 1524 PLNGS-VHFEAHLDAGDDVRFS----------WIlcDHCTPIFGGNTIFYTFR-SVGTFNIIVTAENDVGAAQ----ASI 1587
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 456 FVFFddfegtslnttkwatntdtyQVENGAIRLWGSWNDGAylntrdsfsGSFVVEGRWRLSTTSKDVDLAVVFAEysns 535
Cdd:TIGR00864 1588 FLFV--------------------LQEIEGLQILGETAEGG---------GGGVQELDGCYFETNHTVQFHAGFKD---- 1634
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 536 ymwestsitctydsqstsrpyyqkdlnvkGTHVDWGpeiessdwqkfriiftqsyinyWDSWSAENSAKPSLEYSGSTFS 615
Cdd:TIGR00864 1635 -----------------------------GTNLSFS----------------------WNAILDNEPDGPAFAGSGKGAK 1663
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 616 TfylgiaadsdSTSRYGYIDyIFMRKyvenepSVVISATETDCTV----PSPTADFTFTPSTPQVNEEITFNASSStpse 691
Cdd:TIGR00864 1664 L----------NPLEAGPCD-IFLQA------ANLLGQATADCTIdflePAGNLMLAASDNPAAVNALINLSAELA---- 1722
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*....
gi 505403041 692 pGGSITNYHWDFGDGNVIDTTSPTITHTYSSANTYSVTLTVTDSLGRQDSvTKQVEVLE 750
Cdd:TIGR00864 1723 -EGSGLQYRWFLEEGDDLETSEPFMSHSFPSAGLHLVTMKAFNELGSANA-SEEVDVQE 1779
|
|
| COG3291 |
COG3291 |
Uncharacterized conserved protein, PKD repeat domain [Function unknown]; |
266-553 |
3.47e-08 |
|
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
Pssm-ID: 442520 [Multi-domain] Cd Length: 333 Bit Score: 57.37 E-value: 3.47e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 266 FTYTPATPLVGDIITFNASSSYDpdgdkITDYIWDFGDGDTATGVVTTHSYSSPGTYDVTLTVYDERGGVNSTTVAIEVI 345
Cdd:COG3291 1 FTATPTSGCAPLTVQFTDTSSGN-----ATSYEWDFGDGTTSTEANPSHTYTTPGTYTVTLTVTDAAGCSDTTTKTITVG 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 346 ESICELPGWDYRKAITITNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQTATIWV 425
Cdd:COG3291 76 APNPGVTTVTTSTTVTTLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDTATV 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 426 KVNIPASTSKTIYMYYGNSSATSMSNGDSTFVFFDDFEGTSLNTTKWATNTDTYQVENGAIRLWGSWNDGAYLNTRDSFS 505
Cdd:COG3291 156 TTSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTDVTL 235
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 505403041 506 GSFVVEGRWRLSTTSKDVDLAVVFAEYSNSYMWESTSITCTYDSQSTS 553
Cdd:COG3291 236 TGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTN 283
|
|
| COG1572 |
COG1572 |
Serine protease, subtilase family [Posttranslational modification, protein turnover, ... |
132-254 |
7.33e-08 |
|
Serine protease, subtilase family [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 441180 [Multi-domain] Cd Length: 459 Bit Score: 56.90 E-value: 7.33e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 132 SEEANQVLFSMPAEALPEK-DLTISLKVSNANPEVNEEITIYADVMNVGSEDInESFTVRFYYDSIEIY-NEIINGLTSQ 209
Cdd:COG1572 336 SNETNNVASSAITVVGPPPpDLVVTSVSAPSTATAGSSVTVSVTVKNQGTAAA-SGFTVTLYLSGDATTdLTYVGSLAAG 414
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 505403041 210 SVEHISFSYTpTSTGQHVIKGVVDADGAIVEDNENNNVSSKTISV 254
Cdd:COG1572 415 ASYTVTISVT-TASGQYYLLVVADPDNYVGESNENNNVFAVSINV 458
|
|
| COG3291 |
COG3291 |
Uncharacterized conserved protein, PKD repeat domain [Function unknown]; |
667-982 |
1.36e-07 |
|
Uncharacterized conserved protein, PKD repeat domain [Function unknown];
Pssm-ID: 442520 [Multi-domain] Cd Length: 333 Bit Score: 55.45 E-value: 1.36e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 667 FTFTPSTPQVNEEITFNASSStpsepgGSITNYHWDFGDGNVidTTSPTITHTYSSANTYSVTLTVTDSLGRQDSVTKQV 746
Cdd:COG3291 1 FTATPTSGCAPLTVQFTDTSS------GNATSYEWDFGDGTT--STEANPSHTYTTPGTYTVTLTVTDAAGCSDTTTKTI 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 747 EVLEASVCELPGWDYRKAI-TITNQNSFSLTDYQIKIELNSSNFDFTKANSDGSDIRFTESDGSTFLNYWIESWDPSNQT 825
Cdd:COG3291 73 TVGAPNPGVTTVTTSTTVTtLANTANGGATTVVAGSTVGTGVATSTTTAAAPGGGGGTGTTTTTGTDTGLTGSTGTASDT 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 826 ATIWVKVNIPASTSKTIYMYYGNSSATSMSNPEKTMFLYENFESDPGNLYGDAYYDSANRYVVLTRPLIFQTGYMVYNSV 905
Cdd:COG3291 153 ATVTTSVSTTDVTSDGTTSASTNPSVTTDTVTTLTGSYTGTIVGGSGSGTVTSGTAGVTTGATSGTSGTGSATSGVAVTD 232
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 505403041 906 PTNPTGFYAKFYFKSGGGRGADALWMGAYDTDYTGTREDIVDGGYHFTYDEYNDRIAFTKSTTDNGAPIAYYSIDGS 982
Cdd:COG3291 233 VTLTGISTGDAGTPGTNTVTTSGANTAGTSTITGGTSGVVTTSAATGTSTNGTGGLGTTTAITPGNVSTTADVTGGT 309
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
1076-1231 |
3.47e-05 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 48.93 E-value: 3.47e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1076 PTADFTFTPSTPQVNEEITFNASSsTPSePGGsiTNYHWDFGDGNVIDTTS-PTITHTYSSANTYSVTLTVTDSLgrqDS 1154
Cdd:TIGR00864 1088 PRVAIGTEDGLLLAGKPADFEAHP-LPS-PGG--IHYEWDFGDGSALLQGRqPAAAHTFAKRGPFHVCLEVNNTI---SG 1160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1155 VTKQVEVsetlfdpgFAYEDVNGnLMYDPGVDVQILASEIQDGVYDAGSNgLVIPPSVGD--------ITASSIYFKGRD 1226
Cdd:TIGR00864 1161 AAACADM--------FAFEEIEG-LSADMSLATELGAATTVRAALQSGDN-ITWTFDMGDgkslsgpeATVEHKYAKAGN 1230
|
....*
gi 505403041 1227 VVVSV 1231
Cdd:TIGR00864 1231 CTVNI 1235
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
694-804 |
1.39e-04 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 47.00 E-value: 1.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 694 GSITNYHWDFGDGNVIDttSPTITHTYSSANTYSVTLTVTDSLGRQDSvTKQVEVLeASVcelpgwdyrKAITItnqNSf 773
Cdd:TIGR00864 1456 ARNASYLWDFGDGGLLE--GPEILHAFNSPGDFNIRLAAANEVGKNEA-TLNVAVK-ARV---------RGLTI---NA- 1518
|
90 100 110
....*....|....*....|....*....|.
gi 505403041 774 SLTDyqikIELNSSnFDFTKANSDGSDIRFT 804
Cdd:TIGR00864 1519 SLTN----VPLNGS-VHFEAHLDAGDDVRFS 1544
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
663-736 |
2.94e-04 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 45.84 E-value: 2.94e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 505403041 663 PTADFTFTPSTPQVNEEITFNASSsTPSePGGsiTNYHWDFGDGNVIDTTS-PTITHTYSSANTYSVTLTVTDSL 736
Cdd:TIGR00864 1088 PRVAIGTEDGLLLAGKPADFEAHP-LPS-PGG--IHYEWDFGDGSALLQGRqPAAAHTFAKRGPFHVCLEVNNTI 1158
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
671-757 |
4.33e-04 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 45.07 E-value: 4.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 671 PSTPQVNEEITFNASSStpsePGGSITNYHWDFGDGNV-IDTTSPTITHTYSSANTYSVTLTVTDSLG---RQDSVTkqV 746
Cdd:TIGR00864 2051 PQDCFTNKMAQFEAATS----PKPNFMACHWDFGDGSAgQDTDEPRAEHEYLHPGDYRVQVNASNLVSffsAHAEIN--V 2124
|
90
....*....|.
gi 505403041 747 EVLEasvCELP 757
Cdd:TIGR00864 2125 QVLA---CEEP 2132
|
|
| PHA01755 |
PHA01755 |
hypothetical protein |
359-635 |
6.68e-04 |
|
hypothetical protein
Pssm-ID: 222834 Cd Length: 562 Bit Score: 44.21 E-value: 6.68e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 359 AITITN-QNSFSLTDYQIKIELNSSNFdfTKANSDGSDIRF-TESDGSTFLNYWIESWDPSNQTATIWVK--VNIPASTS 434
Cdd:PHA01755 227 TITITNsQPDPTPSPFQQLLILNLSNI--ISSPSQLLNLQFcLDSQCSTPLYAWIESYNSNLSTVYIWINlpISIPANSS 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 435 KTIYMYYGNSSA---TSM-SNGDSTFVFFDD--------FEGTSLNTTKWATNTDTYQVEN--------GAIRLWGSWND 494
Cdd:PHA01755 305 ITIYMFVRNSIQypyTGMrPDLTSTYAQYDNgknvfliyFNGNEPLSNFNQEGNTIQQISTfgplgntiNAIYLSGYENN 384
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 495 GAYLNTRDSFSGSFVVegrwrlsTTSKDVDLAVVF----AEYSNSYMWESTSITCTYDSQST---SRPYYQKDLNVKGTh 567
Cdd:PHA01755 385 VGFVYTGKSETNQPVI-------SEASSQRMPNQTgglgAYNGTAGIADSTNTAFINDIGVTmgeDTSYFSQYYYVNGG- 456
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 505403041 568 vdwgpeiESSDWQKFRIIFTQSYINYWdswsaensakpslEYSGSTFSTFYLGIAADSDSTSrYGYID 635
Cdd:PHA01755 457 -------ETGGSNYQGSAVSQWVYAWV-------------QYQGSSASSWFGCIAPQLYSSP-GGYCG 503
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
1107-1186 |
1.07e-03 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative TMSs in positions between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino-terminal half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 43.92 E-value: 1.07e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 1107 GSITNYHWDFGDGNVIDttSPTITHTYSSANTYSVTLTVTDSLGRQDS---VTKQVEVSETLFDPGFAYEDVNGNLMYDP 1183
Cdd:TIGR00864 1456 ARNASYLWDFGDGGLLE--GPEILHAFNSPGDFNIRLAAANEVGKNEAtlnVAVKARVRGLTINASLTNVPLNGSVHFEA 1533
|
...
gi 505403041 1184 GVD 1186
Cdd:TIGR00864 1534 HLD 1536
|
|
| PHA01755 |
PHA01755 |
hypothetical protein |
747-850 |
2.19e-03 |
|
hypothetical protein
Pssm-ID: 222834 Cd Length: 562 Bit Score: 42.67 E-value: 2.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 747 EVLEASVCELPGWdyrkAITITN-QNSFSLTDYQIKIELNSSNFdfTKANSDGSDIRF-TESDGSTFLNYWIESWDPSNQ 824
Cdd:PHA01755 214 ELIEFYVIPITAY----TITITNsQPDPTPSPFQQLLILNLSNI--ISSPSQLLNLQFcLDSQCSTPLYAWIESYNSNLS 287
|
90 100
....*....|....*....|....*...
gi 505403041 825 TATIWVK--VNIPASTSKTIYMYYGNSS 850
Cdd:PHA01755 288 TVYIWINlpISIPANSSITIYMFVRNSI 315
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
699-757 |
2.29e-03 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 42.76 E-value: 2.29e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 505403041 699 YHWDFGDGNVIDTT--SPTITHTYSSANTYSVTLTVTDSLGRQDSVTkqvevleaSVCELP 757
Cdd:TIGR00864 1287 FDWSFGDGSPNETHhgCPGISHNFRGNGTFPLALTISSGVNKAHFFT--------QICVEP 1339
|
|
| PCC |
TIGR00864 |
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5) ... |
1112-1157 |
2.51e-03 |
|
polycystin cation channel protein; The Polycystin Cation Channel (PCC) Family (TC 1.A.5). Polycystin is a huge protein of 4303 amino acids. Its repeated leucine-rich (LRR) segment is found in many proteins. It contains 16 polycystic kidney disease (PKD) domains, one LDL-receptor class A domain, one C-type lectin family domain, and 16-18 putative transmembrane segments (TMSs) between residues 2200 and 4100. Polycystin-L has been shown to be a cation (Na+, K+ and Ca2+) channel that is activated by Ca2+. Two members of the PCC family (polycystin 1 and 2) are mutated in autosomal dominant polycystic kidney disease, and polycystin-L is deleted in mice with renal and retinal defects. Note: this model is restricted to the amino half.
Pssm-ID: 188093 [Multi-domain] Cd Length: 2740 Bit Score: 42.76 E-value: 2.51e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 505403041 1112 YHWDFGDGNVIDTT--SPTITHTYSSANTYSVTLTVTDSLGRQDSVTK 1157
Cdd:TIGR00864 1287 FDWSFGDGSPNETHhgCPGISHNFRGNGTFPLALTISSGVNKAHFFTQ 1334
|
|
| BglS |
COG2273 |
Beta-glucanase, GH16 family [Carbohydrate transport and metabolism]; |
455-514 |
9.97e-03 |
|
Beta-glucanase, GH16 family [Carbohydrate transport and metabolism];
Pssm-ID: 441874 [Multi-domain] Cd Length: 259 Bit Score: 39.59 E-value: 9.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 505403041 455 TFVFFDDFEGTSLNTTKWATNTD----------TYQ-----VENGAIRLWGSWND---------GAYLNTRDSFSGSFvv 510
Cdd:COG2273 30 TLVFSDEFDGTSLDTSKWTYDTGgpgwgngelqYYTdenvsVENGNLVITARKEPyggggrpytSGRITTKGKFSFTY-- 107
|
....
gi 505403041 511 eGRW 514
Cdd:COG2273 108 -GRF 110
|
|
|