| Name | Accession | Description | Interval | E-value |
| Tox-GHH | pfam15636 | GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... | 3191-3268 | 3.17e-38 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 138.51 E-value: 3.17e-38
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 386771624 3191 QERHRILKHAHKRAVERAWELEKQLVAAGFQGRGDWTEEEKEELVQHGDVDGWNGIDIHSIHKYPQLADDPGNVAFQR 3268
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| NHL super family | cl18310 | NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... | 1750-2065 | 9.38e-35 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats. The actual alignment was detected with superfamily member cd14953:
Pssm-ID: 302697 [Multi-domain] Cd Length: 323 Bit Score: 137.28 E-value: 9.38e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1750 DGSLFVGDF--NYIRRIMTDGSIRTVV----------KLNATRVSYRYHMALSPlDGTLYVSDPESHQIIRVrdtndysQ 1817
Cdd:cd14953 33 AGNLYVADRgnHRIRKITPDGVVTTVAgtgtagfadgGGAAAQFNTPSGVAVDA-AGNLYVADTGNHRIRKI-------T 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1818 PELNWEAVVGSGErclpgdeAHCGDGALAKDAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGIVSTLIGNHMHKSHW 1895
Cdd:cd14953 105 PDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGNhrIRKITPDGVVTTVAGTGGAGYAG 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1896 kpipcEGTLklEEMHLRWPTELAVSPMDNtLHIID--DHMILRMTPDGRVRVISGRPlhcaTASTAYDTDlATHATLVMP 1973
Cdd:cd14953 178 -----DGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG----TAGFSGDGG-ATAAQLNNP 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1974 QSIAFGPLGELYVAESDSqriNRVRVIGTDGRIAPFAGaeskcnclerGCDCFEAEHYLATSAKFNTIAALAVTPDSHVH 2053
Cdd:cd14953 245 TGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAG----------GGAGFSGDGGPATSAQFNNPTGVAVDAAGNLY 311
|
330
....*....|..
gi 386771624 2054 IADQANYRIRSV 2065
Cdd:cd14953 312 VADTGNNRIRKI 323
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ... | 2248-2980 | 1.07e-22 |
|
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 107.53 E-value: 1.07e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2248 GATVIVRNGAAESRTTVDMDGSTTSITPWGHNLQMEVAPYTILAEQSPLLGESYPVPAKQRTEIAGDLANRFEWRYFVRR 2327
Cdd:COG3209 341 GTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPA 420
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2328 QQPLQAGKQSKGPPRPVTEVGRKLRVNGDNVLTLEYDRETQSVVVMVDDKQELLNVTYDRTSRPISFRPQSGDYADVDLE 2407
Cdd:COG3209 421 TAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDT 500
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2408 YDRFGRLVSWKWGVLQEAYSFDRNGRLNEIKYGDGSTMVYAFKDMFGSLPLKVTTPRRSDYLLQYDDAGALQSLTTPRGH 2487
Cdd:COG3209 501 TLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGAST 580
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2488 IHAFSLQTSLGFFkYQYYSPINRHPFEILYNDEGQILAKIHPHQSGKVAFVHDTAGRLETILAGLSSTHYTYQDTTSLVK 2567
Cdd:COG3209 581 TTGTTGGTATTTT-VTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRA 659
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2568 SVEVQEPGFELRREFKYHAGILKDEKLRFGSKNSLASARYKYAYDGNARLSGIEMAIDDKELPTTRYKYSQNLGQLEVVQ 2647
Cdd:COG3209 660 TGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGG 739
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2648 DLKITRNAFNRTVIQDSAKQFfaivDYDQHGRVKSV-LMNVKNIDVFRLELDYDLRNRIKSQKTTFGRSTAFDkinYNAD 2726
Cdd:COG3209 740 TTGTLTTTSTTTTTTAGALTY----TYDALGRLTSEtTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYT---YDAL 812
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2727 GHVVEVLgtnnwkylfdENGNTVGVVDQGekFNLGYDIGDRVIKVGDVEFNNydargfvvkRGEQKYRYNNRGQLIHSFE 2806
Cdd:COG3209 813 GRLTSVI----------TVGSGGGTDLQD--RTYTYDAAGNITSITDALRAG---------TLTQTYTYDALGRLTSATD 871
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2807 RERFQSwYYYDDRsrlvawhdnkGNTTQYYYANPRTphlvthvhfpkisrtmkLFYDDRDMLIALEHEDQR--YYV---- 2880
Cdd:COG3209 872 PGTTES-YTYDAN----------GNLTSRTDGGTTT-----------------YTYDALGRLVSVTKPDGTttTYTydal 923
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2881 -ATDQNGSPLAFFDQNGSIVKEMKRTPFGRIIKDTKPEFFVPIDFHGGLIDPHTKLVYTEQRQYDPHVGQWMT--PLWET 2957
Cdd:COG3209 924 gHTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSpdPIGLA 1003
|
730 740
....*....|....*....|...
gi 386771624 2958 latemshpTDVFIYRYHNNDPIN 2980
Cdd:COG3209 1004 --------GGLNLYAYVGNNPVN 1018
|
|
| acid_disulf_rpt | NF033662 | acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... | 1337-1363 | 3.47e-06 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 45.97 E-value: 3.47e-06
10 20
....*....|....*....|....*..
gi 386771624 1337 NCGDSKDNDKDGLVDCEDPECCASHVC 1363
Cdd:NF033662 6 TCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| EGF_Tenascin super family | cl46594 | Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins. | 1071-1098 | 1.34e-04 |
|
Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins. The actual alignment was detected with superfamily member pfam18720:
Pssm-ID: 480934 Cd Length: 29 Bit Score: 41.52 E-value: 1.34e-04
10 20
....*....|....*....|....*...
gi 386771624 1071 CPNGCSGNGQCLLGHCQCNPGFGGDDCS 1098
Cdd:pfam18720 2 CPLGCSSRGVCVDGQCICDSEYSGDDCS 29
|
|
| EGF_2 | pfam07974 | EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. | 1139-1162 | 3.03e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 40.41 E-value: 3.03e-04
10 20
....*....|....*....|....*.
gi 386771624 1139 DCSGHGHCVS--GKCQCMRGYKGKFC 1162
Cdd:pfam07974 1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2183-2217 | 3.34e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 40.27 E-value: 3.34e-04
10 20 30
....*....|....*....|....*....|....*
gi 386771624 2183 PTGLLRTKLDSTGRSYVYNYDEFGRLTSAVTPTGR 2217
Cdd:pfam05593 3 AAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| DUF5885 super family | cl44670 | Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... | 1181-1229 | 3.52e-04 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses. The actual alignment was detected with superfamily member pfam19232:
Pssm-ID: 437064 Cd Length: 265 Bit Score: 45.38 E-value: 3.52e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 386771624 1181 GTCICKKGWKGPDCaTMDQDalqclpdCSGHGTFDLDTQTCTCEAKWSG 1229
Cdd:pfam19232 207 GVCPCKPGWAGGSC-TEDRT-------CNGRGTWNETTGQCACNIDFSG 247
|
|
|
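Each hit in these listings carries an interval on the query (for example 3191-3268) and a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. As a minimal sketch of how those fields could be pulled out of the text for further filtering, the Python below parses one statistics line and one interval, and computes how much of a domain model an aligned range spans. The function names (`parse_stats`, `parse_interval`, `model_coverage`) are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool, and the regular expression assumes the field order shown in this listing.

```python
import re

# Assumed layout, taken from the statistics lines in this listing:
#   Pssm-ID: <int> [Multi-domain]? Cd Length: <int> Bit Score: <float> E-value: <float>
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_stats(line):
    """Parse one 'Pssm-ID: ...' statistics line into a dict (hypothetical helper)."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("multi") is not None,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "evalue": float(match.group("evalue")),
    }


def parse_interval(text):
    """Parse a query interval such as '3191-3268' into (start, end), both 1-based inclusive."""
    start, end = text.strip().split("-")
    return int(start), int(end)


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the aligned model range (inclusive)."""
    return (model_end - model_start + 1) / cd_length


if __name__ == "__main__":
    stats = parse_stats(
        "Pssm-ID: 302697 [Multi-domain] Cd Length: 323 Bit Score: 137.28 E-value: 9.38e-35"
    )
    start, end = parse_interval("1750-2065")
    print(stats)
    print("query span:", end - start + 1)
    # The cd14953 alignment in this listing runs from model position 33 to 323.
    print("model coverage:", model_coverage(33, 323, stats["cd_length"]))
```

The span reported by `model_coverage` counts positions between the first and last aligned model residue, so gaps inside the alignment are not subtracted; treating that as a rough upper bound on coverage is a simplification made for this sketch.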
| Name | Accession | Description | Interval | E-value |
| Tox-GHH | pfam15636 | GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... | 3191-3268 | 3.17e-38 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 138.51 E-value: 3.17e-38
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 386771624 3191 QERHRILKHAHKRAVERAWELEKQLVAAGFQGRGDWTEEEKEELVQHGDVDGWNGIDIHSIHKYPQLADDPGNVAFQR 3268
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| NHL_like_1 | cd14953 | Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... | 1750-2065 | 9.38e-35 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 137.28 E-value: 9.38e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1750 DGSLFVGDF--NYIRRIMTDGSIRTVV----------KLNATRVSYRYHMALSPlDGTLYVSDPESHQIIRVrdtndysQ 1817
Cdd:cd14953 33 AGNLYVADRgnHRIRKITPDGVVTTVAgtgtagfadgGGAAAQFNTPSGVAVDA-AGNLYVADTGNHRIRKI-------T 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1818 PELNWEAVVGSGErclpgdeAHCGDGALAKDAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGIVSTLIGNHMHKSHW 1895
Cdd:cd14953 105 PDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGNhrIRKITPDGVVTTVAGTGGAGYAG 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1896 kpipcEGTLklEEMHLRWPTELAVSPMDNtLHIID--DHMILRMTPDGRVRVISGRPlhcaTASTAYDTDlATHATLVMP 1973
Cdd:cd14953 178 -----DGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG----TAGFSGDGG-ATAAQLNNP 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1974 QSIAFGPLGELYVAESDSqriNRVRVIGTDGRIAPFAGaeskcnclerGCDCFEAEHYLATSAKFNTIAALAVTPDSHVH 2053
Cdd:cd14953 245 TGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAG----------GGAGFSGDGGPATSAQFNNPTGVAVDAAGNLY 311
|
330
....*....|..
gi 386771624 2054 IADQANYRIRSV 2065
Cdd:cd14953 312 VADTGNNRIRKI 323
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ... | 2248-2980 | 1.07e-22 |
|
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 107.53 E-value: 1.07e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2248 GATVIVRNGAAESRTTVDMDGSTTSITPWGHNLQMEVAPYTILAEQSPLLGESYPVPAKQRTEIAGDLANRFEWRYFVRR 2327
Cdd:COG3209 341 GTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPA 420
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2328 QQPLQAGKQSKGPPRPVTEVGRKLRVNGDNVLTLEYDRETQSVVVMVDDKQELLNVTYDRTSRPISFRPQSGDYADVDLE 2407
Cdd:COG3209 421 TAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDT 500
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2408 YDRFGRLVSWKWGVLQEAYSFDRNGRLNEIKYGDGSTMVYAFKDMFGSLPLKVTTPRRSDYLLQYDDAGALQSLTTPRGH 2487
Cdd:COG3209 501 TLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGAST 580
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2488 IHAFSLQTSLGFFkYQYYSPINRHPFEILYNDEGQILAKIHPHQSGKVAFVHDTAGRLETILAGLSSTHYTYQDTTSLVK 2567
Cdd:COG3209 581 TTGTTGGTATTTT-VTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRA 659
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2568 SVEVQEPGFELRREFKYHAGILKDEKLRFGSKNSLASARYKYAYDGNARLSGIEMAIDDKELPTTRYKYSQNLGQLEVVQ 2647
Cdd:COG3209 660 TGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGG 739
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2648 DLKITRNAFNRTVIQDSAKQFfaivDYDQHGRVKSV-LMNVKNIDVFRLELDYDLRNRIKSQKTTFGRSTAFDkinYNAD 2726
Cdd:COG3209 740 TTGTLTTTSTTTTTTAGALTY----TYDALGRLTSEtTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYT---YDAL 812
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2727 GHVVEVLgtnnwkylfdENGNTVGVVDQGekFNLGYDIGDRVIKVGDVEFNNydargfvvkRGEQKYRYNNRGQLIHSFE 2806
Cdd:COG3209 813 GRLTSVI----------TVGSGGGTDLQD--RTYTYDAAGNITSITDALRAG---------TLTQTYTYDALGRLTSATD 871
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2807 RERFQSwYYYDDRsrlvawhdnkGNTTQYYYANPRTphlvthvhfpkisrtmkLFYDDRDMLIALEHEDQR--YYV---- 2880
Cdd:COG3209 872 PGTTES-YTYDAN----------GNLTSRTDGGTTT-----------------YTYDALGRLVSVTKPDGTttTYTydal 923
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2881 -ATDQNGSPLAFFDQNGSIVKEMKRTPFGRIIKDTKPEFFVPIDFHGGLIDPHTKLVYTEQRQYDPHVGQWMT--PLWET 2957
Cdd:COG3209 924 gHTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSpdPIGLA 1003
|
730 740
....*....|....*....|...
gi 386771624 2958 latemshpTDVFIYRYHNNDPIN 2980
Cdd:COG3209 1004 --------GGLNLYAYVGNNPVN 1018
|
|
| RHS_core | NF041261 | RHS element core protein; | 2465-2837 | 4.73e-12 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 72.34 E-value: 4.73e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2465 RSDYLLQYDDAGALQSLTTPRGHIHAFSLQTSlgffKYQYYSPINRHpfEILYNDEGQILAKI--HPHQSGKVAFV-HDT 2541
Cdd:NF041261 362 RPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQD----RITITDSLNRR--EVLHTEGEGGLKRVvkKEHADGSVTRSgYDA 435
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2542 AGRL--ETILAGLSS---------------------THYTYQDTTSLVKSVevQEPGFELRREFkyhagilkDEKLRFGS 2598
Cdd:NF041261 436 AGRLtaQTDAAGRRTeyslnvvsgditdittpdgreTKFYYNDGNQLTSVT--SPDGLESRREY--------DEPGRLVS 505
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2599 KNSLASARYKYAYDGNArlSGIEMAIDDKELPTTRYKYSQnLGQLEVVQDLK--ITRnafnrtviqdsakqffaiVDYDQ 2676
Cdd:NF041261 506 ETSRSGETTRYRYDDPH--SELPATTTDATGSTKQMTWSR-YGQLLAFTDCSgyQTR------------------YEYDR 564
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2677 HGRVKSVLMNvKNIDVFRlelDYDLRNRIKSQKTTFGRSTAFDkinYNADGHVVEVLGT--NNWKYLFDENGNTVGVVDQ 2754
Cdd:NF041261 565 FGQMTAVHRE-EGISTYR---RYDNRGQLTSVKDAQGRETRYE---YNAAGDLTAVITPdgNRSETQYDAWGKAVSTTQG 637
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2755 GEKFNLGYDIGDRVIKV----GDVEFNNYDARGFVVKRG-----EQKYRYNNRGQLIHSfERERFQSWYYYDDRSRLVAW 2825
Cdd:NF041261 638 GLTRSMEYDAAGRITTLtnenGSHSTFLYDALDRLVQQRgfdgrTQRYHYDLTGKLTQS-EDEGLVTLWHYDESDRITHR 716
|
410
....*....|..
gi 386771624 2826 HDNKGNTTQYYY 2837
Cdd:NF041261 717 TVNGEPAEQWQY 728
|
|
| PLN02919 | PLN02919 | haloacid dehalogenase-like hydrolase family protein | 1775-1997 | 1.36e-08 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 61.02 E-value: 1.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1775 KLNATRVSYRYHMALSPLDGTLYVSDPESHQIIrVRDTNDysqpelNWEAVVGS-GERCLPgdeahcgDGALaKDAKLAY 1853
Cdd:PLN02919 561 RLLTSPLKFPGKLAIDLLNNRLFISDSNHNRIV-VTDLDG------NFIVQIGStGEEGLR-------DGSF-EDATFNR 625
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1854 PKGIAISSD-NILYFADGTN--IRMVD-RDGIVSTLIGNHMHKSHWKpipceGTLKLEEMHLRWPTELAVSPMDNTLHII 1929
Cdd:PLN02919 626 PQGLAYNAKkNLLYVADTENhaLREIDfVNETVRTLAGNGTKGSDYQ-----GGKKGTSQVLNSPWDVCFEPVNEKVYIA 700
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 386771624 1930 --DDHMILRM-TPDGRVRVISG----RPLHcatASTAYDTDLAthatlvMPQSIAFGP-LGELYVAESDSQRINRV 1997
Cdd:PLN02919 701 maGQHQIWEYnISDGVTRVFSGdgyeRNLN---GSSGTSTSFA------QPSGISLSPdLKELYIADSESSSIRAL 767
|
|
| YD_repeat_2x | TIGR01643 | YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... | 2795-2836 | 3.12e-08 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 51.82 E-value: 3.12e-08
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 386771624 2795 YNNRGQLIHSFERERFQSWYYYDDRSRLVAWHDNKGNTTQYY 2836
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| Vgb | COG4257 | Streptogramin lyase [Defense mechanisms]; | 1750-2010 | 3.28e-07 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 54.64 E-value: 3.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1750 DGSLFVGDF--NYIRRI-MTDGSIRTVVKLNATRVsyrYHMALSPlDGTLYVSDPESHQIIRV-RDTNDYSQpelnweav 1825
Cdd:COG4257 27 DGAVWFTDQggGRIGRLdPATGEFTEYPLGGGSGP---HGIAVDP-DGNLWFTDNGNNRIGRIdPKTGEITT-------- 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1826 vgsgerclpgdeahcgdgaLAKDAKLAYPKGIAISSDNILYFADGTN--IRMVD-RDGIVSTLignhmhkshwkPIPCEG 1902
Cdd:COG4257 95 -------------------FALPGGGSNPHGIAFDPDGNLWFTDQGGnrIGRLDpATGEVTEF-----------PLPTGG 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1903 TLkleemhlrwPTELAVSPmDNTLHIID--DHMILRMTPD-GRVRVISG-----RPLHCATAS--------------TAY 1960
Cdd:COG4257 145 AG---------PYGIAVDP-DGNLWVTDfgANAIGRIDPDtGTLTEYALptpgaGPRGLAVDPdgnlwvadtgsgriGRF 214
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*.
gi 386771624 1961 DTD------LATHATLVMPQSIAFGPLGELYVAESDSqriNRVRVIGTDGRIAPFA 2010
Cdd:COG4257 215 DPKtgtvteYPLPGGGARPYGVAVDGDGRVWFAESGA---NRIVRFDPDTELTEYV 267
|
|
| acid_disulf_rpt | NF033662 | acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... | 1337-1363 | 3.47e-06 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 45.97 E-value: 3.47e-06
10 20
....*....|....*....|....*..
gi 386771624 1337 NCGDSKDNDKDGLVDCEDPECCASHVC 1363
Cdd:NF033662 6 TCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| EGF_Tenascin | pfam18720 | Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins. | 1071-1098 | 1.34e-04 |
|
Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins.
Pssm-ID: 376143 Cd Length: 29 Bit Score: 41.52 E-value: 1.34e-04
10 20
....*....|....*....|....*...
gi 386771624 1071 CPNGCSGNGQCLLGHCQCNPGFGGDDCS 1098
Cdd:pfam18720 2 CPLGCSSRGVCVDGQCICDSEYSGDDCS 29
|
|
| EGF_2 | pfam07974 | EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. | 1139-1162 | 3.03e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 40.41 E-value: 3.03e-04
10 20
....*....|....*....|....*.
gi 386771624 1139 DCSGHGHCVS--GKCQCMRGYKGKFC 1162
Cdd:pfam07974 1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2183-2217 | 3.34e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 40.27 E-value: 3.34e-04
10 20 30
....*....|....*....|....*....|....*
gi 386771624 2183 PTGLLRTKLDSTGRSYVYNYDEFGRLTSAVTPTGR 2217
Cdd:pfam05593 3 AAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| DUF5885 | pfam19232 | Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... | 1181-1229 | 3.52e-04 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 45.38 E-value: 3.52e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 386771624 1181 GTCICKKGWKGPDCaTMDQDalqclpdCSGHGTFDLDTQTCTCEAKWSG 1229
Cdd:pfam19232 207 GVCPCKPGWAGGSC-TEDRT-------CNGRGTWNETTGQCACNIDFSG 247
|
|
| TNFRSF26 | cd15837 | Tumor necrosis factor receptor superfamily member 26 (TNFRSF26), also known as tumor necrosis ... | 1069-1198 | 1.25e-03 |
|
Tumor necrosis factor receptor superfamily member 26 (TNFRSF26), also known as tumor necrosis factor receptor homolog 3 (TNFRH3); TNFRSF26 (also known as tumor necrosis factor receptor homolog 3 (TNFRH3) or TNFRSF24) is predominantly expressed in embryos and lymphoid cell types, along with its closely related TNFRSF22 and TNFRSF23 orthologs, and is developmentally regulated. Unlike TNFRSF22/23, TNFRSF26 does not serve as a TRAIL decoy receptor; it remains an orphan receptor.
Pssm-ID: 276933 [Multi-domain] Cd Length: 118 Bit Score: 41.20 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1069 QNCPNGcsgngQCLLGHCQCNPGFGgdDCSEsvcpvlCsQHGEYT---NGECICNPgwkgkeCSL-RHDECEVADCSGhg 1144
Cdd:cd15837 14 QLCPAG-----HYVSEPCQENHGVG--ECAP------C-EPGTFTahpNGETSCFP------CSQcRDDQEVVAECSA-- 71
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 386771624 1145 hcVSG-KCQCMRGYkgkFCEEVDCPHpNCSGHGFCADGTCICKKgwkgpdC-ATMD 1198
Cdd:cd15837 72 --TSDrQCQCKQGH---FYCDENCLE-SCFRCSRCPGGRVVLQP------CnATRD 115
|
|
|
| Name | Accession | Description | Interval | E-value |
| Tox-GHH | pfam15636 | GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... | 3191-3268 | 3.17e-38 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 138.51 E-value: 3.17e-38
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 386771624 3191 QERHRILKHAHKRAVERAWELEKQLVAAGFQGRGDWTEEEKEELVQHGDVDGWNGIDIHSIHKYPQLADDPGNVAFQR 3268
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| NHL_like_1 | cd14953 | Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... | 1750-2065 | 9.38e-35 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 137.28 E-value: 9.38e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1750 DGSLFVGDF--NYIRRIMTDGSIRTVV----------KLNATRVSYRYHMALSPlDGTLYVSDPESHQIIRVrdtndysQ 1817
Cdd:cd14953 33 AGNLYVADRgnHRIRKITPDGVVTTVAgtgtagfadgGGAAAQFNTPSGVAVDA-AGNLYVADTGNHRIRKI-------T 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1818 PELNWEAVVGSGErclpgdeAHCGDGALAKDAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGIVSTLIGNHMHKSHW 1895
Cdd:cd14953 105 PDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGNhrIRKITPDGVVTTVAGTGGAGYAG 177
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1896 kpipcEGTLklEEMHLRWPTELAVSPMDNtLHIID--DHMILRMTPDGRVRVISGRPlhcaTASTAYDTDlATHATLVMP 1973
Cdd:cd14953 178 -----DGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG----TAGFSGDGG-ATAAQLNNP 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1974 QSIAFGPLGELYVAESDSqriNRVRVIGTDGRIAPFAGaeskcnclerGCDCFEAEHYLATSAKFNTIAALAVTPDSHVH 2053
Cdd:cd14953 245 TGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAG----------GGAGFSGDGGPATSAQFNNPTGVAVDAAGNLY 311
|
330
....*....|..
gi 386771624 2054 IADQANYRIRSV 2065
Cdd:cd14953 312 VADTGNNRIRKI 323
|
|
| NHL_like_1 | cd14953 | Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... | 1841-2065 | 1.46e-25 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 110.31 E-value: 1.46e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1841 GDGALAkdAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGIVSTLIGNhmhkshwkpipceGT------------LKL 1906
Cdd:cd14953 14 GGGGTA--ARFNSPSGVAVDAAGNLYVADRGNhrIRKITPDGVVTTVAGT-------------GTagfadgggaaaqFNT 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1907 eemhlrwPTELAVSPMDNtLHIID--DHMILRMTPDGRVRVISGrplhcaTASTAYDTD-LATHATLVMPQSIAFGPLGE 1983
Cdd:cd14953 79 -------PSGVAVDAAGN-LYVADtgNHRIRKITPDGVVSTLAG------TGTAGFSDDgGATAAQFNYPTGVAVDAAGN 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1984 LYVAESDSQRInrvRVIGTDGRIAPFAGAESKcnclerGCDCFEAehylATSAKFNTIAALAVTPDSHVHIADQANYRIR 2063
Cdd:cd14953 145 LYVADTGNHRI---RKITPDGVVTTVAGTGGA------GYAGDGP----ATAAQFNNPTGVAVDAAGNLYVADRGNHRIR 211
|
..
gi 386771624 2064 SV 2065
Cdd:cd14953 212 KI 213
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ... | 2248-2980 | 1.07e-22 |
|
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 107.53 E-value: 1.07e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2248 GATVIVRNGAAESRTTVDMDGSTTSITPWGHNLQMEVAPYTILAEQSPLLGESYPVPAKQRTEIAGDLANRFEWRYFVRR 2327
Cdd:COG3209 341 GTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPA 420
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2328 QQPLQAGKQSKGPPRPVTEVGRKLRVNGDNVLTLEYDRETQSVVVMVDDKQELLNVTYDRTSRPISFRPQSGDYADVDLE 2407
Cdd:COG3209 421 TAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDT 500
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2408 YDRFGRLVSWKWGVLQEAYSFDRNGRLNEIKYGDGSTMVYAFKDMFGSLPLKVTTPRRSDYLLQYDDAGALQSLTTPRGH 2487
Cdd:COG3209 501 TLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGAST 580
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2488 IHAFSLQTSLGFFkYQYYSPINRHPFEILYNDEGQILAKIHPHQSGKVAFVHDTAGRLETILAGLSSTHYTYQDTTSLVK 2567
Cdd:COG3209 581 TTGTTGGTATTTT-VTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRA 659
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2568 SVEVQEPGFELRREFKYHAGILKDEKLRFGSKNSLASARYKYAYDGNARLSGIEMAIDDKELPTTRYKYSQNLGQLEVVQ 2647
Cdd:COG3209 660 TGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGG 739
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2648 DLKITRNAFNRTVIQDSAKQFfaivDYDQHGRVKSV-LMNVKNIDVFRLELDYDLRNRIKSQKTTFGRSTAFDkinYNAD 2726
Cdd:COG3209 740 TTGTLTTTSTTTTTTAGALTY----TYDALGRLTSEtTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYT---YDAL 812
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2727 GHVVEVLgtnnwkylfdENGNTVGVVDQGekFNLGYDIGDRVIKVGDVEFNNydargfvvkRGEQKYRYNNRGQLIHSFE 2806
Cdd:COG3209 813 GRLTSVI----------TVGSGGGTDLQD--RTYTYDAAGNITSITDALRAG---------TLTQTYTYDALGRLTSATD 871
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2807 RERFQSwYYYDDRsrlvawhdnkGNTTQYYYANPRTphlvthvhfpkisrtmkLFYDDRDMLIALEHEDQR--YYV---- 2880
Cdd:COG3209 872 PGTTES-YTYDAN----------GNLTSRTDGGTTT-----------------YTYDALGRLVSVTKPDGTttTYTydal 923
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2881 -ATDQNGSPLAFFDQNGSIVKEMKRTPFGRIIKDTKPEFFVPIDFHGGLIDPHTKLVYTEQRQYDPHVGQWMT--PLWET 2957
Cdd:COG3209 924 gHTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSpdPIGLA 1003
|
730 740
....*....|....*....|...
gi 386771624 2958 latemshpTDVFIYRYHNNDPIN 2980
Cdd:COG3209 1004 --------GGLNLYAYVGNNPVN 1018
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1750-2063 |
1.59e-18 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 88.53 E-value: 1.59e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1750 DGSLFVGDF--NYIRRIMTDGSIRTVVKLNATRVSYRYH---MALSPlDGTLYVSDPESHQIIRVRdtndysqPELNWEA 1824
Cdd:cd05819 18 SGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEpagVAVDS-DGNLYVADTGNHRIQKFD-------PDGNFLA 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1825 VVGSgerclPGDeahcgdgalaKDAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGIVSTLIGnhmhkshwkpipceg 1902
Cdd:cd05819 90 SFGG-----SGD----------GDGEFNGPRGIAVDSSGNIYVADTGNhrIQKFDPDGEFLTTFG--------------- 139
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1903 TLKLEEMHLRWPTELAVSPmDNTLHIID--DHMILRMTPDGrvrvisgrplhcaTASTAYDTDLATHATLVMPQSIAFGP 1980
Cdd:cd05819 140 SGGSGPGQFNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDG-------------NFLTTFGSTGTGPGQFNYPTGIAVDS 205
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1981 LGELYVAESDSqriNRVRVIGTDGRIAPFAGaeskcnclergcdcfeaeHYLATSAKFNTIAALAVTPDSHVHIADQANY 2060
Cdd:cd05819 206 DGNIYVADSGN---NRVQVFDPDGAGFGGNG------------------NFLGSDGQFNRPSGLAVDSDGNLYVADTGNN 264
|
...
gi 386771624 2061 RIR 2063
Cdd:cd05819 265 RIQ 267
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1750-1997 |
1.14e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 80.06 E-value: 1.14e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1750 DGSLFVGDF--NYIRRIMTDGSIRTVVKLNATRVSYRYH---MALSPlDGTLYVSDPESHQIIRVRDTNDYsqpelnwEA 1824
Cdd:cd05819 65 DGNLYVADTgnHRIQKFDPDGNFLASFGGSGDGDGEFNGprgIAVDS-SGNIYVADTGNHRIQKFDPDGEF-------LT 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1825 VVGSGERClpgdeahcgdgalakDAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGIVSTLIGNhmhkshwkpipcEG 1902
Cdd:cd05819 137 TFGSGGSG---------------PGQFNGPTGVAVDSDGNIYVADTGNhrIQVFDPDGNFLTTFGS------------TG 189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1903 TLKLEemhLRWPTELAVSPmDNTLHIID--DHMILRMTPDGRVRVISGrplhcatastaydTDLATHATLVMPQSIAFGP 1980
Cdd:cd05819 190 TGPGQ---FNYPTGIAVDS-DGNIYVADsgNNRVQVFDPDGAGFGGNG-------------NFLGSDGQFNRPSGLAVDS 252
|
250
....*....|....*..
gi 386771624 1981 LGELYVAESDSQRINRV 1997
Cdd:cd05819 253 DGNLYVADTGNNRIQVF 269
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1847-2063 |
5.56e-14 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 75.05 E-value: 5.56e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1847 KDAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGIVSTLIGnhmhkshwkpipCEGTlklEEMHLRWPTELAVSPmDN 1924
Cdd:cd05819 3 GPGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFG------------SFGS---GDGQFNEPAGVAVDS-DG 66
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1925 TLHIID--DHMILRMTPDGRVRVISGRPLHcatastaydtdlaTHATLVMPQSIAFGPLGELYVAESDSqriNRVRVIGT 2002
Cdd:cd05819 67 NLYVADtgNHRIQKFDPDGNFLASFGGSGD-------------GDGEFNGPRGIAVDSSGNIYVADTGN---HRIQKFDP 130
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 386771624 2003 DGRIAPFAGAESKCNclergcdcfeaehylatsAKFNTIAALAVTPDSHVHIADQANYRIR 2063
Cdd:cd05819 131 DGEFLTTFGSGGSGP------------------GQFNGPTGVAVDSDGNIYVADTGNHRIQ 173
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
2465-2837 |
4.73e-12 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 72.34 E-value: 4.73e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2465 RSDYLLQYDDAGALQSLTTPRGHIHAFSLQTSlgffKYQYYSPINRHpfEILYNDEGQILAKI--HPHQSGKVAFV-HDT 2541
Cdd:NF041261 362 RPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQD----RITITDSLNRR--EVLHTEGEGGLKRVvkKEHADGSVTRSgYDA 435
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2542 AGRL--ETILAGLSS---------------------THYTYQDTTSLVKSVevQEPGFELRREFkyhagilkDEKLRFGS 2598
Cdd:NF041261 436 AGRLtaQTDAAGRRTeyslnvvsgditdittpdgreTKFYYNDGNQLTSVT--SPDGLESRREY--------DEPGRLVS 505
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2599 KNSLASARYKYAYDGNArlSGIEMAIDDKELPTTRYKYSQnLGQLEVVQDLK--ITRnafnrtviqdsakqffaiVDYDQ 2676
Cdd:NF041261 506 ETSRSGETTRYRYDDPH--SELPATTTDATGSTKQMTWSR-YGQLLAFTDCSgyQTR------------------YEYDR 564
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2677 HGRVKSVLMNvKNIDVFRlelDYDLRNRIKSQKTTFGRSTAFDkinYNADGHVVEVLGT--NNWKYLFDENGNTVGVVDQ 2754
Cdd:NF041261 565 FGQMTAVHRE-EGISTYR---RYDNRGQLTSVKDAQGRETRYE---YNAAGDLTAVITPdgNRSETQYDAWGKAVSTTQG 637
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2755 GEKFNLGYDIGDRVIKV----GDVEFNNYDARGFVVKRG-----EQKYRYNNRGQLIHSfERERFQSWYYYDDRSRLVAW 2825
Cdd:NF041261 638 GLTRSMEYDAAGRITTLtnenGSHSTFLYDALDRLVQQRgfdgrTQRYHYDLTGKLTQS-EDEGLVTLWHYDESDRITHR 716
|
410
....*....|..
gi 386771624 2826 HDNKGNTTQYYY 2837
Cdd:NF041261 717 TVNGEPAEQWQY 728
|
|
| NHL_like_3 |
cd14956 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1793-2062 |
1.80e-10 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271326 [Multi-domain] Cd Length: 274 Bit Score: 64.61 E-value: 1.80e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1793 DGTLYVSDPESHQIirVRDTNDYSqpelnWEAVVGSgerclpgdeahCGDGAlakdAKLAYPKGIAISSDNILYFADGTN 1872
Cdd:cd14956 70 DGWLYVADYWGDRI--QVFTLTGE-----LQTIGGS-----------SGSGP----GQFNAPRGVAVDADGNLYVADFGN 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1873 IRMV--DRDG-IVSTlignhmhkshWkpipceGTLKLEEMHLRWPTELAVSPmDNTLHIID--DHMILRMTPDGR-VRVI 1946
Cdd:cd14956 128 QRIQkfDPDGsFLRQ----------W------GGTGIEPGSFNYPRGVAVDP-DGTLYVADtyNDRIQVFDNDGAfLRKW 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1947 SGRplhcatastaydtdLATHATLVMPQSIAFGPLGELYVAESDSqriNRVRVIGTDGR-IAPFAGAESkcnclERGcdc 2025
Cdd:cd14956 191 GGR--------------GTGPGQFNYPYGIAIDPDGNVFVADFGN---NRIQKFTADGTfLTSWGSPGT-----GPG--- 245
|
250 260 270
....*....|....*....|....*....|....*..
gi 386771624 2026 feaehylatsaKFNTIAALAVTPDSHVHIADQANYRI 2062
Cdd:cd14956 246 -----------QFKNPWGVVVDADGTVYVADSNNNRV 271
|
|
| PLN02919 |
PLN02919 |
haloacid dehalogenase-like hydrolase family protein |
1775-1997 |
1.36e-08 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 61.02 E-value: 1.36e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1775 KLNATRVSYRYHMALSPLDGTLYVSDPESHQIIrVRDTNDysqpelNWEAVVGS-GERCLPgdeahcgDGALaKDAKLAY 1853
Cdd:PLN02919 561 RLLTSPLKFPGKLAIDLLNNRLFISDSNHNRIV-VTDLDG------NFIVQIGStGEEGLR-------DGSF-EDATFNR 625
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1854 PKGIAISSD-NILYFADGTN--IRMVD-RDGIVSTLIGNHMHKSHWKpipceGTLKLEEMHLRWPTELAVSPMDNTLHII 1929
Cdd:PLN02919 626 PQGLAYNAKkNLLYVADTENhaLREIDfVNETVRTLAGNGTKGSDYQ-----GGKKGTSQVLNSPWDVCFEPVNEKVYIA 700
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 386771624 1930 --DDHMILRM-TPDGRVRVISG----RPLHcatASTAYDTDLAthatlvMPQSIAFGP-LGELYVAESDSQRINRV 1997
Cdd:PLN02919 701 maGQHQIWEYnISDGVTRVFSGdgyeRNLN---GSSGTSTSFA------QPSGISLSPdLKELYIADSESSSIRAL 767
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1955-2065 |
2.19e-08 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 58.70 E-value: 2.19e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1955 TASTAYDTDLATHATLVMPQSIAFGPLGELYVAESDSQRInrvRVIGTDGRIAPFAGAeskcncLERGcdcfeaehYL-- 2032
Cdd:cd14953 7 SGTAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRI---RKITPDGVVTTVAGT------GTAG--------FAdg 69
|
90 100 110
....*....|....*....|....*....|....
gi 386771624 2033 -ATSAKFNTIAALAVTPDSHVHIADQANYRIRSV 2065
Cdd:cd14953 70 gGAAAQFNTPSGVAVDAAGNLYVADTGNHRIRKI 103
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
2795-2836 |
3.12e-08 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 51.82 E-value: 3.12e-08
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 386771624 2795 YNNRGQLIHSFERERFQSWYYYDDRSRLVAWHDNKGNTTQYY 2836
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1826-2063 |
5.51e-08 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized. The NHL repeat domain is found C-terminal to a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 57.59 E-value: 5.51e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1826 VGSGERclpGDEahcgDGALAkDAKLAYPKGIAISSDNILYFADGTN--IRMVD-RDGIVSTLIGN---HMHKshwkpip 1899
Cdd:cd14951 1 IGSGER---GLK----DGSFA-EASFNEPQGLALLPGNILYVADTENhaLRKIDlETGTVTTLAGTgeqGRDG------- 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1900 cEGTLKLEEMHLRWPTELAVSPMDNTLHI----IddHMILRMTPD-GRVRVISGrplhcatasTAY----DTDLATHATL 1970
Cdd:cd14951 66 -EGGGPGREQPLSSPWDVAWGPEDDILYIamagT--HQIWAYDLDtGTCRVFAG---------SGNegnrNGPYPHEAWF 133
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1971 VMPQSIAFGPLGELYVAESDSQRINRVRVigTDGRIAPFAGAEskcnclERGCDCFEAEHY--LATSAKFNtiAALAVT- 2047
Cdd:cd14951 134 AQPSGLSLAGWGELFVADSESSAIRAVSL--KDGGVKTLVGGT------RVGTGLFDFGDRdgPGAEALLQ--HPLGVAa 203
|
250
....*....|....*..
gi 386771624 2048 -PDSHVHIADQANYRIR 2063
Cdd:cd14951 204 lPDGSVYVADTYNHKIK 220
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1750-2010 |
3.28e-07 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 54.64 E-value: 3.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1750 DGSLFVGDF--NYIRRI-MTDGSIRTVVKLNATRVsyrYHMALSPlDGTLYVSDPESHQIIRV-RDTNDYSQpelnweav 1825
Cdd:COG4257 27 DGAVWFTDQggGRIGRLdPATGEFTEYPLGGGSGP---HGIAVDP-DGNLWFTDNGNNRIGRIdPKTGEITT-------- 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1826 vgsgerclpgdeahcgdgaLAKDAKLAYPKGIAISSDNILYFADGTN--IRMVD-RDGIVSTLignhmhkshwkPIPCEG 1902
Cdd:COG4257 95 -------------------FALPGGGSNPHGIAFDPDGNLWFTDQGGnrIGRLDpATGEVTEF-----------PLPTGG 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1903 TLkleemhlrwPTELAVSPmDNTLHIID--DHMILRMTPD-GRVRVISG-----RPLHCATAS--------------TAY 1960
Cdd:COG4257 145 AG---------PYGIAVDP-DGNLWVTDfgANAIGRIDPDtGTLTEYALptpgaGPRGLAVDPdgnlwvadtgsgriGRF 214
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*.
gi 386771624 1961 DTD------LATHATLVMPQSIAFGPLGELYVAESDSqriNRVRVIGTDGRIAPFA 2010
Cdd:COG4257 215 DPKtgtvteYPLPGGGARPYGVAVDGDGRVWFAESGA---NRIVRFDPDTELTEYV 267
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
1337-1363 |
3.47e-06 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 45.97 E-value: 3.47e-06
10 20
....*....|....*....|....*..
gi 386771624 1337 NCGDSKDNDKDGLVDCEDPECCASHVC 1363
Cdd:NF033662 6 TCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| NHL_like_3 |
cd14956 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1848-2062 |
1.17e-05 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271326 [Multi-domain] Cd Length: 274 Bit Score: 49.97 E-value: 1.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1848 DAKLAYPKGIAISSDNILYFADGTN--IRMVDRDGivstlignhMHKSHWKPIPcEGTLKLEEmhlrwPTELAVSPmDNT 1925
Cdd:cd14956 9 PGQFKDPRGIAVDADDNVYVADARNgrIQVFDKDG---------TFLRRFGTTG-DGPGQFGR-----PRGLAVDK-DGW 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1926 LHIID--DHMILRMTPDGRVRVISGRPLHCATASTAydtdlathatlvmPQSIAFGPLGELYVAESDSQRINRVRVIGTD 2003
Cdd:cd14956 73 LYVADywGDRIQVFTLTGELQTIGGSSGSGPGQFNA-------------PRGVAVDADGNLYVADFGNQRIQKFDPDGSF 139
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 386771624 2004 GRIAPFAGAEskcnclergcdcfeaehylatSAKFNTIAALAVTPDSHVHIADQANYRI 2062
Cdd:cd14956 140 LRQWGGTGIE---------------------PGSFNYPRGVAVDPDGTLYVADTYNDRI 177
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1785-2065 |
1.87e-05 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 49.25 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1785 YHMALSPlDGTLYVSDPESHQIIRVrDTNDysqpelnweavvgsgerclpgdeahcGDGALAKDAKLAYPKGIAISSDNI 1864
Cdd:COG4257 20 RDVAVDP-DGAVWFTDQGGGRIGRL-DPAT--------------------------GEFTEYPLGGGSGPHGIAVDPDGN 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1865 LYFADGTN--IRMVDR-DGIVSTLignhmhkshwkPIPCEGTLkleemhlrwPTELAVSPmDNTLHIID--DHMILRMTP 1939
Cdd:COG4257 72 LWFTDNGNnrIGRIDPkTGEITTF-----------ALPGGGSN---------PHGIAFDP-DGNLWFTDqgGNRIGRLDP 130
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1940 D-GRVRVISGRplhcATASTAYDtdlathatlvmpqsIAFGPLGELYVAESDSQRInrVRVIGTDGRIAPFAGaeskcnc 2018
Cdd:COG4257 131 AtGEVTEFPLP----TGGAGPYG--------------IAVDPDGNLWVTDFGANAI--GRIDPDTGTLTEYAL------- 183
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 386771624 2019 lergcdcfeaehylatSAKFNTIAALAVTPDSHVHIADQANYRIRSV 2065
Cdd:COG4257 184 ----------------PTPGAGPRGLAVDPDGNLWVADTGSGRIGRF 214
|
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1788-2015 |
8.00e-05 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized. The NHL repeat domain is found C-terminal to a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 47.57 E-value: 8.00e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1788 ALSPLDGTLYVSDPESHQIIRVRDTNDYsqpelnWEAVVGSG-ERCLpgdeahcgDGALAKDAKLAYPKGIAISSDNILY 1866
Cdd:cd14951 83 AWGPEDDILYIAMAGTHQIWAYDLDTGT------CRVFAGSGnEGNR--------NGPYPHEAWFAQPSGLSLAGWGELF 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1867 FAD--GTNIRMVDR-DGIVSTLIGNhmhkshwkpiPCEGT-L--------KLEEMHLRWPTELAVSPmDNTLHIIDdhmi 1934
Cdd:cd14951 149 VADseSSAIRAVSLkDGGVKTLVGG----------TRVGTgLfdfgdrdgPGAEALLQHPLGVAALP-DGSVYVAD---- 213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1935 lrmTPDGRVRVISGRPLHCAT-ASTAYDTDLATHATLVMPQSIAFGPLGELYVAESDSQRINRVRVIGTDGRIAPFAGAE 2013
Cdd:cd14951 214 ---TYNHKIKRVDPATGEVSTlAGTGKAGYKDLEAQFSEPSGLVVDGDGRLYVADTNNHRIRRLDLPTEALEVLTLAHRT 290
|
..
gi 386771624 2014 SK 2015
Cdd:cd14951 291 LR 292
|
|
| EGF_Tenascin |
pfam18720 |
Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins. |
1071-1098 |
1.34e-04 |
|
Tenascin EGF domain; This entry represents the EGF-like domains found in tenascin proteins.
Pssm-ID: 376143 Cd Length: 29 Bit Score: 41.52 E-value: 1.34e-04
10 20
....*....|....*....|....*...
gi 386771624 1071 CPNGCSGNGQCLLGHCQCNPGFGGDDCS 1098
Cdd:pfam18720 2 CPLGCSSRGVCVDGQCICDSEYSGDDCS 29
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1787-2062 |
2.27e-04 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 45.75 E-value: 2.27e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1787 MALSPLDGTLYVSDPESHQIiRVRDtndysqpeLNWEAVVGSGErclPGdeahcgdgalAKDAKLAYPKGIAISSDNILY 1866
Cdd:cd14963 13 MGVAVSDGRIYVADTNNHRV-QVFD--------YEGKFKKSFGG---PG----------TGPGEFKYPYGIAVDSDGNIY 70
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1867 FADGTN--IRMVDRDGivsTLIGNHMHKS----HWKPIPC----------------------EGTLKLE-------EMHL 1911
Cdd:cd14963 71 VADLYNgrIQVFDPDG---KFLKYFPEKKdrvkLISPAGLaiddgklyvsdvkkhkvivfdlEGKLLLEfgkpgsePGEL 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1912 RWPTELAVSPmDNTLHIID--DHMILRMTPDGR-VRVISGRPLhcatastaydtdlaTHATLVMPQSIAFGPLGELYVAE 1988
Cdd:cd14963 148 SYPNGIAVDE-DGNIYVADsgNGRIQVFDKNGKfIKELNGSPD--------------GKSGFVNPRGIAVDPDGNLYVVD 212
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 386771624 1989 SDSqriNRVRVIGTDGR-IAPFAGaeskcncleRGCDcfeaehylatSAKFNTIAALAVTPDSHVHIADQANYRI 2062
Cdd:cd14963 213 NLS---HRVYVFDEQGKeLFTFGG---------RGKD----------DGQFNLPNGLFIDDDGRLYVTDRENNRV 265
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
1139-1162 |
3.03e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 40.41 E-value: 3.03e-04
10 20
....*....|....*....|....*.
gi 386771624 1139 DCSGHGHCVS--GKCQCMRGYKGKFC 1162
Cdd:pfam07974 1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
2183-2217 |
3.34e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 40.27 E-value: 3.34e-04
10 20 30
....*....|....*....|....*....|....*
gi 386771624 2183 PTGLLRTKLDSTGRSYVYNYDEFGRLTSAVTPTGR 2217
Cdd:pfam05593 3 AAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1793-1994 |
3.45e-04 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 45.33 E-value: 3.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1793 DGTLYVSDPESHqIIRV---RDTNDYSqpelnweavVGSGerclpgdeahcGDGalakDAKLAYPKGIAISSDNILYFAD 1869
Cdd:cd14957 122 NGNIYVADTGNH-RIQVftsSGTFSYS---------IGSG-----------GTG----PGQFNGPQGIAVDSDGNIYVAD 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1870 GTN--IRMVDRDGIVSTLIGNhmhkshwkpipcEGTLKLEemhLRWPTELAVSPMDNtLHIIDdhmilrmTPDGRVRVIs 1947
Cdd:cd14957 177 TGNhrIQVFTSSGTFQYTFGS------------SGSGPGQ---FSDPYGIAVDSDGN-IYVAD-------TGNHRIQVF- 232
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 386771624 1948 grplhcaTASTAYDTDLATHATLV----MPQSIAFGPLGELYVAESDSQRI 1994
Cdd:cd14957 233 -------TSSGAYQYSIGTSGSGNgqfnYPYGIAVDNDGKIYVADSNNNRI 276
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
1181-1229 |
3.52e-04 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 45.38 E-value: 3.52e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 386771624 1181 GTCICKKGWKGPDCaTMDQDalqclpdCSGHGTFDLDTQTCTCEAKWSG 1229
Cdd:pfam19232 207 GVCPCKPGWAGGSC-TEDRT-------CNGRGTWNETTGQCACNIDFSG 247
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2905-2980 |
3.55e-04 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 41.33 E-value: 3.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 2905 TPFGRIIKDTkPEFFVPIDFHGGLIDPHTKLVYTEQRQYDPHVGQWMTPlwetlatemshptDVF-------IYRYHNND 2977
Cdd:TIGR03696 2 DPYGEVLSES-GAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSP-------------DPIglggglnLYAYVGNN 67
|
...
gi 386771624 2978 PIN 2980
Cdd:TIGR03696 68 PVN 70
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
1172-1194 |
4.49e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 39.64 E-value: 4.49e-04
10 20
....*....|....*....|....*
gi 386771624 1172 CSGHGFCAD--GTCICKKGWKGPDC 1194
Cdd:pfam07974 2 CSGRGTCVNqcGKCVCDSGYQGATC 26
|
|
| TNFRSF26 |
cd15837 |
Tumor necrosis factor receptor superfamily member 26 (TNFRSF26), also known as tumor necrosis ... |
1069-1198 |
1.25e-03 |
|
Tumor necrosis factor receptor superfamily member 26 (TNFRSF26), also known as tumor necrosis factor receptor homolog 3 (TNFRH3); TNFRSF26 (also known as tumor necrosis factor receptor homolog 3 (TNFRH3) or TNFRSF24) is predominantly expressed in embryos and lymphoid cell types, along with its closely related TNFRSF22 and TNFRSF23 orthologs, and is developmentally regulated. Unlike TNFRSF22/23, TNFRSF26 does not serve as a TRAIL decoy receptor; it remains an orphan receptor.
Pssm-ID: 276933 [Multi-domain] Cd Length: 118 Bit Score: 41.20 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 386771624 1069 QNCPNGcsgngQCLLGHCQCNPGFGgdDCSEsvcpvlCsQHGEYT---NGECICNPgwkgkeCSL-RHDECEVADCSGhg 1144
Cdd:cd15837 14 QLCPAG-----HYVSEPCQENHGVG--ECAP------C-EPGTFTahpNGETSCFP------CSQcRDDQEVVAECSA-- 71
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 386771624 1145 hcVSG-KCQCMRGYkgkFCEEVDCPHpNCSGHGFCADGTCICKKgwkgpdC-ATMD 1198
Cdd:cd15837 72 --TSDrQCQCKQGH---FYCDENCLE-SCFRCSRCPGGRVVLQP------CnATRD 115
|
|
| DUF3844 |
pfam12955 |
Domain of unknown function (DUF3844); This presumed domain is found in fungal species. It ... |
1171-1235 |
5.33e-03 |
|
Domain of unknown function (DUF3844); This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. It occurs in proteins that are thought to reside in the endoplasmic reticulum.
Pssm-ID: 432898 Cd Length: 104 Bit Score: 39.13 E-value: 5.33e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 386771624 1171 NCSGHGFCADGTcickKGWKGPDCATmdqdalqclpdCSGHGTFDLDTQTCTCEAKWSGDDCSKE 1235
Cdd:pfam12955 14 NCSGHGECVKKY----KSKSGRDCFA-----------CKCKATVVRKGDDGSKTTYWGGPACQKK 63
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
1207-1232 |
7.97e-03 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 36.17 E-value: 7.97e-03
10 20
....*....|....*....|....*.
gi 386771624 1207 DCSGHGTFDLDTQTCTCEAKWSGDDC 1232
Cdd:pfam07974 1 ICSGRGTCVNQCGKCVCDSGYQGATC 26
|
|
|