NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1622865605|ref|XP_028689702|]
View 

teneurin-4 isoform X1 [Macaca mulatta]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
36-435 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


:

Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 579.24  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605   36 SLT-RRRDAERRYTSSSADSEEGKTP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGANFTLRELGLGEV 113
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  114 TSPHGTLYRTDIGLPHCGYSMGASSDADMEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 190
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  191 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPGGLQNHAR 270
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  271 LRTPPPPLSHTHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPASgaQEPAHAQDNWLLNSNIPLETRnlgkq 350
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQST--QESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  351 pflgtlqdnliemdilgasrhdgaygdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFARPAFNLKKPS 429
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 1622865605  430 KYCNWK 435
Cdd:pfam06484  362 KYCSWK 367
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1317-1658 7.34e-46

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 169.25  E-value: 7.34e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1317 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRHSHSPAHKYY----LTTDPmSGA 1386
Cdd:cd14953     11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1387 VFLSDTNSRRVFKIKSTVVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1464
Cdd:cd14953     90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1465 RRIDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1542
Cdd:cd14953    156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1543 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1622
Cdd:cd14953    226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1622865605 1623 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1658
Cdd:cd14953    288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2780-2857 3.35e-38

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


:

Pssm-ID: 464783  Cd Length: 78  Bit Score: 138.13  E-value: 3.35e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 2780 EEKARVLELARQRAVRQAWAREQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNIHFMR 2857
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1684-2556 5.67e-34

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


:

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 144.13  E-value: 5.67e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1684 FDTTGKHLYTQSLPTGDYLYNFTYTGDGDVTLITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALKSVTTQGH 1763
Cdd:COG3209    184 VGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTT 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1764 ELAMMTYHGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAF 1843
Cdd:COG3209    264 GAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTG 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1844 YTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFG 1923
Cdd:COG3209    344 GTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAA 423
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1924 RRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSlWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYD 2003
Cdd:COG3209    424 GALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGA-ATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTL 502
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2004 QAGRIISRIFADGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASV 2083
Cdd:COG3209    503 DDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTT 582
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2084 IQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTINLQNEGFTCTIRYRQIGPLIDRQI 2163
Cdd:COG3209    583 GTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGT 662
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2164 FRFTEEGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAY 2243
Cdd:COG3209    663 TGTGTGVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2244 GRMKEVQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSP 2323
Cdd:COG3209    742 GTLTTTSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVIT 820
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2324 GNSARLTPL-----RYDLRDRITRLGDVQykmdEDGFLRQRggdiFEYNSAGLLIKAynRAGGWSVRYRYDGLGRRVSSK 2398
Cdd:COG3209    821 VGSGGGTDLqdrtyTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTSRT 890
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2399 SSHSHHLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYT 2478
Cdd:COG3209    891 DGGTTTYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYD 949
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 2479 AYGEIYMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhELWQHLSssnimpFNLYMFKNNNPISNS 2556
Cdd:COG3209    950 PFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-PIGLAGG------LNLYAYVGNNPVNYV 1020
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
928-958 2.29e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


:

Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.83  E-value: 2.29e-09
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1622865605  928 SMETACGDSKDNDGDGLVDCMDPDCCLQPLC 958
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
DUF5885 super family cl44670
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
659-817 2.37e-05

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


The actual alignment was detected with superfamily member pfam19232:

Pssm-ID: 437064  Cd Length: 265  Bit Score: 48.85  E-value: 2.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  659 DNCPSNCYGNGDCISGTCH-----------------CFLGFLGPD---CGRASCpvlcsGNGQ----------YMKGRCL 708
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTPKascCGGVTC-----GAGQtcdaktntcvYVKGYCS 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  709 C-HSGWKGAECDVPTNQCIDVACSNHGT---CIMG-----------------TCIC-NPG-YK-GESC-----------E 753
Cdd:pfam19232   85 AdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGfgawiweldpatnsgvwRCRCaNGSlYNsAHECspladqtlcaaE 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  754 EVD-----------------------CMDPTCSGRGVC--VRGECHCSVGWGGTNCETPRAtcldqCSGHGTFLPDTGLC 808
Cdd:pfam19232  165 NLDpnalvpassvpafaaygwgnqpvLINKSTAGAAVPspLAGVCPCKPGWAGGSCTEDRT-----CNGRGTWNETTGQC 239

                   ....*....
gi 1622865605  809 SCDPSWTGH 817
Cdd:pfam19232  240 ACNIDFSGH 248
DSL super family cl19567
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
841-884 4.60e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


The actual alignment was detected with superfamily member pfam01414:

Pssm-ID: 473190  Cd Length: 46  Bit Score: 43.00  E-value: 4.60e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1622865605  841 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 884
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
36-435 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 579.24  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605   36 SLT-RRRDAERRYTSSSADSEEGKTP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGANFTLRELGLGEV 113
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  114 TSPHGTLYRTDIGLPHCGYSMGASSDADMEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 190
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  191 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPGGLQNHAR 270
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  271 LRTPPPPLSHTHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPASgaQEPAHAQDNWLLNSNIPLETRnlgkq 350
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQST--QESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  351 pflgtlqdnliemdilgasrhdgaygdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFARPAFNLKKPS 429
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 1622865605  430 KYCNWK 435
Cdd:pfam06484  362 KYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1317-1658 7.34e-46

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 169.25  E-value: 7.34e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1317 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRHSHSPAHKYY----LTTDPmSGA 1386
Cdd:cd14953     11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1387 VFLSDTNSRRVFKIKSTVVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1464
Cdd:cd14953     90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1465 RRIDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1542
Cdd:cd14953    156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1543 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1622
Cdd:cd14953    226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1622865605 1623 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1658
Cdd:cd14953    288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2780-2857 3.35e-38

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 138.13  E-value: 3.35e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 2780 EEKARVLELARQRAVRQAWAREQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNIHFMR 2857
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1684-2556 5.67e-34

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 144.13  E-value: 5.67e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1684 FDTTGKHLYTQSLPTGDYLYNFTYTGDGDVTLITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALKSVTTQGH 1763
Cdd:COG3209    184 VGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTT 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1764 ELAMMTYHGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAF 1843
Cdd:COG3209    264 GAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTG 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1844 YTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFG 1923
Cdd:COG3209    344 GTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAA 423
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1924 RRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSlWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYD 2003
Cdd:COG3209    424 GALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGA-ATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTL 502
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2004 QAGRIISRIFADGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASV 2083
Cdd:COG3209    503 DDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTT 582
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2084 IQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTINLQNEGFTCTIRYRQIGPLIDRQI 2163
Cdd:COG3209    583 GTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGT 662
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2164 FRFTEEGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAY 2243
Cdd:COG3209    663 TGTGTGVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2244 GRMKEVQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSP 2323
Cdd:COG3209    742 GTLTTTSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVIT 820
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2324 GNSARLTPL-----RYDLRDRITRLGDVQykmdEDGFLRQRggdiFEYNSAGLLIKAynRAGGWSVRYRYDGLGRRVSSK 2398
Cdd:COG3209    821 VGSGGGTDLqdrtyTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTSRT 890
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2399 SSHSHHLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYT 2478
Cdd:COG3209    891 DGGTTTYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYD 949
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 2479 AYGEIYMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhELWQHLSssnimpFNLYMFKNNNPISNS 2556
Cdd:COG3209    950 PFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-PIGLAGG------LNLYAYVGNNPVNYV 1020
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2477-2556 4.16e-11

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 60.98  E-value: 4.16e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2477 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwqhlsssnimPF------NLYMFKNN 2550
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 1622865605 2551 NPISNS 2556
Cdd:TIGR03696   67 NPVNWV 72
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
928-958 2.29e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.83  E-value: 2.29e-09
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1622865605  928 SMETACGDSKDNDGDGLVDCMDPDCCLQPLC 958
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1327-1658 1.41e-08

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 58.49  E-value: 1.41e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1327 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilELRNKDFRHSHSpahkyyLTTDPmSGAVFLSDTNSRRVFKI--K 1401
Cdd:COG4257     19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIdpK 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1402 STVVvkdlvknsEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRID-QNGIISTLLG 1478
Cdd:COG4257     89 TGEI--------TTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1479 snDLTSARplscdsvmdisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAI 1558
Cdd:COG4257    141 --PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYAL 183
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1559 HATLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLA 1638
Cdd:COG4257    184 PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVA 237
                          330       340
                   ....*....|....*....|
gi 1622865605 1639 VCADGELYVADLGNIRIRFI 1658
Cdd:COG4257    238 VDGDGRVWFAESGANRIVRF 257
RHS_core NF041261
RHS element core protein;
1975-2393 2.48e-08

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 60.02  E-value: 2.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1975 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRIISRIFADGKTWSYTYLEKS---MVLLLHSQRQYIFEFDK 2044
Cdd:NF041261   401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYSLNVVSgdiTDITTPDGRETKFYYND 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2045 NDRLSSVTMPNvarqTLETIRsvgyyrniyqppegnasviqDFTEDGHLLhtfylgtgrrviykygklsklAETLYDTTK 2124
Cdd:NF041261   479 GNQLTSVTSPD----GLESRR--------------------EYDEPGRLV---------------------SETSRSGET 513
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2125 VSFTYDETAGMLKTINLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR 2204
Cdd:NF041261   514 TRYRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYR 581
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2205 -YDD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKK 2274
Cdd:NF041261   582 rYDNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTL 654
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2275 ELKvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYK 2349
Cdd:NF041261   655 TNE-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQ 727
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1622865605 2350 MDEDGFLRQrggdiFEYNSAGLLIkaynraggwSVRYRYDGLGR 2393
Cdd:NF041261   728 YDEHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
RHS_core NF041261
RHS element core protein;
2203-2539 4.33e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 52.70  E-value: 4.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2203 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2264
Cdd:NF041261   367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2265 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2335
Cdd:NF041261   443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2336 lrDRITRLGDVqyKMDEDGFLRQrggdiFEYNSAGLLIkAYNRAGGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2415
Cdd:NF041261   520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2416 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2495
Cdd:NF041261   587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1622865605 2496 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWQHLSSSNI 2539
Cdd:NF041261   659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1362-1662 1.48e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.01  E-value: 1.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1362 RNKDFRHSHSPAhKY--YLTTDPMSGAVFLSDTNSRRVfkikstvVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1435
Cdd:PLN02919   556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1436 kateATLTNPRGITVDKFGLIYFVDGT---MIRRID-QNGIISTLLGS----NDLTSARPLScdsvmdiSQVrLEWPTDL 1507
Cdd:PLN02919   621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1508 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1555
Cdd:PLN02919   689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1556 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1607
Cdd:PLN02919   769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1622865605 1608 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1662
Cdd:PLN02919   847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
659-817 2.37e-05

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 48.85  E-value: 2.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  659 DNCPSNCYGNGDCISGTCH-----------------CFLGFLGPD---CGRASCpvlcsGNGQ----------YMKGRCL 708
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTPKascCGGVTC-----GAGQtcdaktntcvYVKGYCS 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  709 C-HSGWKGAECDVPTNQCIDVACSNHGT---CIMG-----------------TCIC-NPG-YK-GESC-----------E 753
Cdd:pfam19232   85 AdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGfgawiweldpatnsgvwRCRCaNGSlYNsAHECspladqtlcaaE 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  754 EVD-----------------------CMDPTCSGRGVC--VRGECHCSVGWGGTNCETPRAtcldqCSGHGTFLPDTGLC 808
Cdd:pfam19232  165 NLDpnalvpassvpafaaygwgnqpvLINKSTAGAAVPspLAGVCPCKPGWAGGSCTEDRT-----CNGRGTWNETTGQC 239

                   ....*....
gi 1622865605  809 SCDPSWTGH 817
Cdd:pfam19232  240 ACNIDFSGH 248
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
841-884 4.60e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 43.00  E-value: 4.60e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1622865605  841 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 884
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
730-753 5.53e-04

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 39.54  E-value: 5.53e-04
                           10        20
                   ....*....|....*....|....*...
gi 1622865605  730 CSNHGTCIMG----TCICNPGYKGESCE 753
Cdd:cd00054     11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1775-1807 5.94e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.94e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622865605 1775 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1807
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
36-435 0e+00

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 579.24  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605   36 SLT-RRRDAERRYTSSSADSEEGKTP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGANFTLRELGLGEV 113
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  114 TSPHGTLYRTDIGLPHCGYSMGASSDADMEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 190
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  191 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPGGLQNHAR 270
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  271 LRTPPPPLSHTHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPASgaQEPAHAQDNWLLNSNIPLETRnlgkq 350
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQST--QESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  351 pflgtlqdnliemdilgasrhdgaygdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFARPAFNLKKPS 429
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 1622865605  430 KYCNWK 435
Cdd:pfam06484  362 KYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1317-1658 7.34e-46

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 169.25  E-value: 7.34e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1317 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRHSHSPAHKYY----LTTDPmSGA 1386
Cdd:cd14953     11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1387 VFLSDTNSRRVFKIKSTVVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1464
Cdd:cd14953     90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1465 RRIDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1542
Cdd:cd14953    156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1543 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1622
Cdd:cd14953    226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1622865605 1623 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1658
Cdd:cd14953    288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2780-2857 3.35e-38

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 138.13  E-value: 3.35e-38
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 2780 EEKARVLELARQRAVRQAWAREQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNIHFMR 2857
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1684-2556 5.67e-34

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 144.13  E-value: 5.67e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1684 FDTTGKHLYTQSLPTGDYLYNFTYTGDGDVTLITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALKSVTTQGH 1763
Cdd:COG3209    184 VGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTT 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1764 ELAMMTYHGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAF 1843
Cdd:COG3209    264 GAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTG 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1844 YTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFG 1923
Cdd:COG3209    344 GTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAA 423
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1924 RRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSlWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYD 2003
Cdd:COG3209    424 GALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGA-ATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTL 502
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2004 QAGRIISRIFADGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASV 2083
Cdd:COG3209    503 DDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTT 582
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2084 IQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTINLQNEGFTCTIRYRQIGPLIDRQI 2163
Cdd:COG3209    583 GTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGT 662
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2164 FRFTEEGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAY 2243
Cdd:COG3209    663 TGTGTGVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2244 GRMKEVQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSP 2323
Cdd:COG3209    742 GTLTTTSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVIT 820
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2324 GNSARLTPL-----RYDLRDRITRLGDVQykmdEDGFLRQRggdiFEYNSAGLLIKAynRAGGWSVRYRYDGLGRRVSSK 2398
Cdd:COG3209    821 VGSGGGTDLqdrtyTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTSRT 890
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2399 SSHSHHLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYT 2478
Cdd:COG3209    891 DGGTTTYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYD 949
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 2479 AYGEIYMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhELWQHLSssnimpFNLYMFKNNNPISNS 2556
Cdd:COG3209    950 PFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-PIGLAGG------LNLYAYVGNNPVNYV 1020
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1322-1658 4.75e-18

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 86.99  E-value: 4.75e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1322 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRHSHSPAHkyyLTTDPmSGAVFLSDTNSRRVFK 1399
Cdd:cd05819      5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1400 IKStvvvkdlvkNSEVVAGTGdqclpfddtrcGDGGKATEatLTNPRGITVDKFGLIYFVDgTM---IRRIDQNGIISTL 1476
Cdd:cd05819     81 FDP---------DGNFLASFG-----------GSGDGDGE--FNGPRGIAVDSSGNIYVAD-TGnhrIQKFDPDGEFLTT 137
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1477 LGSNDLTSARplscdsvmdisqvrLEWPTDLAINPmDNSLYVLDnnvvlqiSENHQVRIVAgrpmhcqvPGiDHFLL--- 1553
Cdd:cd05819    138 FGSGGSGPGQ--------------FNGPTGVAVDS-DGNIYVAD-------TGNHRIQVFD--------PD-GNFLTtfg 186
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1554 SKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcfsgddgyaKDAKLNT 1633
Cdd:cd05819    187 STGTGPGQFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNGNFLG-------------------SDGQFNR 244
                          330       340
                   ....*....|....*....|....*
gi 1622865605 1634 PSSLAVCADGELYVADLGNIRIRFI 1658
Cdd:cd05819    245 PSGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1442-1726 7.74e-18

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 86.22  E-value: 7.74e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1442 LTNPRGITVDKFGLIYFVDGTM--IRRIDQNGIISTLLGSNDltsarplscdsvmdISQVRLEWPTDLAINPmDNSLYVL 1519
Cdd:cd05819      7 LNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNLYVA 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1520 D--NNVVLQISENHQVRIVAGRPmhcqvpGIDHFLLSkvaihatleSATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1597
Cdd:cd05819     72 DtgNHRIQKFDPDGNFLASFGGS------GDGDGEFN---------GPRGIAVDSSGNIYVADTGN---HRIQKFDPDGE 133
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1598 ISLVAGAPSGCDckndancdcfsgddgyakdAKLNTPSSLAVCADGELYVADLGNIRIRFIrknkpflntqnmyelsSPI 1677
Cdd:cd05819    134 FLTTFGSGGSGP-------------------GQFNGPTGVAVDSDGNIYVADTGNHRIQVF----------------DPD 178
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1622865605 1678 DQELYLFDTTGKHLYTQSLPTGDylynfTYTGDGDVtLITDNNGNMVNV 1726
Cdd:cd05819    179 GNFLTTFGSTGTGPGQFNYPTGI-----AVDSDGNI-YVADSGNNRVQV 221
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1314-1527 2.54e-14

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 76.80  E-value: 2.54e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1314 SCNGLADGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNilelrnkdfrhshspahkyylttdpmsgavflsd 1391
Cdd:cd14953    176 AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTT---------------------------------- 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1392 tnsrrvfkikstvvvkdlvknsevVAGTGDQclPFddtrcGDGGKATEATLTNPRGITVDKFGLIYFVD---GTmIRRID 1468
Cdd:cd14953    222 ------------------------VAGTGTA--GF-----SGDGGATAAQLNNPTGVAVDAAGNLYVADsgnHR-IRKIT 269
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622865605 1469 QNGIISTLLGSndlTSARPLSCDSVmdiSQVRLEWPTDLAINPmDNSLYVLD--NNVVLQI 1527
Cdd:cd14953    270 PAGVVTTVAGG---GAGFSGDGGPA---TSAQFNNPTGVAVDA-AGNLYVADtgNNRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1320-1589 2.52e-13

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 72.74  E-value: 2.52e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1320 DGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTN---ILELRNKDFRHshsPahkYYLTTDPmSGAVFLSDTNS 1394
Cdd:cd05819     50 GDGQFNEPAGVAVDSDGNLYVADTgnHRIQKFDPDGNFLAsfgGSGDGDGEFNG---P---RGIAVDS-SGNIYVADTGN 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1395 RRVFKIKStvvvkdlvkNSEVVAGTGdqclpfddtrcgdGGKATEATLTNPRGITVDKFGLIYFVDGT--MIRRIDQNGI 1472
Cdd:cd05819    123 HRIQKFDP---------DGEFLTTFG-------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGN 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1473 ISTLLGSNDLTSArplscdsvmdisqvRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGrpmhcqvpgidh 1550
Cdd:cd05819    181 FLTTFGSTGTGPG--------------QFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG------------ 233
                          250       260       270
                   ....*....|....*....|....*....|....*....
gi 1622865605 1551 fllSKVAIHATLESATALAVSHNGVLYIAETDEKKINRI 1589
Cdd:cd05819    234 ---NFLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2477-2556 4.16e-11

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 60.98  E-value: 4.16e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2477 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwqhlsssnimPF------NLYMFKNN 2550
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 1622865605 2551 NPISNS 2556
Cdd:TIGR03696   67 NPVNWV 72
acid_disulf_rpt NF033662
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ...
928-958 2.29e-09

acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.


Pssm-ID: 411265 [Multi-domain]  Cd Length: 32  Bit Score: 54.83  E-value: 2.29e-09
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1622865605  928 SMETACGDSKDNDGDGLVDCMDPDCCLQPLC 958
Cdd:NF033662     2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1384-1655 6.88e-09

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 59.14  E-value: 6.88e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1384 SGAVFLSDTNSRRVFKikstvvvkdlvknseVVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLIYFVDGt 1462
Cdd:cd14952     20 AGNVYVADSGNNRVLK---------------LAAGSTTQTvLPFTG-------------LYQPQGVAVDAAGTVYVTDF- 70
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1463 mirriDQNGIISTLLGSNDLTsarPLSCDSvmdisqvrLEWPTDLAINPMDNsLYVLD--NNVVLqisenhqvRIVAGRP 1540
Cdd:cd14952     71 -----GNNRVLKLAAGSTTQT---VLPFTG--------LNDPTGVAVDAAGN-VYVADtgNNRVL--------KLAAGSN 125
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1541 MHCQVPgidhFllskvaihATLESATALAVSHNGVLYIAETDEkkiNRIRQvttsgeisLVAGA------Psgcdcknda 1614
Cdd:cd14952    126 TQTVLP----F--------TGLSNPDGVAVDGAGNVYVTDTGN---NRVLK--------LAAGSttqtvlP--------- 173
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1622865605 1615 ncdcFSGddgyakdakLNTPSSLAVCADGELYVADLGNIRI 1655
Cdd:cd14952    174 ----FTG---------LNSPSGVAVDTAGNVYVTDHGNNRV 201
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1327-1658 1.41e-08

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 58.49  E-value: 1.41e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1327 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilELRNKDFRHSHSpahkyyLTTDPmSGAVFLSDTNSRRVFKI--K 1401
Cdd:COG4257     19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIdpK 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1402 STVVvkdlvknsEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRID-QNGIISTLLG 1478
Cdd:COG4257     89 TGEI--------TTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1479 snDLTSARplscdsvmdisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAI 1558
Cdd:COG4257    141 --PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYAL 183
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1559 HATLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLA 1638
Cdd:COG4257    184 PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVA 237
                          330       340
                   ....*....|....*....|
gi 1622865605 1639 VCADGELYVADLGNIRIRFI 1658
Cdd:COG4257    238 VDGDGRVWFAESGANRIVRF 257
RHS_core NF041261
RHS element core protein;
1975-2393 2.48e-08

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 60.02  E-value: 2.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1975 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRIISRIFADGKTWSYTYLEKS---MVLLLHSQRQYIFEFDK 2044
Cdd:NF041261   401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYSLNVVSgdiTDITTPDGRETKFYYND 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2045 NDRLSSVTMPNvarqTLETIRsvgyyrniyqppegnasviqDFTEDGHLLhtfylgtgrrviykygklsklAETLYDTTK 2124
Cdd:NF041261   479 GNQLTSVTSPD----GLESRR--------------------EYDEPGRLV---------------------SETSRSGET 513
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2125 VSFTYDETAGMLKTINLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR 2204
Cdd:NF041261   514 TRYRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYR 581
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2205 -YDD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKK 2274
Cdd:NF041261   582 rYDNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTL 654
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2275 ELKvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYK 2349
Cdd:NF041261   655 TNE-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQ 727
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1622865605 2350 MDEDGFLRQrggdiFEYNSAGLLIkaynraggwSVRYRYDGLGR 2393
Cdd:NF041261   728 YDEHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1444-1655 4.43e-07

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 54.21  E-value: 4.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1444 NPRGITVDKFGLIYFVD--GTMIRRIDQNGIISTLLGSndlTSARPLScdsvmdisqvrLEWPTDLAINPmDNSLYVLDn 1521
Cdd:cd14956    108 APRGVAVDADGNLYVADfgNQRIQKFDPDGSFLRQWGG---TGIEPGS-----------FNYPRGVAVDP-DGTLYVAD- 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1522 nvvlqiSENHQVrivagrpmhcQVPGIDHFLLSKVAIHAT----LESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1597
Cdd:cd14956    172 ------TYNDRI----------QVFDNDGAFLRKWGGRGTgpgqFNYPYGIAIDPDGNVFVADFGN---NRIQKFTADGT 232
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 1598 ISLVAGAPSGcdckndancdcfsgddgyaKDAKLNTPSSLAVCADGELYVADLGNIRI 1655
Cdd:cd14956    233 FLTSWGSPGT-------------------GPGQFKNPWGVVVDADGTVYVADSNNNRV 271
RHS_core NF041261
RHS element core protein;
2203-2539 4.33e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 52.70  E-value: 4.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2203 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2264
Cdd:NF041261   367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2265 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2335
Cdd:NF041261   443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2336 lrDRITRLGDVqyKMDEDGFLRQrggdiFEYNSAGLLIkAYNRAGGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2415
Cdd:NF041261   520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 2416 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2495
Cdd:NF041261   587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1622865605 2496 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWQHLSSSNI 2539
Cdd:NF041261   659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1440-1658 6.19e-06

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 50.40  E-value: 6.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1440 ATLTNPRGITVDKFGLIYFVD--GTMIRRID-QNGIISTllgsndltsarplscdsvmdISQVRLEWPTDLAINPmDNSL 1516
Cdd:COG4257     14 APGSGPRDVAVDPDGAVWFTDqgGGRIGRLDpATGEFTE--------------------YPLGGGSGPHGIAVDP-DGNL 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1517 YVLD--NNVVLQIS-ENHQVRIVAGrpmhcqvPGIDHFLlskvaihatlesaTALAVSHNGVLYIAETDekkINRIRQVT 1593
Cdd:COG4257     73 WFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNP-------------HGIAFDPDGNLWFTDQG---GNRIGRLD 129
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1594 T-SGEISLV-----AGAPSGCDCKND---------ANC-DCFSGDDG----YAKDAKLNTPSSLAVCADGELYVADLGNI 1653
Cdd:COG4257    130 PaTGEVTEFplptgGAGPYGIAVDPDgnlwvtdfgANAiGRIDPDTGtlteYALPTPGAGPRGLAVDPDGNLWVADTGSG 209

                   ....*
gi 1622865605 1654 RIRFI 1658
Cdd:COG4257    210 RIGRF 214
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1362-1662 1.48e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.01  E-value: 1.48e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1362 RNKDFRHSHSPAhKY--YLTTDPMSGAVFLSDTNSRRVfkikstvVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1435
Cdd:PLN02919   556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1436 kateATLTNPRGITVDKFGLIYFVDGT---MIRRID-QNGIISTLLGS----NDLTSARPLScdsvmdiSQVrLEWPTDL 1507
Cdd:PLN02919   621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1508 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1555
Cdd:PLN02919   689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1556 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1607
Cdd:PLN02919   769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1622865605 1608 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1662
Cdd:PLN02919   847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
DUF5885 pfam19232
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ...
659-817 2.37e-05

Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.


Pssm-ID: 437064  Cd Length: 265  Bit Score: 48.85  E-value: 2.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  659 DNCPSNCYGNGDCISGTCH-----------------CFLGFLGPD---CGRASCpvlcsGNGQ----------YMKGRCL 708
Cdd:pfam19232   10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTPKascCGGVTC-----GAGQtcdaktntcvYVKGYCS 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  709 C-HSGWKGAECDVPTNQCIDVACSNHGT---CIMG-----------------TCIC-NPG-YK-GESC-----------E 753
Cdd:pfam19232   85 AdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGfgawiweldpatnsgvwRCRCaNGSlYNsAHECspladqtlcaaE 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605  754 EVD-----------------------CMDPTCSGRGVC--VRGECHCSVGWGGTNCETPRAtcldqCSGHGTFLPDTGLC 808
Cdd:pfam19232  165 NLDpnalvpassvpafaaygwgnqpvLINKSTAGAAVPspLAGVCPCKPGWAGGSCTEDRT-----CNGRGTWNETTGQC 239

                   ....*....
gi 1622865605  809 SCDPSWTGH 817
Cdd:pfam19232  240 ACNIDFSGH 248
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1445-1678 3.09e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 48.42  E-value: 3.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1445 PRGITVDKFGLIYFVD--GTMIRRIDQNGIISTLLGSNDltsarplscdsvmdISQVRLEWPTDLAINPMDNsLYVLDnn 1522
Cdd:cd14957     20 PRGIAVDSAGNIYVADtgNNRIQVFTSSGVYSYSIGSGG--------------TGSGQFNSPYGIAVDSNGN-IYVAD-- 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1523 vvlqiSENHQVRIvagrpmhcqvpgidhFLLSKVAIHA---------TLESATALAVSHNGVLYIAETDEkkiNRIrQVT 1593
Cdd:cd14957     83 -----TDNNRIQV---------------FNSSGVYQYSigtggsgdgQFNGPYGIAVDSNGNIYVADTGN---HRI-QVF 138
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1594 TSgeislvAGAPsgcdckndancdCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNKPFLNT-----Q 1668
Cdd:cd14957    139 TS------SGTF------------SYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQVFTSSGTFQYTfgssgS 200
                          250
                   ....*....|
gi 1622865605 1669 NMYELSSPID 1678
Cdd:cd14957    201 GPGQFSDPYG 210
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1434-1655 4.23e-05

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 48.05  E-value: 4.23e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1434 GGKATEA-TLTNPRGITVDKFGLIYFVDGT--MIRRIDQNGIISTLLGSNdltSARPLSCDSvmdisqvrlewPTDLAIN 1510
Cdd:cd14956     50 GTTGDGPgQFGRPRGLAVDKDGWLYVADYWgdRIQVFTLTGELQTIGGSS---GSGPGQFNA-----------PRGVAVD 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1511 PmDNSLYVLD--NNVVLQISENHQ-VRIVAGRPmhcQVPGidHFLlskvaihatleSATALAVSHNGVLYIAETdekKIN 1587
Cdd:cd14956    116 A-DGNLYVADfgNQRIQKFDPDGSfLRQWGGTG---IEPG--SFN-----------YPRGVAVDPDGTLYVADT---YND 175
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865605 1588 RIRQVTTSGEISLVAGAPSGcdckndancdcFSGDdgyakdakLNTPSSLAVCADGELYVADLGNIRI 1655
Cdd:cd14956    176 RIQVFDNDGAFLRKWGGRGT-----------GPGQ--------FNYPYGIAIDPDGNVFVADFGNNRI 224
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
841-884 4.60e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 43.00  E-value: 4.60e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1622865605  841 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 884
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1563-1722 4.91e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 48.03  E-value: 4.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1563 ESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCAD 1642
Cdd:cd14957     65 NSPYGIAVDSNGNIYVADTDN---NRIQVFNSSGVYQYSIG----------------TGGSG---DGQFNGPYGIAVDSN 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1643 GELYVADLGNIRIRFIRKNKPFLNT-----QNMYELSSP----IDQE--LYLFDTTGK--HLYTqslPTGDYLYNFTYTG 1709
Cdd:cd14957    123 GNIYVADTGNHRIQVFTSSGTFSYSigsggTGPGQFNGPqgiaVDSDgnIYVADTGNHriQVFT---SSGTFQYTFGSSG 199
                          170
                   ....*....|....*....
gi 1622865605 1710 DGDVTLIT------DNNGN 1722
Cdd:cd14957    200 SGPGQFSDpygiavDSDGN 218
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1598-1658 7.78e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 47.52  E-value: 7.78e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622865605 1598 ISLVAGAPSGcdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1658
Cdd:cd14953      1 VSTVAGSGTA------------GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKI 49
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1327-1468 8.18e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 46.94  E-value: 8.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1327 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRhshsPahkYYLTTDPmSGAVFLSDTNSRRVFKIkstv 1404
Cdd:COG4257    147 PYGIAVDPDGNLWVTDFgaNAIGRIDPDTGTLTEYALPTPGAG----P---RGLAVDP-DGNLWVADTGSGRIGRF---- 214
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622865605 1405 vvkdlvknsevvagtgdqclpfdDTRCGDGGK-ATEATLTNPRGITVDKFGLIYFVDGT--MIRRID 1468
Cdd:COG4257    215 -----------------------DPKTGTVTEyPLPGGGARPYGVAVDGDGRVWFAESGanRIVRFD 258
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1771-1811 2.77e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 40.65  E-value: 2.77e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1622865605 1771 HGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSF 1811
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1569-1659 3.39e-04

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 45.65  E-value: 3.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1569 AVSHNGVLYIAETdekKINRIRQV-TTSGEISLVAGapsgcdckndancdcfSGDDGYA-KDAKLNTPSSLAVCADGELY 1646
Cdd:cd14951    202 AALPDGSVYVADT---YNHKIKRVdPATGEVSTLAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLY 262
                           90
                   ....*....|...
gi 1622865605 1647 VADLGNIRIRFIR 1659
Cdd:cd14951    263 VADTNNHRIRRLD 275
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
730-753 5.53e-04

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 39.54  E-value: 5.53e-04
                           10        20
                   ....*....|....*....|....*...
gi 1622865605  730 CSNHGTCIMG----TCICNPGYKGESCE 753
Cdd:cd00054     11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1775-1807 5.94e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.94e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1622865605 1775 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1807
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1323-1581 6.08e-04

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 44.20  E-value: 6.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1323 KLLAPVALTCGSDGSLYVGDFnYIRRI--F-PSGnvtnilelrnkDFRHSHspAHKYYLTTDPMSGAVFLSDTNSRrVFK 1399
Cdd:cd14963     54 EFKYPYGIAVDSDGNIYVADL-YNGRIqvFdPDG-----------KFLKYF--PEKKDRVKLISPAGLAIDDGKLY-VSD 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1400 IK-STVVVKDLvknsevvagTGDQCLPFddtrcGDGGKAtEATLTNPRGITVDKFGLIYFVDgTMIRRI---DQNG-IIS 1474
Cdd:cd14963    119 VKkHKVIVFDL---------EGKLLLEF-----GKPGSE-PGELSYPNGIAVDEDGNIYVAD-SGNGRIqvfDKNGkFIK 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1475 TLLGSNDLTSArplscdsvmdisqvrLEWPTDLAINPmDNSLYVLDN--NVVLQISENHQVRIVAGRpmhcqvPGIDhfl 1552
Cdd:cd14963    183 ELNGSPDGKSG---------------FVNPRGIAVDP-DGNLYVVDNlsHRVYVFDEQGKELFTFGG------RGKD--- 237
                          250       260
                   ....*....|....*....|....*....
gi 1622865605 1553 lskvaiHATLESATALAVSHNGVLYIAET 1581
Cdd:cd14963    238 ------DGQFNLPNGLFIDDDGRLYVTDR 260
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
729-752 9.53e-04

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 38.87  E-value: 9.53e-04
                           10        20
                   ....*....|....*....|....*.
gi 1622865605  729 ACSNHGTCIM--GTCICNPGYKGESC 752
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
859-881 3.84e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 3.84e-03
                           10        20
                   ....*....|....*....|....*
gi 1622865605  859 CAEHGTCRD--GKCECSPGWNGEHC 881
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1435-1537 4.15e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 42.18  E-value: 4.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1435 GKATEATLTNPRGITVDKFGLIYFVDgTM---IRRID-QNGIISTLLGSNDLTSarplscdsvmDISQVRLEWPTDLAIN 1510
Cdd:cd14951    188 GPGAEALLQHPLGVAALPDGSVYVAD-TYnhkIKRVDpATGEVSTLAGTGKAGY----------KDLEAQFSEPSGLVVD 256
                           90       100
                   ....*....|....*....|....*..
gi 1622865605 1511 PmDNSLYVLDNNvvlqiseNHQVRIVA 1537
Cdd:cd14951    257 G-DGRLYVADTN-------NHRIRRLD 275
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1322-1400 4.29e-03

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 41.93  E-value: 4.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865605 1322 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTNilelrnkdFRHSHSPAHKYYLTTDPmSGAVFLSDTNSRRVF 1398
Cdd:COG4257    185 TPGAGPRGLAVDPDGNLWVADTgsGRIGRFDPkTGTVTE--------YPLPGGGARPYGVAVDG-DGRVWFAESGANRIV 255

                   ..
gi 1622865605 1399 KI 1400
Cdd:COG4257    256 RF 257
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
730-750 5.75e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 36.59  E-value: 5.75e-03
                           10        20
                   ....*....|....*....|....*
gi 1622865605  730 CSNHGTCIMG----TCICNPGYKGE 750
Cdd:pfam00008    6 CSNGGTCVDTpggyTCICPEGYTGK 30
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
795-819 6.40e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.17  E-value: 6.40e-03
                           10        20
                   ....*....|....*....|....*
gi 1622865605  795 CSGHGTFLPDTGLCSCDPSWTGHDC 819
Cdd:pfam07974    2 CSGRGTCVNQCGKCVCDSGYQGATC 26
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH