NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568950580|ref|XP_006507870|]
View 

teneurin-4 isoform X11 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
11-410 1.84e-179

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


:

Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 554.97  E-value: 1.84e-179
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580    11 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEM 88
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580    89 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 165
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   166 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSSLQNHPR 245
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   246 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRnlgkq 325
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   326 pflgtlqdnliemdilsasrhdgaysdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPS 404
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 568950580   405 KYCNWK 410
Cdd:pfam06484  362 KYCSWK 367
NHL super family cl18310
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1294-1635 3.93e-47

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


The actual alignment was detected with superfamily member cd14953:

Pssm-ID: 302697 [Multi-domain]  Cd Length: 323  Bit Score: 173.10  E-value: 3.93e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1294 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1363
Cdd:cd14953    11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1364 VFLSDTNSRRVFKVKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1441
Cdd:cd14953    90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1442 RRVDQNGIISTLLGsndlTSARPLSCDSVMeiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1519
Cdd:cd14953   156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1520 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1599
Cdd:cd14953   226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 568950580 1600 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1635
Cdd:cd14953   288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2757-2834 3.22e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


:

Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.22e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568950580  2757 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2834
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1668-2533 1.58e-33

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


:

Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.59  E-value: 1.58e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1668 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1747
Cdd:COG3209   191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1748 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1825
Cdd:COG3209   271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1826 DQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1905
Cdd:COG3209   351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1906 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1985
Cdd:COG3209   431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1986 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2065
Cdd:COG3209   511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2066 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2145
Cdd:COG3209   588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2146 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2225
Cdd:COG3209   668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2226 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2305
Cdd:COG3209   747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2306 LTPL-----RYDLRDRITRLgdvqykmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2380
Cdd:COG3209   826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2381 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2460
Cdd:COG3209   896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                         810       820       830       840       850       860       870
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568950580 2461 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnSIVP---FHLYMFKNNNPISNS 2533
Cdd:COG3209   955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
DSL super family cl19567
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
818-861 4.56e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


The actual alignment was detected with superfamily member pfam01414:

Pssm-ID: 473190  Cd Length: 46  Bit Score: 43.00  E-value: 4.56e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 568950580   818 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 861
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
Keratin_B2 super family cl37504
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
699-847 1.30e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


The actual alignment was detected with superfamily member pfam01500:

Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   699 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 773
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   774 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 846
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 568950580   847 C 847
Cdd:pfam01500  153 C 153
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
11-410 1.84e-179

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 554.97  E-value: 1.84e-179
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580    11 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEM 88
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580    89 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 165
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   166 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSSLQNHPR 245
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   246 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRnlgkq 325
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   326 pflgtlqdnliemdilsasrhdgaysdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPS 404
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 568950580   405 KYCNWK 410
Cdd:pfam06484  362 KYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1294-1635 3.93e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 173.10  E-value: 3.93e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1294 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1363
Cdd:cd14953    11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1364 VFLSDTNSRRVFKVKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1441
Cdd:cd14953    90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1442 RRVDQNGIISTLLGsndlTSARPLSCDSVMeiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1519
Cdd:cd14953   156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1520 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1599
Cdd:cd14953   226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 568950580 1600 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1635
Cdd:cd14953   288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2757-2834 3.22e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.22e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568950580  2757 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2834
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1668-2533 1.58e-33

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.59  E-value: 1.58e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1668 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1747
Cdd:COG3209   191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1748 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1825
Cdd:COG3209   271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1826 DQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1905
Cdd:COG3209   351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1906 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1985
Cdd:COG3209   431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1986 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2065
Cdd:COG3209   511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2066 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2145
Cdd:COG3209   588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2146 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2225
Cdd:COG3209   668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2226 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2305
Cdd:COG3209   747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2306 LTPL-----RYDLRDRITRLgdvqykmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2380
Cdd:COG3209   826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2381 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2460
Cdd:COG3209   896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                         810       820       830       840       850       860       870
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568950580 2461 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnSIVP---FHLYMFKNNNPISNS 2533
Cdd:COG3209   955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2454-2533 2.91e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 58.67  E-value: 2.91e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580  2454 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnsivPF------HLYMFKNN 2527
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 568950580  2528 NPISNS 2533
Cdd:TIGR03696   67 NPVNWV 72
RHS_core NF041261
RHS element core protein;
1952-2370 3.48e-09

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 62.71  E-value: 3.48e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1952 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRITSRIFADGKMWSYTyleksmvLHLHSqrqyifefdknDR 2024
Cdd:NF041261  401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYS-------LNVVS-----------GD 460
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2025 LSSVTMPNVarqtletiRSVGYYRNiyqppegnasviqdfteDGHLLHTFYLGTGRRVIYKYGKLSKL-AETLYDTTKVS 2103
Cdd:NF041261  461 ITDITTPDG--------RETKFYYN-----------------DGNQLTSVTSPDGLESRREYDEPGRLvSETSRSGETTR 515
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2104 FTYDETAGMLKTVNLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR-Y 2182
Cdd:NF041261  516 YRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYRrY 583
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2183 DD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKKEL 2253
Cdd:NF041261  584 DNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTLTN 656
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2254 KvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYKMD 2328
Cdd:NF041261  657 E-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQYD 729
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|..
gi 568950580 2329 EDGFLRQrggdvFEYNSAGLLIkaynrasgwSVRYRYDGLGR 2370
Cdd:NF041261  730 EHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1304-1635 3.70e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.42  E-value: 3.70e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1304 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilEMRNKDFRHSHSpahkyyLATDPmSGAVFLSDTNSRRVFKVKST 1380
Cdd:COG4257    19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIDPK 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1381 TvvkdlvKNSEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRVD-QNGIISTLLGsn 1457
Cdd:COG4257    89 T------GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL-- 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1458 DLTSARplscdsvmeisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAIHA 1537
Cdd:COG4257   141 PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYALPT 185
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1538 TLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLAVC 1617
Cdd:COG4257   186 PGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVAVD 239
                         330
                  ....*....|....*...
gi 568950580 1618 ADGELYVADLGNIRIRFI 1635
Cdd:COG4257   240 GDGRVWFAESGANRIVRF 257
RHS_core NF041261
RHS element core protein;
2180-2516 2.48e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 53.47  E-value: 2.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2180 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2241
Cdd:NF041261  367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2242 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2312
Cdd:NF041261  443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2313 lrDRITRLGDVqyKMDEDGFLRQrggdvFEYNSAGLLIkAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2392
Cdd:NF041261  520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2393 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2472
Cdd:NF041261  587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568950580 2473 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWKRLSSNSI 2516
Cdd:NF041261  659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1339-1639 1.07e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.39  E-value: 1.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1339 RNKDFRHSHSPAhKY--YLATDPMSGAVFLSDTNSRRVfkvksttVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1412
Cdd:PLN02919  556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1413 kateATLTNPRGITVDKFGLIYFVDGT---MIRRVD-QNGIISTLLGS----NDLTSARPLScdsvmeiSQVrLEWPTDL 1484
Cdd:PLN02919  621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1485 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1532
Cdd:PLN02919  689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1533 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1584
Cdd:PLN02919  769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 568950580 1585 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1639
Cdd:PLN02919  847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
818-861 4.56e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 43.00  E-value: 4.56e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 568950580   818 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 861
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1752-1784 5.90e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.90e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 568950580  1752 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1784
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
699-847 1.30e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   699 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 773
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   774 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 846
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 568950580   847 C 847
Cdd:pfam01500  153 C 153
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1545-1731 2.12e-03

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 42.64  E-value: 2.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1545 LAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCADGELYV 1624
Cdd:cd14957    23 IAVDSAGNIYVADTGN---NRIQVFTSSGVYSYSIG----------------SGGTG---SGQFNSPYGIAVDSNGNIYV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1625 ADLGNIRIRfirknkpFLNTQNMYElsspidqelYLFDTSGkhlytQSLPTGDYLYNFTYTGDGDItHITDNNGNMVNVr 1704
Cdd:cd14957    81 ADTDNNRIQ-------VFNSSGVYQ---------YSIGTGG-----SGDGQFNGPYGIAVDSNGNI-YVADTGNHRIQV- 137
                         170       180
                  ....*....|....*....|....*..
gi 568950580 1705 RDSTGmplwlvvpdgqVYWVTMGTNSA 1731
Cdd:cd14957   138 FTSSG-----------TFSYSIGSGGT 153
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
707-730 3.92e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.92e-03
                          10        20
                  ....*....|....*....|....*...
gi 568950580  707 CSSHGTCIMG----TCICNPGYKGESCE 730
Cdd:cd00054    11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
 
Name Accession Description Interval E-value
Ten_N pfam06484
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ...
11-410 1.84e-179

Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).


Pssm-ID: 461932 [Multi-domain]  Cd Length: 367  Bit Score: 554.97  E-value: 1.84e-179
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580    11 SLT-RRRDAERRYTSSSADSEEGKGP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGTNFTLRELGLGEM 88
Cdd:pfam06484    1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580    89 TPPHGTLYRTDIGLPHCGYSMGASSDADLEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT---ETG 165
Cdd:pfam06484   81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKsdnENG 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   166 APLHCSSASSTPIEQSPSPPPSPpaNESQRRLLGNGVAQPTPDSDSEEEFVPNSFLVKSGSASLGVAAnDHPSSLQNHPR 245
Cdd:pfam06484  161 PPIPPSSSSSSPVEQHSPPPPSL--NENQRPLLGNNASHPILDSDPDEEFSPNSYLVRTGSGPQSAPS-EQPPNFQNHSR 237
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   246 LRTPPPPLPHAHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPagSAQEPTHAQDNWLLNSNIPLETRnlgkq 325
Cdd:pfam06484  238 LRTPPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQ--STQESVQLQDSWVLNSNVPLETR----- 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   326 pflgtlqdnliemdilsasrhdgaysdgHFLFKPG-GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFSRPAFNLKKPS 404
Cdd:pfam06484  310 ----------------------------HFLFKTGtGTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPY 361

                   ....*.
gi 568950580   405 KYCNWK 410
Cdd:pfam06484  362 KYCSWK 367
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1294-1635 3.93e-47

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 173.10  E-value: 3.93e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1294 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHKYY----LATDPmSGA 1363
Cdd:cd14953    11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1364 VFLSDTNSRRVFKVKSTTVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1441
Cdd:cd14953    90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1442 RRVDQNGIISTLLGsndlTSARPLSCDSVMeiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1519
Cdd:cd14953   156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1520 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1599
Cdd:cd14953   226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 568950580 1600 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1635
Cdd:cd14953   288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
Tox-GHH pfam15636
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ...
2757-2834 3.22e-37

GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.


Pssm-ID: 464783  Cd Length: 78  Bit Score: 135.43  E-value: 3.22e-37
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568950580  2757 EEKVRVLELARQRAVRQAWAREQQRLREGEEGLRAWTDGEKQQVLNTGRVQGYDGFFVTSVEQYPELSDSANNIHFMR 2834
Cdd:pfam15636    1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
1668-2533 1.58e-33

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.59  E-value: 1.58e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1668 LYTQSLPTGDYLYNFTYTGDGDITHITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALRSVTTQGHELAMMTY 1747
Cdd:COG3209   191 LATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAS 270
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1748 HGNSGLLATKSNENGW--TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQ 1825
Cdd:COG3209   271 GAGLDASTGTGGAGGSnaAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVG 350
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1826 DQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFGRRLRV 1905
Cdd:COG3209   351 GGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGG 430
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1906 HNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYDQAGRI 1985
Cdd:COG3209   431 TATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTT 510
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1986 TSRIFADgkmWSYTYLEKSMVLHLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFT 2065
Cdd:COG3209   511 TTTAGAR---GLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGG 587
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2066 EDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTVNLQNEGFTCTIRYRQIGPLIDRQIFRFTE 2145
Cdd:COG3209   588 TATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGT 667
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2146 EGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKE 2225
Cdd:COG3209   668 GVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTT 746
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2226 VQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSAR 2305
Cdd:COG3209   747 TSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGG 825
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2306 LTPL-----RYDLRDRITRLgdvqykmdEDGFLRQRGGDVFEYNSAGLLIKAynRASGWSVRYRYDGLGRRVSSKSSHSH 2380
Cdd:COG3209   826 GTDLqdrtyTYDAAGNITSI--------TDALRAGTLTQTYTYDALGRLTSA--TDPGTTESYTYDANGNLTSRTDGGTT 895
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2381 HLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYTAYGEI 2460
Cdd:COG3209   896 TYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDPFGNL 954
                         810       820       830       840       850       860       870
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568950580 2461 YMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnSIVP---FHLYMFKNNNPISNS 2533
Cdd:COG3209   955 LAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD----------PIGLaggLNLYAYVGNNPVNYV 1020
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1299-1635 6.94e-19

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 89.30  E-value: 6.94e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1299 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPAHkyyLATDPmSGAVFLSDTNSRRVFK 1376
Cdd:cd05819     5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1377 VKSTTVVKDlvknseVVAGTGDQCLPFDdtrcgdggkateatltNPRGITVDKFGLIYFVDgTM---IRRVDQNGIISTL 1453
Cdd:cd05819    81 FDPDGNFLA------SFGGSGDGDGEFN----------------GPRGIAVDSSGNIYVAD-TGnhrIQKFDPDGEFLTT 137
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1454 LGSNDLTSARplscdsvmeisqvrLEWPTDLAINPmDNSLYVLDnnvvlqiSENHQVRIVAgrpmhcqvPGiDHFLL--- 1530
Cdd:cd05819   138 FGSGGSGPGQ--------------FNGPTGVAVDS-DGNIYVAD-------TGNHRIQVFD--------PD-GNFLTtfg 186
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1531 SKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcfsgddgyaKDAKLNT 1610
Cdd:cd05819   187 STGTGPGQFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNGNFLG-------------------SDGQFNR 244
                         330       340
                  ....*....|....*....|....*
gi 568950580 1611 PSSLAVCADGELYVADLGNIRIRFI 1635
Cdd:cd05819   245 PSGLAVDSDGNLYVADTGNNRIQVF 269
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1419-1729 3.70e-17

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 84.29  E-value: 3.70e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1419 LTNPRGITVDKFGLIYFVDGTM--IRRVDQNGIISTLLGSNDltsarplscdsvmeISQVRLEWPTDLAINPmDNSLYVL 1496
Cdd:cd05819     7 LNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNLYVA 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1497 D--NNVVLQISENHQVRIVAGRPmhcqvpGIDHFLLSkvaihatleSATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1574
Cdd:cd05819    72 DtgNHRIQKFDPDGNFLASFGGS------GDGDGEFN---------GPRGIAVDSSGNIYVADTGN---HRIQKFDPDGE 133
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1575 ISLVAGAPSGCDckndancdcfsgddgyakdAKLNTPSSLAVCADGELYVADLGNIRIRFIrknkpflntqnmyelsSPI 1654
Cdd:cd05819   134 FLTTFGSGGSGP-------------------GQFNGPTGVAVDSDGNIYVADTGNHRIQVF----------------DPD 178
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1655 DQELYLFDTSGKHLYTQSLPTG------DYLYnFTYTGDGDITHITDN------NGNmVNVRRDSTGMPLWLVV-PDGQV 1721
Cdd:cd05819   179 GNFLTTFGSTGTGPGQFNYPTGiavdsdGNIY-VADSGNNRVQVFDPDgagfggNGN-FLGSDGQFNRPSGLAVdSDGNL 256

                  ....*...
gi 568950580 1722 YWVTMGTN 1729
Cdd:cd05819   257 YVADTGNN 264
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1291-1504 2.54e-14

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 76.80  E-value: 2.54e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1291 SCNGLADGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNilemrnkdfrhshspahkyylatdpmsgavflsd 1368
Cdd:cd14953   176 AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTT---------------------------------- 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1369 tnsrrvfkvksttvvkdlvknsevVAGTGDQclPFddtrcGDGGKATEATLTNPRGITVDKFGLIYFVD---GTmIRRVD 1445
Cdd:cd14953   222 ------------------------VAGTGTA--GF-----SGDGGATAAQLNNPTGVAVDAAGNLYVADsgnHR-IRKIT 269
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568950580 1446 QNGIISTLLGSndlTSARPLSCDSVmeiSQVRLEWPTDLAINPmDNSLYVLD--NNVVLQI 1504
Cdd:cd14953   270 PAGVVTTVAGG---GAGFSGDGGPA---TSAQFNNPTGVAVDA-AGNLYVADtgNNRIRKI 323
NHL cd05819
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ...
1297-1566 2.23e-13

NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.


Pssm-ID: 271320 [Multi-domain]  Cd Length: 269  Bit Score: 73.12  E-value: 2.23e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1297 DGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRHSHSPahkYYLATDPmSGAVFLSDTNSRRV 1374
Cdd:cd05819    50 GDGQFNEPAGVAVDSDGNLYVADTgnHRIQKFDPDGNFLASFGGSGDGDGEFNGP---RGIAVDS-SGNIYVADTGNHRI 125
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1375 FKVKSttvvkdlvkNSEVVAGTGdqclpfddtrcgdGGKATEATLTNPRGITVDKFGLIYFVDGT--MIRRVDQNGIIST 1452
Cdd:cd05819   126 QKFDP---------DGEFLTTFG-------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1453 LLGSNDLTSArplscdsvmeisqvRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGrpmhcqvpgidhfll 1530
Cdd:cd05819   184 TFGSTGTGPG--------------QFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------- 233
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 568950580 1531 SKVAIHATLESATALAVSHNGVLYIAETDEKKINRI 1566
Cdd:cd05819   234 NFLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
2454-2533 2.91e-10

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 58.67  E-value: 2.91e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580  2454 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwkrlssnsivPF------HLYMFKNN 2527
Cdd:TIGR03696    1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66

                   ....*.
gi 568950580  2528 NPISNS 2533
Cdd:TIGR03696   67 NPVNWV 72
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1355-1632 1.83e-09

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 61.07  E-value: 1.83e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1355 LATDPmSGAVFLSDTNSRRVFKVksttvvkdlvknsevVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLI 1433
Cdd:cd14952    15 VAVDA-AGNVYVADSGNNRVLKL---------------AAGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1434 YFVDGtmirrvDQNGIISTLLGSNDLTsarPLSCDSvmeisqvrLEWPTDLAINPMDNsLYVLD--NNVVLqisenhqvR 1511
Cdd:cd14952    66 YVTDF------GNNRVLKLAAGSTTQT---VLPFTG--------LNDPTGVAVDAAGN-VYVADtgNNRVL--------K 119
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1512 IVAGRPMHCQVPgidhFllskvaihATLESATALAVSHNGVLYIAETDEkkiNRIRQvttsgeisLVAGA------Psgc 1585
Cdd:cd14952   120 LAAGSNTQTVLP----F--------TGLSNPDGVAVDGAGNVYVTDTGN---NRVLK--------LAAGSttqtvlP--- 173
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*..
gi 568950580 1586 dckndancdcFSGddgyakdakLNTPSSLAVCADGELYVADLGNIRI 1632
Cdd:cd14952   174 ----------FTG---------LNSPSGVAVDTAGNVYVTDHGNNRV 201
RHS_core NF041261
RHS element core protein;
1952-2370 3.48e-09

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 62.71  E-value: 3.48e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1952 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRITSRIFADGKMWSYTyleksmvLHLHSqrqyifefdknDR 2024
Cdd:NF041261  401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYS-------LNVVS-----------GD 460
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2025 LSSVTMPNVarqtletiRSVGYYRNiyqppegnasviqdfteDGHLLHTFYLGTGRRVIYKYGKLSKL-AETLYDTTKVS 2103
Cdd:NF041261  461 ITDITTPDG--------RETKFYYN-----------------DGNQLTSVTSPDGLESRREYDEPGRLvSETSRSGETTR 515
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2104 FTYDETAGMLKTVNLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR-Y 2182
Cdd:NF041261  516 YRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYRrY 583
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2183 DD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKKEL 2253
Cdd:NF041261  584 DNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTLTN 656
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2254 KvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYKMD 2328
Cdd:NF041261  657 E-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQYD 729
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|..
gi 568950580 2329 EDGFLRQrggdvFEYNSAGLLIkaynrasgwSVRYRYDGLGR 2370
Cdd:NF041261  730 EHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1304-1635 3.70e-09

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 60.42  E-value: 3.70e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1304 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilEMRNKDFRHSHSpahkyyLATDPmSGAVFLSDTNSRRVFKVKST 1380
Cdd:COG4257    19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIDPK 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1381 TvvkdlvKNSEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRVD-QNGIISTLLGsn 1457
Cdd:COG4257    89 T------GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL-- 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1458 DLTSARplscdsvmeisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAIHA 1537
Cdd:COG4257   141 PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYALPT 185
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1538 TLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLAVC 1617
Cdd:COG4257   186 PGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVAVD 239
                         330
                  ....*....|....*...
gi 568950580 1618 ADGELYVADLGNIRIRFI 1635
Cdd:COG4257   240 GDGRVWFAESGANRIVRF 257
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1421-1632 7.83e-07

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 53.44  E-value: 7.83e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1421 NPRGITVDKFGLIYFVD--GTMIRRVDQNGIISTLLGSndlTSARPLScdsvmeisqvrLEWPTDLAINPmDNSLYVLDn 1498
Cdd:cd14956   108 APRGVAVDADGNLYVADfgNQRIQKFDPDGSFLRQWGG---TGIEPGS-----------FNYPRGVAVDP-DGTLYVAD- 171
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1499 nvvlqiSENHQVrivagrpmhcQVPGIDHFLLSKVAIHAT----LESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1574
Cdd:cd14956   172 ------TYNDRI----------QVFDNDGAFLRKWGGRGTgpgqFNYPYGIAIDPDGNVFVADFGN---NRIQKFTADGT 232
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 568950580 1575 ISLVAGAPSGcdckndancdcfsgddgyaKDAKLNTPSSLAVCADGELYVADLGNIRI 1632
Cdd:cd14956   233 FLTSWGSPGT-------------------GPGQFKNPWGVVVDADGTVYVADSNNNRV 271
RHS_core NF041261
RHS element core protein;
2180-2516 2.48e-06

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 53.47  E-value: 2.48e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2180 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2241
Cdd:NF041261  367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2242 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2312
Cdd:NF041261  443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2313 lrDRITRLGDVqyKMDEDGFLRQrggdvFEYNSAGLLIkAYNRASGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2392
Cdd:NF041261  520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 2393 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2472
Cdd:NF041261  587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 568950580 2473 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWKRLSSNSI 2516
Cdd:NF041261  659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1417-1635 4.36e-06

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 51.17  E-value: 4.36e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1417 ATLTNPRGITVDKFGLIYFVD--GTMIRRVD-QNGIIStllgsndltsarplscdsvmEISQVRLEWPTDLAINPmDNSL 1493
Cdd:COG4257    14 APGSGPRDVAVDPDGAVWFTDqgGGRIGRLDpATGEFT--------------------EYPLGGGSGPHGIAVDP-DGNL 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1494 YVLD--NNVVLQIS-ENHQVRIVAGrpmhcqvPGIDHFLlskvaihatlesaTALAVSHNGVLYIAETDekkINRIRQVT 1570
Cdd:COG4257    73 WFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNP-------------HGIAFDPDGNLWFTDQG---GNRIGRLD 129
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1571 T-SGEISLV-----AGAPSGCDCKND---------ANC-DCFSGDDG----YAKDAKLNTPSSLAVCADGELYVADLGNI 1630
Cdd:COG4257   130 PaTGEVTEFplptgGAGPYGIAVDPDgnlwvtdfgANAiGRIDPDTGtlteYALPTPGAGPRGLAVDPDGNLWVADTGSG 209

                  ....*
gi 568950580 1631 RIRFI 1635
Cdd:COG4257   210 RIGRF 214
PLN02919 PLN02919
haloacid dehalogenase-like hydrolase family protein
1339-1639 1.07e-05

haloacid dehalogenase-like hydrolase family protein


Pssm-ID: 215497 [Multi-domain]  Cd Length: 1057  Bit Score: 51.39  E-value: 1.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1339 RNKDFRHSHSPAhKY--YLATDPMSGAVFLSDTNSRRVfkvksttVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1412
Cdd:PLN02919  556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1413 kateATLTNPRGITVDKFGLIYFVDGT---MIRRVD-QNGIISTLLGS----NDLTSARPLScdsvmeiSQVrLEWPTDL 1484
Cdd:PLN02919  621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1485 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1532
Cdd:PLN02919  689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1533 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1584
Cdd:PLN02919  769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 568950580 1585 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1639
Cdd:PLN02919  847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1422-1729 3.32e-05

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 48.42  E-value: 3.32e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1422 PRGITVDKFGLIYFVD--GTMIRRVDQNGIISTLLGSNDltsarplscdsvmeISQVRLEWPTDLAINPMDNsLYVLDnn 1499
Cdd:cd14957    20 PRGIAVDSAGNIYVADtgNNRIQVFTSSGVYSYSIGSGG--------------TGSGQFNSPYGIAVDSNGN-IYVAD-- 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1500 vvlqiSENHQVRIvagrpmhcqvpgidhFLLSKVAIHA---------TLESATALAVSHNGVLYIAETDEkkiNRIrQVT 1570
Cdd:cd14957    83 -----TDNNRIQV---------------FNSSGVYQYSigtggsgdgQFNGPYGIAVDSNGNIYVADTGN---HRI-QVF 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1571 TSgeislvAGAPsgcdckndancdCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRfirknkpflntqnmyel 1650
Cdd:cd14957   139 TS------SGTF------------SYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ----------------- 183
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1651 sspidqelyLFDTSGKHLYT-QSLPTGDYLYNFTY----TGDGDItHITDNNGNMVNVrRDSTGmplwlvvpdgqVYWVT 1725
Cdd:cd14957   184 ---------VFTSSGTFQYTfGSSGSGPGQFSDPYgiavDSDGNI-YVADTGNHRIQV-FTSSG-----------AYQYS 241

                  ....
gi 568950580 1726 MGTN 1729
Cdd:cd14957   242 IGTS 245
NHL_like_3 cd14956
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1411-1632 4.05e-05

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271326 [Multi-domain]  Cd Length: 274  Bit Score: 48.05  E-value: 4.05e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1411 GGKATEA-TLTNPRGITVDKFGLIYFVDGT--MIRRVDQNGIISTLLGSNdltSARPLSCDSvmeisqvrlewPTDLAIN 1487
Cdd:cd14956    50 GTTGDGPgQFGRPRGLAVDKDGWLYVADYWgdRIQVFTLTGELQTIGGSS---GSGPGQFNA-----------PRGVAVD 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1488 PmDNSLYVLD--NNVVLQISENHQ-VRIVAGRPmhcQVPGidHFLlskvaihatleSATALAVSHNGVLYIAETdekKIN 1564
Cdd:cd14956   116 A-DGNLYVADfgNQRIQKFDPDGSfLRQWGGTG---IEPG--SFN-----------YPRGVAVDPDGTLYVADT---YND 175
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 568950580 1565 RIRQVTTSGEISLVAGAPSGcdckndancdcFSGDdgyakdakLNTPSSLAVCADGELYVADLGNIRI 1632
Cdd:cd14956   176 RIQVFDNDGAFLRKWGGRGT-----------GPGQ--------FNYPYGIAIDPDGNVFVADFGNNRI 224
DSL pfam01414
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ...
818-861 4.56e-05

Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.


Pssm-ID: 460202  Cd Length: 46  Bit Score: 43.00  E-value: 4.56e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 568950580   818 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 861
Cdd:pfam01414    1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1304-1445 5.77e-05

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 47.71  E-value: 5.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1304 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILEMRNKDFRhshsPahkYYLATDPmSGAVFLSDTNSRRVFKVKSTT 1381
Cdd:COG4257   147 PYGIAVDPDGNLWVTDFgaNAIGRIDPDTGTLTEYALPTPGAG----P---RGLAVDP-DGNLWVADTGSGRIGRFDPKT 218
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1382 vvkdlvknsevvagtgdqclpfddtrcgdgGKATEATLTN----PRGITVDKFGLIYFVDGT--MIRRVD 1445
Cdd:COG4257   219 ------------------------------GTVTEYPLPGggarPYGVAVDGDGRVWFAESGanRIVRFD 258
NHL_like_1 cd14953
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ...
1575-1635 6.52e-05

Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271323 [Multi-domain]  Cd Length: 323  Bit Score: 47.91  E-value: 6.52e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568950580 1575 ISLVAGAPSGcdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1635
Cdd:cd14953     1 VSTVAGSGTA------------GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKI 49
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1748-1788 2.88e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 40.65  E-value: 2.88e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 568950580  1748 HGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSF 1788
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1546-1636 3.36e-04

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 45.65  E-value: 3.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1546 AVSHNGVLYIAETdekKINRIRQV-TTSGEISLVAGapsgcdckndancdcfSGDDGYA-KDAKLNTPSSLAVCADGELY 1623
Cdd:cd14951   202 AALPDGSVYVADT---YNHKIKRVdPATGEVSTLAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLY 262
                          90
                  ....*....|...
gi 568950580 1624 VADLGNIRIRFIR 1636
Cdd:cd14951   263 VADTNNHRIRRLD 275
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1752-1784 5.90e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.50  E-value: 5.90e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 568950580  1752 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1784
Cdd:pfam05593    5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Keratin_B2 pfam01500
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ...
699-847 1.30e-03

Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.


Pssm-ID: 366678 [Multi-domain]  Cd Length: 161  Bit Score: 42.09  E-value: 1.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   699 TNQCIDVACSSHGTCimGTCICNPGYKGESCEEVDCMDPTCS----SRGVCVRGECHCSVgwgGTNCETPraTCLDQCS- 773
Cdd:pfam01500    6 TSFCGFPTCSTGGTC--GSGCCQPCCCQSSCCRPSCCQTSCCqpttFQSSCCRPTCQPCC---QTSCCQP--TCCQTSSc 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580   774 -------GHGTfLPDTGLCNCDPSWTGHDCSIEICAADCGGHGVCVGGTCrCEDGWMGAACdqraCHPRCAEHGTCRDGK 846
Cdd:pfam01500   79 qtgcggiGYGQ-EGSSGAVSSRTRWCRPDCRVEGTCLPPCCVVSCTPPTC-CQLHHAQASC----CRPSYCGQSCCRPAC 152

                   .
gi 568950580   847 C 847
Cdd:pfam01500  153 C 153
NHL_like_5 cd14963
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ...
1300-1558 1.42e-03

Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271333 [Multi-domain]  Cd Length: 268  Bit Score: 43.05  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1300 KLLAPVALTCGSDGSLYVGDFnYIRRI--F-PSGNVTNILEmRNKDFRHSHSPAHkyyLATDpmSGAVFLSDTNSRRVfk 1376
Cdd:cd14963    54 EFKYPYGIAVDSDGNIYVADL-YNGRIqvFdPDGKFLKYFP-EKKDRVKLISPAG---LAID--DGKLYVSDVKKHKV-- 124
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1377 vksttVVKDLvknsevvagTGDQCLPFddtrcGDGGKAtEATLTNPRGITVDKFGLIYFVDgTMIRRV---DQNG-IIST 1452
Cdd:cd14963   125 -----IVFDL---------EGKLLLEF-----GKPGSE-PGELSYPNGIAVDEDGNIYVAD-SGNGRIqvfDKNGkFIKE 183
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1453 LLGSNDLTSArplscdsvmeisqvrLEWPTDLAINPmDNSLYVLDN--NVVLQISENHQVRIVAGRpmhcqvPGIDhfll 1530
Cdd:cd14963   184 LNGSPDGKSG---------------FVNPRGIAVDP-DGNLYVVDNlsHRVYVFDEQGKELFTFGG------RGKD---- 237
                         250       260
                  ....*....|....*....|....*...
gi 568950580 1531 skvaiHATLESATALAVSHNGVLYIAET 1558
Cdd:cd14963   238 -----DGQFNLPNGLFIDDDGRLYVTDR 260
Vgb COG4257
Streptogramin lyase [Defense mechanisms];
1299-1383 1.90e-03

Streptogramin lyase [Defense mechanisms];


Pssm-ID: 443399 [Multi-domain]  Cd Length: 270  Bit Score: 42.70  E-value: 1.90e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1299 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTNilemrnkdFRHSHSPAHKYYLATDPmSGAVFLSDTNSRRVF 1375
Cdd:COG4257   185 TPGAGPRGLAVDPDGNLWVADTgsGRIGRFDPkTGTVTE--------YPLPGGGARPYGVAVDG-DGRVWFAESGANRIV 255

                  ....*...
gi 568950580 1376 KVKSTTVV 1383
Cdd:COG4257   256 RFDPDTEL 263
NHL_like_2 cd14957
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ...
1545-1731 2.12e-03

Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271327 [Multi-domain]  Cd Length: 280  Bit Score: 42.64  E-value: 2.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1545 LAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCADGELYV 1624
Cdd:cd14957    23 IAVDSAGNIYVADTGN---NRIQVFTSSGVYSYSIG----------------SGGTG---SGQFNSPYGIAVDSNGNIYV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1625 ADLGNIRIRfirknkpFLNTQNMYElsspidqelYLFDTSGkhlytQSLPTGDYLYNFTYTGDGDItHITDNNGNMVNVr 1704
Cdd:cd14957    81 ADTDNNRIQ-------VFNSSGVYQ---------YSIGTGG-----SGDGQFNGPYGIAVDSNGNI-YVADTGNHRIQV- 137
                         170       180
                  ....*....|....*....|....*..
gi 568950580 1705 RDSTGmplwlvvpdgqVYWVTMGTNSA 1731
Cdd:cd14957   138 FTSSG-----------TFSYSIGSGGT 153
NHL-2_like cd14951
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ...
1412-1514 3.61e-03

NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271321 [Multi-domain]  Cd Length: 334  Bit Score: 42.18  E-value: 3.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1412 GKATEATLTNPRGITVDKFGLIYFVDgTM---IRRVD-QNGIISTLLGSNDLTSArplscdsvmeISQVRLEWPTDLAIN 1487
Cdd:cd14951   188 GPGAEALLQHPLGVAALPDGSVYVAD-TYnhkIKRVDpATGEVSTLAGTGKAGYK----------DLEAQFSEPSGLVVD 256
                          90       100
                  ....*....|....*....|....*..
gi 568950580 1488 PmDNSLYVLDNNvvlqiseNHQVRIVA 1514
Cdd:cd14951   257 G-DGRLYVADTN-------NHRIRRLD 275
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
836-858 3.89e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.94  E-value: 3.89e-03
                           10        20
                   ....*....|....*....|....*
gi 568950580   836 CAEHGTCRD--GKCECSPGWNGEHC 858
Cdd:pfam07974    2 CSGRGTCVNqcGKCVCDSGYQGATC 26
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
707-730 3.92e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 3.92e-03
                          10        20
                  ....*....|....*....|....*...
gi 568950580  707 CSSHGTCIMG----TCICNPGYKGESCE 730
Cdd:cd00054    11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
NHL_PKND_like cd14952
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ...
1304-1505 5.21e-03

NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.


Pssm-ID: 271322 [Multi-domain]  Cd Length: 247  Bit Score: 41.42  E-value: 5.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1304 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILemrnkDFRHSHSPahkYYLATDPmSGAVFLSDTNSRRVFKVkstt 1381
Cdd:cd14952    96 PTGVAVDAAGNVYVADTgnNRVLKLAAGSNTQTVL-----PFTGLSNP---DGVAVDG-AGNVYVTDTGNNRVLKL---- 162
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568950580 1382 vvkdlvknsevVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLIYFVDGtmirrvDQNGIISTLLGSNDLT 1460
Cdd:cd14952   163 -----------AAGSTTQTvLPFTG-------------LNSPSGVAVDTAGNVYVTDH------GNNRVLKLAAGSTTPT 212
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 568950580 1461 sARPLScdsvmeisqvRLEWPTDLAINPmDNSLYVLD--NNVVLQIS 1505
Cdd:cd14952   213 -VLPFT----------GLNGPLGVAVDA-AGNVYVADrgNDRVVKLP 247
EGF_2 pfam07974
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
706-729 6.48e-03

EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.


Pssm-ID: 400365  Cd Length: 26  Bit Score: 36.17  E-value: 6.48e-03
                           10        20
                   ....*....|....*....|....*.
gi 568950580   706 ACSSHGTCIM--GTCICNPGYKGESC 729
Cdd:pfam07974    1 ICSGRGTCVNqcGKCVCDSGYQGATC 26
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
2343-2372 9.37e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.04  E-value: 9.37e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 568950580  2343 YNSAGLLIKAYNrASGWSVRYRYDGLGRRV 2372
Cdd:pfam05593    1 YDAAGRLTSVTD-PDGRVTTYTYDAAGRLT 29
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH