| Name | Accession | Description | Interval | E-value |
| Ten_N | pfam06484 | Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... | 10-304 | 9.35e-177 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are products of 'pair-rule' genes and are involved in tissue patterning, probably neural patterning in particular. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. There it probably carries out some transcriptional regulatory activity. The length of this region and its conservation suggest that there may be two structural domains here (personal obs: C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 546.88 E-value: 9.35e-177
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 10 SLTRGRCGKECRYTSSSLDSEDCRVPTQKSYSSSETLKAYDHDSRMHYGNRVTDLIHRESDEFPRQGTNFTLAELGICEP 89
Cdd:pfam06484 1 SLTKRRRDKERRYTSSSADSEECRVPTQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 90 SP-HRSGYCSDMGILHQGYSLSTGSDADSDTEGGMSPEHAIRLWGRGIKSRRSSGLSSRENSALTLTDSDNENKSDDENG 168
Cdd:pfam06484 81 SPrHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 169 PP--------------------------------------------------------------------NHHSQSTLRP 180
Cdd:pfam06484 161 PPippsssssspveqhsppppslnenqrpllgnnashpildsdpdeefspnsylvrtgsgpqsapseqppNFQNHSRLRT 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 181 PLPP-PHNHT-LSHHHSSANSLNRNSLTNRRSQIHAP-APAPNDLATTPESVQLQDSWVLNSNVPLETRHFLFKTSSGST 257
Cdd:pfam06484 241 PPPPlPPPHKqNQHHHPSINSLNRSSLTNRRNPSPAPtASLPAELQSTQESVQLQDSWVLNSNVPLETRHFLFKTGTGTT 320
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 2462603315 258 PLFSSSSPGYPLTSGTVYTPPPRLLPRNTFSRKAFKLKKPSKYCSWK 304
Cdd:pfam06484 321 PLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 | cd14953 | Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... | 1170-1500 | 1.91e-48 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 176.95 E-value: 1.91e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1170 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHKYY----LAVDPvSGSLYVSDTNSRRIYRV 1243
Cdd:cd14953 25 PSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGNLYVADTGNHRIRKI 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1244 kslsgtkDLAGNSEVVAGTGEqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLLG 1321
Cdd:cd14953 104 -------TPDGVVSTLAGTGT-------AGFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRIRKITPDGVVTTVAG 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1322 sndlTAVRPLSCDSSMDVAQVRleWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGRPmhcqvpGIDYSLSKLA 1399
Cdd:cd14953 170 ----TGGAGYAGDGPATAAQFN--NPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGTG------TAGFSGDGGA 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1400 IHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndvncncYSGDDAYATDAILNSPSSL 1479
Cdd:cd14953 237 TAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGDGGPATSAQFNNPTGV 302
|
330 340
....*....|....*....|.
gi 2462603315 1480 AVAPDGTIYIADLGNIRIRAV 1500
Cdd:cd14953 303 AVDAAGNLYVADTGNNRIRKI 323
|
|
| Tox-GHH | pfam15636 | GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... | 2620-2697 | 4.10e-37 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteristic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive version of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 135.05 E-value: 4.10e-37
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462603315 2620 EEKARVLDQARQRALGTAWAKEQQKARDGREGSRLWTEGEKQQLLSTGRVQGYEGYYVLPVEQYPELADSSSNIQFLR 2697
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... | 1461-2397 | 1.04e-30 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 133.34 E-value: 1.04e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1461 YSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASP---GEQELYVFNADGIHQYTVS 1537
Cdd:COG3209 105 LTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAgggASAYGLTLGGAAAGPATGV 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1538 LVTGEYLYNFTYSTDNDVTELIDNNGNSLKIRRDSSGMPRHLLMPDNQIITLTVGTNGGLKVVSTQNLELGLMTYDGNTG 1617
Cdd:COG3209 185 GTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTG 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1618 LLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLHREMEKSITIDIENSNRDDDVTVITNLSSVEASYTVVQDQVRNS 1697
Cdd:COG3209 265 AGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGG 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1698 YQLCNNGTLRVMYANGMGISFHSEPHVLAGTITPTIGRCNISLPMENGLNSIEWRLRKEQIKGKVTIFGRKLRVHGRNLL 1777
Cdd:COG3209 345 TTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAG 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1778 SIDYDRNIRTEKIYDDHRKFTLRIIYDQVGRPFLWLPSSGLAAVNVSYFFNGRLAGLQRGAMSERTDIDKQGRIVSRMFA 1857
Cdd:COG3209 425 ALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDD 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1858 DGKVWSYSYLDKSMVLLLQSQRQYIF--------EYDSSDRLLAVTMPSVARHSMSTHTSIGYIRNIYNPPESNASVIFD 1929
Cdd:COG3209 505 TLGGTTTTTAGARGLVVTTGTTLTLGttttatlsATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGT 584
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1930 YSDDGRILKTSFLGTGRQVFYKYGKLSKLSEIVYDSTAVTFGYDETTGVLKMVNLQSGGFSCTIRYRKIGPLVDKQIYRF 2009
Cdd:COG3209 585 TGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTG 664
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 2010 SEEGMVNARFDYTYHDNSFRIASikpVISETPLPVDLYRYDEISGKVEHFGKFGVIYYDINQIITTAVMTLSKHFDTHGR 2089
Cdd:COG3209 665 TGTGVTAGLTTLATGGTTVGGGT---GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 2090 IKEVQYEMF-RSLMYWMTVQYDSMGRVIKRELKLGPYANTTKYTYDYDGDGQLQSVAVNDRPTWRYSYDLNGNLH----- 2163
Cdd:COG3209 742 GTLTTTSTTtTTTAGALTYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTsvitv 821
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 2164 LLNPGNSVRLMPLRYDLRDRITRLGDVQykidDDGYLCQRgsdiFEYNSKGLLTRAynKASGWSVQYRYDGVGRRASyKT 2243
Cdd:COG3209 822 GSGGGTDLQDRTYTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTS-RT 890
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 2244 NLGHHlQYFYSDLHNPTRITHvynhSNSEITSLYYDLQGHlfamesssgeeyyvaSDNTGTPLAVFSINGLMIKQLQYTA 2323
Cdd:COG3209 891 DGGTT-TYTYDALGRLVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYDP 950
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462603315 2324 YGEIYYDSNPDFQMVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwkNVGKEPAPfNLYMFKSNNPLS 2397
Cdd:COG3209 951 FGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-----PIGLAGGL-NLYAYVGNNPVN 1018
|
|
| Vgb | COG4257 | Streptogramin lyase [Defense mechanisms]; | 1170-1500 | 1.39e-12 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 70.43 E-value: 1.39e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1170 PVALAVGIDGSLYVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNNPAHKYY-LAVDPvSGSLYVSDTNSRRIYRVks 1245
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1246 lsGTKDlaGNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1322
Cdd:COG4257 86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1323 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1399
Cdd:COG4257 139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1400 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSL 1479
Cdd:COG4257 184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
|
330 340
....*....|....*....|.
gi 2462603315 1480 AVAPDGTIYIADLGNIRIRAV 1500
Cdd:COG4257 237 AVDGDGRVWFAESGANRIVRF 257
|
|
| Rhs_assc_core | TIGR03696 | RHS repeat-associated core domain; This model represents a conserved unique core sequence ... | 2321-2397 | 2.54e-09 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 55.97 E-value: 2.54e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 2321 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2396
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
.
gi 2462603315 2397 S 2397
Cdd:TIGR03696 70 N 70
|
|
| acid_disulf_rpt | NF033662 | acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... | 773-803 | 3.55e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 51.36 E-value: 3.55e-08
10 20 30
....*....|....*....|....*....|.
gi 2462603315 773 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 803
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
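The paired `gi` and `Cdd` rows in each alignment can be compared column by column. The following is a minimal sketch, not part of the CDD output: it assumes that uppercase letters mark aligned residues, that lowercase letters mark unaligned residues, and that `-` marks a gap. The function name `aligned_identity` is hypothetical.

```python
def aligned_identity(query, subject):
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase residue; gaps ('-') and lowercase residues are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the acid_disulf_rpt (NF033662) hit above.
query_row = "AMETSCADNKDNEGDGLVDCLDPDCCLQSAC"
cdd_row = "ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC"
identical, aligned = aligned_identity(query_row, cdd_row)
print(f"{identical}/{aligned} identical columns")
```

For multi-block alignments, the rows of each block would need to be concatenated before the call; the whitespace in this listing has been collapsed, so column positions in the longer alignments should be checked against the original record first.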
| DUF5885 | pfam19232 | Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... | 505-664 | 1.25e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 55.78 E-value: 1.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 505 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 554
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 555 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 603
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 604 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 655
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 2462603315 656 ------------DPN---WMGPDC 664
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| C_rich_MXAN6577 | NF041328 | MXAN_6577-like cysteine-rich domain; | 574-720 | 2.71e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 52.07 E-value: 2.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 574 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 643
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 644 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 717
Cdd:NF041328 75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139
|
...
gi 2462603315 718 REG 720
Cdd:NF041328 140 RGG 142
|
|
| PLN02919 | PLN02919 | haloacid dehalogenase-like hydrolase family protein | 1221-1498 | 1.62e-06 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 54.09 E-value: 1.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1221 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 1295
Cdd:PLN02919 573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1296 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1368
Cdd:PLN02919 635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1369 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1443
Cdd:PLN02919 700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2462603315 1444 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1498
Cdd:PLN02919 778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 1612-1648 | 7.34e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 41.82 E-value: 7.34e-05
10 20 30
....*....|....*....|....*....|....*..
gi 2462603315 1612 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1648
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| C_rich_MXAN6577 | NF041328 | MXAN_6577-like cysteine-rich domain; | 669-750 | 4.25e-04 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 42.82 E-value: 4.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 669 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 738
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
|
90
....*....|..
gi 2462603315 739 GCPDLCNGNGRC 750
Cdd:NF041328 115 ACVDLATDPLHC 126
|
|
| DSL | pfam01414 | Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... | 686-728 | 1.79e-03 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 38.37 E-value: 1.79e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 2462603315 686 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 728
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
|
|
| EGF_CA | cd00054 | Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... | 601-630 | 2.52e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.62 E-value: 2.52e-03
10 20 30
....*....|....*....|....*....|....*
gi 2462603315 601 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 630
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| C_rich_MXAN6577 | NF041328 | MXAN_6577-like cysteine-rich domain; | 513-685 | 7.46e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 39.36 E-value: 7.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 513 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 586
Cdd:NF041328 29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 587 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 664
Cdd:NF041328 89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
|
170 180
....*....|....*....|.
gi 2462603315 665 sveVCSVDCGTHGVCIGGACR 685
Cdd:NF041328 128 ---ACGVACDPGESCRGGACT 145
|
|
|
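Each hit in the listing carries a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. As a rough sketch for readers who want these values in structured form, the snippet below parses such a line with a regular expression. The pattern and the name `parse_summary` are assumptions based only on the lines shown here, not an official CDD format specification; the `[Multi-domain]` flag is treated as optional because some hits omit it.

```python
import re

# Assumed layout of a hit summary line; "[Multi-domain]" is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the fields of one summary line as a dict, or None if no match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


# Summary line copied from the Ten_N (pfam06484) hit.
hit = parse_summary(
    "Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 546.88 E-value: 9.35e-177"
)
print(hit)
```

Parsed this way, hits can be sorted or filtered by E-value without re-reading the alignment blocks.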
| NHL_like_1 | cd14953 | Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... | 1221-1501 | 1.31e-40 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 154.23 E-value: 1.31e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1221 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGEqclpfdEARCGDGGKAidATLMSPRGIAVDKNGLMY 1300
Cdd:cd14953 28 VAVDA-AGNLYVADRGNHRIRKI-------TPDGVVTTVAGTGT------AGFADGGGAA--AQFNTPSGVAVDAAGNLY 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1301 FVDAT--MIRKVDQNGIISTLLGsndlTAVRPLSCDSSMDVAQvrLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQV 1376
Cdd:cd14953 92 VADTGnhRIRKITPDGVVSTLAG----TGTAGFSDDGGATAAQ--FNYPTGVAVDAAGN-LYVADtgNHRIRKITPDGVV 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1377 SIIAGRPmhcqVPGidYSLSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASDcdckndv 1456
Cdd:cd14953 165 TTVAGTG----GAG--YAGDGPATAAQFNNPTGVAVDAAGNLYVADRGN---HRIRKITPDGVVTTVAGTGTA------- 228
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 2462603315 1457 ncncYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVS 1501
Cdd:cd14953 229 ----GFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGNHRIRKIT 269
|
|
| NHL_like_1 | cd14953 | Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... | 1258-1501 | 4.47e-32 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 129.19 E-value: 4.47e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1258 VVAGTGeqclpfdeARCGDGGKAIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL-----GSNDLTAvrp 1330
Cdd:cd14953 3 TVAGSG--------TAGFSGGGGTAARFNSPSGVAVDAAGNLYVADRGnhRIRKITPDGVVTTVAgtgtaGFADGGG--- 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1331 lscdssmdvAQVRLEWPTDLAVNPMDNsLYV--LENNVILRITENHQVSIIAGRPmhcqVPGidYSLSKLAIHSALESAS 1408
Cdd:cd14953 72 ---------AAAQFNTPSGVAVDAAGN-LYVadTGNHRIRKITPDGVVSTLAGTG----TAG--FSDDGGATAAQFNYPT 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1409 AIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdcdckndvncNCYSGDDAyATDAILNSPSSLAVAPDGTIY 1488
Cdd:cd14953 136 GVAVDAAGNLYVADTGN---HRIRKITPDGVVTTVAGTGG----------AGYAGDGP-ATAAQFNNPTGVAVDAAGNLY 201
|
250
....*....|...
gi 2462603315 1489 IADLGNIRIRAVS 1501
Cdd:cd14953 202 VADRGNHRIRKIT 214
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1140-1310 |
2.62e-18 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 88.74 E-value: 2.62e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1140 IITSIMGNGRRRSiscpSCNGLAEGNKLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHS----- 1212
Cdd:cd14953 163 VVTTVAGTGGAGY----AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFSGDggata 238
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1213 ---NNPahkYYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeQCLPfdearcGDGGKAIDATLMSPR 1289
Cdd:cd14953 239 aqlNNP---TGVAVDA-AGNLYVADSGNHRIRKI-------TPAGVVTTVAGGG-AGFS------GDGGPATSAQFNNPT 300
|
170 180
....*....|....*....|...
gi 2462603315 1290 GIAVDKNGLMYFVDAT--MIRKV 1310
Cdd:cd14953 301 GVAVDAAGNLYVADTGnnRIRKI 323
|
|
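Each hit in this listing carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, sometimes with a bracketed hit type such as `[Multi-domain]`. The sketch below shows one way to turn such a line into typed values for sorting or filtering hits. It assumes the line shape seen in this listing; the name `parse_hit_stats` and the dictionary keys are hypothetical and not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line as it appears in this listing.
# The bracketed hit type (e.g. "[Multi-domain]") is optional.
_STATS = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_stats(line):
    """Return the statistics of one hit as a dict, or None if the line does not match."""
    m = _STATS.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),  # None when no bracketed type is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }
```

Applied to the NHL_like_1 statistics line, this yields a Cd Length of 323 and an E-value of 2.62e-18 as numbers, so hits can be compared numerically instead of as strings.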
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1221-1519 |
6.09e-18 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 86.60 E-value: 6.09e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1221 LAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMY 1300
Cdd:cd05819 13 IAVDS-SGNIYVADTGNNRIQVF-------DPDGNFITSFGSF-----------GSG----DGQFNEPAGVAVDSDGNLY 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1301 FVDAT--MIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL--ENNVILRITENHQV 1376
Cdd:cd05819 70 VADTGnhRIQKFDPDGNFLASFGGSGDG--------------DGEFNGPRGIAVDSSGN-IYVAdtGNHRIQKFDPDGEF 134
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1377 SIIAGrpmhcqvpgidyslSKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndv 1456
Cdd:cd05819 135 LTTFG--------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGN---HRIQVFDPDGNFLTTFG----------- 186
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462603315 1457 ncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQYEAASPG 1519
Cdd:cd05819 187 --------STGTGPGQFNYPTGIAVDSDGNIYVADSGNNRVQVFDPDGAGFGGNGNFLGSDGQ 241
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1165-1498 |
2.13e-17 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 84.68 E-value: 2.13e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1165 NKLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILELRNKEFKHSNNPAHkyyLAVDPvSGSLYVSDTNSRRIYR 1242
Cdd:cd05819 5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1243 VkslsgtkDLAGNSEVVAGTGeqclpfdearcGDGgkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIISTLL 1320
Cdd:cd05819 81 F-------DPDGNFLASFGGS-----------GDG----DGEFNGPRGIAVDSSGNIYVADTGnhRIQKFDPDGEFLTTF 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1321 GSNdltavrplscdsSMDVAQvrLEWPTDLAVNPmDNSLYVLE--NNVILRITENHQVSIIAGRPmhCQVPGidyslskl 1398
Cdd:cd05819 139 GSG------------GSGPGQ--FNGPTGVAVDS-DGNIYVADtgNHRIQVFDPDGNFLTTFGST--GTGPG-------- 193
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1399 aihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGaasdcdckndvncncysgdDAYATDAILNSPSS 1478
Cdd:cd05819 194 ----QFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNG-------------------NFLGSDGQFNRPSG 247
|
330 340
....*....|....*....|
gi 2462603315 1479 LAVAPDGTIYIADLGNIRIR 1498
Cdd:cd05819 248 LAVDSDGNLYVADTGNNRIQ 267
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1159-1370 |
4.91e-16 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 80.83 E-value: 4.91e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1159 NGLAEGNkLLAPVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSIL---ELRNKEFkhsNNPahkYYLAVDPvSGSLYVS 1233
Cdd:cd05819 94 SGDGDGE-FNGPRGIAVDSSGNIYVADTgnHRIQKFDPDGEFLTTFgsgGSGPGQF---NGP---TGVAVDS-DGNIYVA 165
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1234 DTNSRRIYRVKSlsgtkdlagNSEVVAGTGEQCLPfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDATM--IRKVD 1311
Cdd:cd05819 166 DTGNHRIQVFDP---------DGNFLTTFGSTGTG-------------PGQFNYPTGIAVDSDGNIYVADSGNnrVQVFD 223
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462603315 1312 QNGIISTLLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPmDNSLYVLE--NNVILRI 1370
Cdd:cd05819 224 PDGAGFGGNGNF--------------LGSDGQFNRPSGLAVDS-DGNLYVADtgNNRIQVF 269
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1166-1431 |
6.95e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 77.36 E-value: 6.95e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1166 KLLAPVALAVGIDGSLYVGDFNYIR-RIFPS----RNVTSILELRNKEFkhsNNPahkYYLAVDPvSGSLYVSDTNSRRI 1240
Cdd:cd05819 53 QFNEPAGVAVDSDGNLYVADTGNHRiQKFDPdgnfLASFGGSGDGDGEF---NGP---RGIAVDS-SGNIYVADTGNHRI 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1241 YRVkslsgtkDLAGNSEVVAGTGEQClpfdearcgdggkaiDATLMSPRGIAVDKNGLMYFVDAT--MIRKVDQNGIIST 1318
Cdd:cd05819 126 QKF-------DPDGEFLTTFGSGGSG---------------PGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGNFLT 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1319 LLGSNdltavrplscdssmDVAQVRLEWPTDLAVNPMDNsLYVLE--NNVILRITENHQVSIIAGrpmhcqvpgidyslS 1396
Cdd:cd05819 184 TFGST--------------GTGPGQFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG--------------N 234
|
250 260 270
....*....|....*....|....*....|....*
gi 2462603315 1397 KLAIHSALESASAIAISHTGVLYITETDEKKINRL 1431
Cdd:cd05819 235 FLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1282-1513 |
9.11e-15 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 76.97 E-value: 9.11e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1282 DATLMSPRGIAVDKNGLMYFVDATM--IRKVDQNGIISTLLGSNDltavrplscdssmdVAQVRLEWPTDLAVNPmDNSL 1359
Cdd:cd05819 4 PGELNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNL 68
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1360 YVL--ENNVILRITENHQVSIIAGRPmhcqvpGIDYSlsklaihsALESASAIAISHTGVLYITETDEkkiNRLRQVTTN 1437
Cdd:cd05819 69 YVAdtGNHRIQKFDPDGNFLASFGGS------GDGDG--------EFNGPRGIAVDSSGNIYVADTGN---HRIQKFDPD 131
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462603315 1438 GEICLLAGAASDCDCKndvncncysgddayatdaiLNSPSSLAVAPDGTIYIADLGNIRIRAVSKNKPVLNAFNQY 1513
Cdd:cd05819 132 GEFLTTFGSGGSGPGQ-------------------FNGPTGVAVDSDGNIYVADTGNHRIQVFDPDGNFLTTFGST 188
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1170-1500 |
1.39e-12 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 70.43 E-value: 1.39e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1170 PVALAVGIDGSLYVGDF--NYIRRIFPsrnvtsilelRNKEFK-HSNNPAHKYY-LAVDPvSGSLYVSDTNSRRIYRVks 1245
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDP----------ATGEFTeYPLGGGSGPHgIAVDP-DGNLWFTDNGNNRIGRI-- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1246 lsGTKDlaGNSEVVAGTGEQCLPFdearcgdggkaidatlmsprGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTLlgs 1322
Cdd:COG4257 86 --DPKT--GEITTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEF--- 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1323 ndltavrPLSCDSSMdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRI-TENHQVSIIAGrpmhcqvpgidyslskla 1399
Cdd:COG4257 139 -------PLPTGGAG---------PYGIAVDP-DGNLWVtdFGANAIGRIdPDTGTLTEYAL------------------ 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1400 iHSALESASAIAISHTGVLYITETDEKKINRLRqvTTNGEIcllagaasdcdckndvncncysgdDAYATDAILNSPSSL 1479
Cdd:COG4257 184 -PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTV------------------------TEYPLPGGGARPYGV 236
|
330 340
....*....|....*....|.
gi 2462603315 1480 AVAPDGTIYIADLGNIRIRAV 1500
Cdd:COG4257 237 AVDGDGRVWFAESGANRIVRF 257
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1165-1440 |
7.10e-10 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 62.34 E-value: 7.10e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1165 NKLLAPVALAVGIDGSLYVGD--FNYIRRIFPSRNVTSILELRNKEfkhsNNPahkYYLAVDPvSGSLYVSDTNSRRIYR 1242
Cdd:COG4257 56 GGGSGPHGIAVDPDGNLWFTDngNNRIGRIDPKTGEITTFALPGGG----SNP---HGIAFDP-DGNLWFTDQGGNRIGR 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1243 VkslsgtkDLAGNsEVVAGTgeqcLPFDEARcgdggkaidatlmsPRGIAVDKNGLMYFVD--ATMIRKVD-QNGIISTL 1319
Cdd:COG4257 128 L-------DPATG-EVTEFP----LPTGGAG--------------PYGIAVDPDGNLWVTDfgANAIGRIDpDTGTLTEY 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1320 LGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYVLE--NNVILRITENhqvsiiagrpmhcqvpgiDYSLSK 1397
Cdd:COG4257 182 ALPTPGAG-------------------PRGLAVDP-DGNLWVADtgSGRIGRFDPK------------------TGTVTE 223
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 2462603315 1398 LAIHSALESASAIAISHTGVLYITETDekkINRLRQVTTNGEI 1440
Cdd:COG4257 224 YPLPGGGARPYGVAVDGDGRVWFAESG---ANRIVRFDPDTEL 263
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2321-2397 |
2.54e-09 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 55.97 E-value: 2.54e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 2321 YTAYGEIYYDSNPDFQmVIGFHGGLYDPLTKLVHFTQRDYDVLAGRWTSPDytmwknvgkePA----PFNLYMFKSNNPL 2396
Cdd:TIGR03696 1 YDPYGEVLSESGAAPN-PLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD----------PIglggGLNLYAYVGNNPV 69
|
.
gi 2462603315 2397 S 2397
Cdd:TIGR03696 70 N 70
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1219-1530 |
2.67e-09 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 60.80 E-value: 2.67e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1219 YYLAVDPvSGSLYVSDTNSRRIYRVkslsgtkDLAgnsevvagTGEqclpFDEARCGDGGkaidatlmSPRGIAVDKNGL 1298
Cdd:COG4257 20 RDVAVDP-DGAVWFTDQGGGRIGRL-------DPA--------TGE----FTEYPLGGGS--------GPHGIAVDPDGN 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1299 MYFVD--ATMIRKVD-QNGIISTLLGSNDLTAvrplscdssmdvaqvrlewPTDLAVNPmDNSLYV--LENNVILRIT-E 1372
Cdd:COG4257 72 LWFTDngNNRIGRIDpKTGEITTFALPGGGSN-------------------PHGIAFDP-DGNLWFtdQGGNRIGRLDpA 131
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1373 NHQVSIIAGRPMHCQvpgidyslsklaihsalesASAIAISHTGVLYITETdekKINRLRQVTT-NGEIcllagaasdcd 1451
Cdd:COG4257 132 TGEVTEFPLPTGGAG-------------------PYGIAVDPDGNLWVTDF---GANAIGRIDPdTGTL----------- 178
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1452 ckndvncncysgdDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSknkPVLNAFNQYeAASPGEQELY--VFNAD 1529
Cdd:COG4257 179 -------------TEYALPTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD---PKTGTVTEY-PLPGGGARPYgvAVDGD 241
|
.
gi 2462603315 1530 G 1530
Cdd:COG4257 242 G 242
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
773-803 |
3.55e-08 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 51.36 E-value: 3.55e-08
10 20 30
....*....|....*....|....*....|.
gi 2462603315 773 AMETSCADNKDNEGDGLVDCLDPDCCLQSAC 803
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1221-1497 |
4.34e-08 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 56.83 E-value: 4.34e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1221 LAVDPvSGSLYVSDTNSRRIYRvkslsgtkdLAgnsevvAGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLM 1299
Cdd:cd14952 15 VAVDA-AGNVYVADSGNNRVLK---------LA------AGSTTQTvLPFTG-------------LYQPQGVAVDAAGTV 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1300 YFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPMDNsLYVLE--NNVILRITenhqvs 1377
Cdd:cd14952 66 YVTDF------GNNRVLKLAAGSTTQTVL-PFT----------GLNDPTGVAVDAAGN-VYVADtgNNRVLKLA------ 121
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1378 iiAGRPMHCQVPGIDyslsklaihsaLESASAIAISHTGVLYITETDEKKINRLRQVTTNGEICLLAGAASDCDCKNDVN 1457
Cdd:cd14952 122 --AGSNTQTVLPFTG-----------LSNPDGVAVDGAGNVYVTDTGNNRVLKLAAGSTTQTVLPFTGLNSPSGVAVDTA 188
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2462603315 1458 CNCYSGD---------DAYATDAI------LNSPSSLAVAPDGTIYIADLGNIRI 1497
Cdd:cd14952 189 GNVYVTDhgnnrvlklAAGSTTPTvlpftgLNGPLGVAVDAAGNVYVADRGNDRV 243
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
505-664 |
1.25e-07 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 55.78 E-value: 1.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 505 DCPRNCHGNGECVSGVCH--------------CFPGFLGADCAKAAC--PVLCsGNGQ----------YSKGTCQ----C 554
Cdd:pfam19232 11 DCTPPCGGTQVCIDRQCKdntlacttdaqcgtCMTCVAGACTPKASCcgGVTC-GAGQtcdaktntcvYVKGYCSadhpC 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 555 YSGwkgAECDVPMNQCI-DPSCG-GHGS-CIDG-----------------NCVCSAG--YKGEH-CEEV--------DCL 603
Cdd:pfam19232 90 PSG---SACDTAKNACIaQPPYGpDSGKgCVRGfgawiweldpatnsgvwRCRCANGslYNSAHeCSPLadqtlcaaENL 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 604 DPTC---------------SSHGVCVN-------------GECLCSPGWGGLNCELARvqcpdQCSGHGTYLPDTGLCSC 655
Cdd:pfam19232 167 DPNAlvpassvpafaaygwGNQPVLINkstagaavpsplaGVCPCKPGWAGGSCTEDR-----TCNGRGTWNETTGQCAC 241
|
250 260
....*....|....*....|....
gi 2462603315 656 ------------DPN---WMGPDC 664
Cdd:pfam19232 242 nidfsghnscgdDNNctsWTGPRC 265
|
|
| C_rich_MXAN6577 |
NF041328 |
MXAN_6577-like cysteine-rich domain; |
574-720 |
2.71e-07 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 52.07 E-value: 2.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 574 SCGGHGS-CIDGNCVCsagykGEHCeeVDC-LDP--------TCSSHGVCVNGECLCSPGwgglncelaRVQCPDQCSgh 643
Cdd:NF041328 13 GCPEPGAvCPEGLSVC-----GGAC--VDLrSDPsncgacgvACGAGQTCVAGACGCGPG---------TVACGGACV-- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 644 gtylpDTglcSCDPNWMGpdcsveVCSVDCGTHGVCIGGACR--CEEGWT--GAAC-DQRVCHPRCIEHGT-CKDGKcEC 717
Cdd:NF041328 75 -----DT---ASDPAHCG------ACGAACAPGQVCEGGACReaCSEGLTrcGGACvDLATDPLHCGACGVaCDPGE-SC 139
|
...
gi 2462603315 718 REG 720
Cdd:NF041328 140 RGG 142
|
|
| PLN02919 |
PLN02919 |
haloacid dehalogenase-like hydrolase family protein |
1221-1498 |
1.62e-06 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 54.09 E-value: 1.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1221 LAVDPVSGSLYVSDTNSRRIYrvkslsgTKDLAGNSEV-VAGTGEQCL---PFDearcgdggkaiDATLMSPRGIAVD-K 1295
Cdd:PLN02919 573 LAIDLLNNRLFISDSNHNRIV-------VTDLDGNFIVqIGSTGEEGLrdgSFE-----------DATFNRPQGLAYNaK 634
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1296 NGLMYFVDAT--MIRKVD-QNGIISTLLGS----NDLTAVRPLScdssmdvAQVrLEWPTDLAVNPMDNSLYVlennvil 1368
Cdd:PLN02919 635 KNLLYVADTEnhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDVCFEPVNEKVYI------- 699
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1369 RITENHQV---SIIAGRPMHCQVPGIDYSLS-KLAIHSALESASAIAIS-HTGVLYITETDEKKINRLrQVTTNGEIcLL 1443
Cdd:PLN02919 700 AMAGQHQIweyNISDGVTRVFSGDGYERNLNgSSGTSTSFAQPSGISLSpDLKELYIADSESSSIRAL-DLKTGGSR-LL 777
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 2462603315 1444 AGAasdcDCKNDVNCNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIR 1498
Cdd:PLN02919 778 AGG----DPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIK 828
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1170-1303 |
2.27e-06 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 51.88 E-value: 2.27e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1170 PVALAVGIDGSLYVGD-FNYIRRIFPSRNVTSIlelrnkEFKH-SNNPAHKYYL---AVDPvSGSLYVSDTNSRRIyRVK 1244
Cdd:cd14957 114 PYGIAVDSNGNIYVADtGNHRIQVFTSSGTFSY------SIGSgGTGPGQFNGPqgiAVDS-DGNIYVADTGNHRI-QVF 185
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 2462603315 1245 SLSGTKDLAgnsevVAGTGEqclpfdearcGDGGkaidatLMSPRGIAVDKNGLMYFVD 1303
Cdd:cd14957 186 TSSGTFQYT-----FGSSGS----------GPGQ------FSDPYGIAVDSDGNIYVAD 223
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1170-1498 |
2.33e-06 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 51.88 E-value: 2.33e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1170 PVALAVGIDGSLYVGDFNYIR-RIF-PSRNVTSIL---ELRNKEFkhsNNPahkYYLAVDPvSGSLYVSDTNSRRIyRVK 1244
Cdd:cd14957 20 PRGIAVDSAGNIYVADTGNNRiQVFtSSGVYSYSIgsgGTGSGQF---NSP---YGIAVDS-NGNIYVADTDNNRI-QVF 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1245 SLSGTKDLAgnsevVAGTGEQCLPFDEarcgdggkaidatlmsPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGS 1322
Cdd:cd14957 92 NSSGVYQYS-----IGTGGSGDGQFNG----------------PYGIAVDSNGNIYVADTgnHRIQVFTSSGTFSYSIGS 150
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1323 ndltavrplscdSSMDVAQVRLewPTDLAVNPMDNsLYVLENNvilriteNHQVSII--AGRPmhcqvpgiDYSL-SKLA 1399
Cdd:cd14957 151 ------------GGTGPGQFNG--PQGIAVDSDGN-IYVADTG-------NHRIQVFtsSGTF--------QYTFgSSGS 200
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1400 IHSALESASAIAISHTGVLYITETDEKKInrlrQVTTNgeicllagaasdcdckndvncncySGDDAYA------TDAIL 1473
Cdd:cd14957 201 GPGQFSDPYGIAVDSDGNIYVADTGNHRI----QVFTS------------------------SGAYQYSigtsgsGNGQF 252
|
330 340
....*....|....*....|....*
gi 2462603315 1474 NSPSSLAVAPDGTIYIADLGNIRIR 1498
Cdd:cd14957 253 NYPYGIAVDNDGKIYVADSNNNRIQ 277
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1285-1564 |
3.31e-06 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 51.50 E-value: 3.31e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1285 LMSPRGIAVDKNGLMYFVDA--TMIRKVDQNGIISTLLGSNDLTavrplscdssmdvaQVRLEWPTDLAVNPMDNsLYVL 1362
Cdd:cd14957 17 FNTPRGIAVDSAGNIYVADTgnNRIQVFTSSGVYSYSIGSGGTG--------------SGQFNSPYGIAVDSNGN-IYVA 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1363 EnnvilriTENHQVSII--AGrpmhcqvpGIDYSL-SKLAIHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGE 1439
Cdd:cd14957 82 D-------TDNNRIQVFnsSG--------VYQYSIgTGGSGDGQFNGPYGIAVDSNGNIYVADTGN---HRIQVFTSSGT 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1440 icllagaasdcdckndvncNCYSGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRavsknkpvlnafnqyeaaspg 1519
Cdd:cd14957 144 -------------------FSYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQ--------------------- 183
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 2462603315 1520 eqelyVFNADGIHQYTV-SLVTGEYLYNFTYSTDndvtelIDNNGN 1564
Cdd:cd14957 184 -----VFTSSGTFQYTFgSSGSGPGQFSDPYGIA------VDSDGN 218
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1167-1361 |
4.38e-06 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 50.67 E-value: 4.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1167 LLAPVALAVGIDGSLYVGDFNYIR--RIFPSRNVTSILElrnkeFKHSNNPAHkyyLAVDPvSGSLYVSDTNSRRIYRVK 1244
Cdd:cd14952 51 LYQPQGVAVDAAGTVYVTDFGNNRvlKLAAGSTTQTVLP-----FTGLNDPTG---VAVDA-AGNVYVADTGNNRVLKLA 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1245 S------------LSGTKDLA------------GNSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKN 1296
Cdd:cd14952 122 AgsntqtvlpftgLSNPDGVAvdgagnvyvtdtGNNRVLklaAGSTTQTvLPFTG-------------LNSPSGVAVDTA 188
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462603315 1297 GLMYFVDAtmirkvDQNGIISTLLGSNDLTAVrPLScdssmdvaqvRLEWPTDLAVNPmDNSLYV 1361
Cdd:cd14952 189 GNVYVTDH------GNNRVLKLAAGSTTPTVL-PFT----------GLNGPLGVAVDA-AGNVYV 235
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1280-1500 |
1.22e-05 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 49.63 E-value: 1.22e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1280 AIDATLMSPRGIAVDKNGLMYFVDAT--MIRKVD-QNGIISTllgsndltavrplscdssmdVAQVRLEWPTDLAVNPmD 1356
Cdd:COG4257 11 PVPAPGSGPRDVAVDPDGAVWFTDQGggRIGRLDpATGEFTE--------------------YPLGGGSGPHGIAVDP-D 69
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1357 NSLYVLE--NNVILRIT-ENHQVSIIAGrpmhcqvPGIDYSLSKLAIHSAlesasaiaishtGVLYITETDEKKINRLRq 1433
Cdd:COG4257 70 GNLWFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNPHGIAFDPD------------GNLWFTDQGGNRIGRLD- 129
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462603315 1434 vTTNGEIcllagaasdcdckndvncncySGDDAYATDAilnSPSSLAVAPDGTIYIADLGNIRIRAV 1500
Cdd:COG4257 130 -PATGEV---------------------TEFPLPTGGA---GPYGIAVDPDGNLWVTDFGANAIGRI 171
|
|
| NHL_like_1 | cd14953 | Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... | 1462-1503 | 1.97e-05 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 49.45 E-value: 1.97e-05
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2462603315 1462 SGDDAYATDAILNSPSSLAVAPDGTIYIADLGNIRIRAVSKN 1503
Cdd:cd14953 11 GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKITPD 52
|
|
| NHL_like_4 | cd14955 | Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... | 1222-1497 | 4.27e-05 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271325 [Multi-domain] Cd Length: 279 Bit Score: 47.96 E-value: 4.27e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1222 AVDPvSGSLYVSDTNSRRIYRVKSlSGTkdlagnseVVAGTGeqclpfdeaRCGDGgkaiDATLMSPRGIAVDKNGLMYF 1301
Cdd:cd14955 69 AVDS-DGNVYVADTGNHRIQKFDS-TGT--------FLTKWG---------SSGSG----DGQFNSPSGIAVDSAGNVYV 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1302 VDAT--MIRKVDQNGIISTLLGSNDltavrplSCDSSMDvaqvrleWPTDLAVnpmDNS--LYVLEnnvilriTENHQV- 1376
Cdd:cd14955 126 TDSGnnRIQKFDSSGTFITKWGSFG-------SGDGQFN-------SPTGIAV---DSAgnVYVAD-------TGNNRIq 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1377 ------SIIAGRpmhcQVPGIDyslsklaiHSALESASAIAISHTGVLYITETDEkkiNRLRQVTTNGEICLLAGAASdc 1450
Cdd:cd14955 182 kftstgTFLTKW----GSEGSG--------DGQFNAPYGIAVDSAGNVYVADTGN---NRIQKFDSSGTFITKWGSEG-- 244
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 2462603315 1451 dckndvncncySGDDAYatdailNSPSSLAVAPDGTIYIADLGNIRI 1497
Cdd:cd14955 245 -----------SGDGQF------NSPSGIAVDSAGNVYVADSGNNRI 274
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 1612-1648 | 7.34e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 41.82 E-value: 7.34e-05
10 20 30
....*....|....*....|....*....|....*..
gi 2462603315 1612 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTG 1648
Cdd:pfam05593 1 YDAA-GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| Keratin_B2 | pfam01500 | Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized ... | 597-715 | 1.28e-04 |
|
Keratin, high sulfur B2 protein; High sulfur proteins are cysteine-rich proteins synthesized during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
Pssm-ID: 366678 [Multi-domain] Cd Length: 161 Bit Score: 44.78 E-value: 1.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 597 CEEVDCLDPTCSSHGVCvnGECLCSPGWGGLNCelarvqCPDQCSGHGTYLPDTGLCSCDPNWMGPDCSVEVCSVDCGTH 676
Cdd:pfam01500 4 CGTSFCGFPTCSTGGTC--GSGCCQPCCCQSSC------CRPSCCQTSCCQPTTFQSSCCRPTCQPCCQTSCCQPTCCQT 75
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 2462603315 677 GVCIGGACRCEEGWTGAA----CDQRVCHPRCIEHGTCKDGKC 715
Cdd:pfam01500 76 SSCQTGCGGIGYGQEGSSgavsSRTRWCRPDCRVEGTCLPPCC 118
|
|
| YvrE | COG3386 | Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase ... | 1173-1333 | 3.20e-04 |
|
Sugar lactone lactonase YvrE [Carbohydrate transport and metabolism]; Sugar lactone lactonase YvrE is part of the non-phosphorylated Entner-Doudoroff pathway (Pathway/BioSystem).
Pssm-ID: 442613 [Multi-domain] Cd Length: 266 Bit Score: 45.27 E-value: 3.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1173 LAVGIDGSLYVGDFNY------IRRIFPSRNVTSILElrnkEFKHSNNpahkyyLAVDPVSGSLYVSDTNSRRIYRVkSL 1246
Cdd:COG3386 98 GVVDPDGRLYFTDMGEylptgaLYRVDPDGSLRVLAD----GLTFPNG------IAFSPDGRTLYVADTGAGRIYRF-DL 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1247 SGTKDLaGNSEVVAgtgeqclpfdEARCGDGGkaidatlmsPRGIAVDKNGLMY--FVDATMIRKVDQNGiisTLLGSND 1324
Cdd:COG3386 167 DADGTL-GNRRVFA----------DLPDGPGG---------PDGLAVDADGNLWvaLWGGGGVVRFDPDG---ELLGRIE 223
|
....*....
gi 2462603315 1325 LTAVRPLSC 1333
Cdd:COG3386 224 LPERRPTNV 232
|
|
| EGF_2 | pfam07974 | EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. | 575-597 | 3.68e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 39.64 E-value: 3.68e-04
|
| EGF_2 | pfam07974 | EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. | 510-532 | 3.87e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 39.64 E-value: 3.87e-04
|
| C_rich_MXAN6577 | NF041328 | MXAN_6577-like cysteine-rich domain; | 669-750 | 4.25e-04 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 42.82 E-value: 4.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 669 CSVDCGTHGVCIGGACRCEEGWT--GAAC-----DQR---VCHPRCIEHGTCKDGkcECREGwngehCTIGRQTAGtetD 738
Cdd:NF041328 45 CGVACGAGQTCVAGACGCGPGTVacGGACvdtasDPAhcgACGAACAPGQVCEGG--ACREA-----CSEGLTRCG---G 114
|
90
....*....|..
gi 2462603315 739 GCPDLCNGNGRC 750
Cdd:NF041328 115 ACVDLATDPLHC 126
|
|
| EGF_2 | pfam07974 | EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. | 640-664 | 7.54e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 38.87 E-value: 7.54e-04
|
| NHL_PKND_like | cd14952 | NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... | 1170-1303 | 8.73e-04 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 43.74 E-value: 8.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1170 PVALAVGIDGSLYVGDF--NYIRRIFPSRNVTSILElrnkeFKHSNNPahkYYLAVDPvSGSLYVSDTNSRRIYRVKSLS 1247
Cdd:cd14952 96 PTGVAVDAAGNVYVADTgnNRVLKLAAGSNTQTVLP-----FTGLSNP---DGVAVDG-AGNVYVTDTGNNRVLKLAAGS 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1248 GTK----------------DLAG--------NSEVV---AGTGEQC-LPFDEarcgdggkaidatLMSPRGIAVDKNGLM 1299
Cdd:cd14952 167 TTQtvlpftglnspsgvavDTAGnvyvtdhgNNRVLklaAGSTTPTvLPFTG-------------LNGPLGVAVDAAGNV 233
|
....
gi 2462603315 1300 YFVD 1303
Cdd:cd14952 234 YVAD 237
|
|
| NHL_like_5 | cd14963 | Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... | 1160-1324 | 1.04e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 43.43 E-value: 1.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1160 GLAEGnKLLAPVALAVGIDGSLYVGDFnYIRRI------------FPSRnvtsilelrnKEFKHSNNPAHkyyLAVDpvS 1227
Cdd:cd14963 49 GTGPG-EFKYPYGIAVDSDGNIYVADL-YNGRIqvfdpdgkflkyFPEK----------KDRVKLISPAG---LAID--D 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1228 GSLYVSDTNSRRIYrvkslsgtkdlagnseVVAGTGEQCLPFDEARCGDGgkaidaTLMSPRGIAVDKNGLMYFVDATMI 1307
Cdd:cd14963 112 GKLYVSDVKKHKVI----------------VFDLEGKLLLEFGKPGSEPG------ELSYPNGIAVDEDGNIYVADSGNG 169
|
170 180
....*....|....*....|
gi 2462603315 1308 R-KV-DQNG-IISTLLGSND 1324
Cdd:cd14963 170 RiQVfDKNGkFIKELNGSPD 189
|
|
| DSL | pfam01414 | Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... | 686-728 | 1.79e-03 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 38.37 E-value: 1.79e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 2462603315 686 CEEGWTGAACDqRVCHPR--CIEHGTC-KDGKCECREGWNGEHCTI 728
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRddKFGHYTCdANGNKVCLPGWTGPYCDK 45
|
|
| EGF_CA | cd00054 | Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... | 601-630 | 2.52e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 37.62 E-value: 2.52e-03
10 20 30
....*....|....*....|....*....|....*
gi 2462603315 601 DCLDPT-CSSHGVCVNGE----CLCSPGWGGLNCE 630
Cdd:cd00054 4 ECASGNpCQNGGTCVNTVgsyrCSCPPGYTGRNCE 38
|
|
| YD_repeat_2x | TIGR01643 | YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... | 1612-1654 | 3.99e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.18 E-value: 3.99e-03
10 20 30 40
....*....|....*....|....*....|....*....|...
gi 2462603315 1612 YDGNtGLLATKSDETGWTTFYDYDHEGRLTNVTRPTGVVTSLH 1654
Cdd:TIGR01643 1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| EGF_CA | cd00054 | Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... | 568-598 | 4.33e-03 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 36.85 E-value: 4.33e-03
10 20 30
....*....|....*....|....*....|....*.
gi 2462603315 568 NQCIDPS-CGGHGSCIDG----NCVCSAGYKGEHCE 598
Cdd:cd00054 3 DECASGNpCQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
|
|
| NHL_like_5 | cd14963 | Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... | 1160-1303 | 5.74e-03 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 41.12 E-value: 5.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1160 GLAEGNkLLAPVALAVGIDGSLYVGDFNYIRRIFPSRNVTSILELRNKEFKHS--NNPAHkyyLAVDPvSGSLYVSDTNS 1237
Cdd:cd14963 141 GSEPGE-LSYPNGIAVDEDGNIYVADSGNGRIQVFDKNGKFIKELNGSPDGKSgfVNPRG---IAVDP-DGNLYVVDNLS 215
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462603315 1238 RRIYrVKSLSGTKDLagnseVVAGTGEqclpfdearcgdggkaIDATLMSPRGIAVDKNGLMYFVD 1303
Cdd:cd14963 216 HRVY-VFDEQGKELF-----TFGGRGK----------------DDGQFNLPNGLFIDDDGRLYVTD 259
|
|
| C_rich_MXAN6577 | NF041328 | MXAN_6577-like cysteine-rich domain; | 513-685 | 7.46e-03 |
|
MXAN_6577-like cysteine-rich domain;
Pssm-ID: 469225 [Multi-domain] Cd Length: 145 Bit Score: 39.36 E-value: 7.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 513 NGECVSgvchcfpgfLGADCAK-AACPVLCSGNGQYSKGTCQCYSGwkGAECDvpmNQCI----DP-SCGGHGScidgnc 586
Cdd:NF041328 29 GGACVD---------LRSDPSNcGACGVACGAGQTCVAGACGCGPG--TVACG---GACVdtasDPaHCGACGA------ 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 587 vcsagykgehceevdcldpTCSSHGVCVNGECL--CSPGwgglncelaRVQCPDQCSGHGTylpDTGLCScdpnwmgpdc 664
Cdd:NF041328 89 -------------------ACAPGQVCEGGACReaCSEG---------LTRCGGACVDLAT---DPLHCG---------- 127
|
170 180
....*....|....*....|.
gi 2462603315 665 sveVCSVDCGTHGVCIGGACR 685
Cdd:NF041328 128 ---ACGVACDPGESCRGGACT 145
|
|
| NHL-2_like | cd14951 | NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... | 1415-1500 | 8.11e-03 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins. Members of this eukaryotic and bacterial family are uncharacterized; the NHL repeat domain is found C-terminal to a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 41.02 E-value: 8.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462603315 1415 TGVLYITETDEKKINRL----RQVTTngeiclLAGaasdcdckndvncncySGDDAYA-TDAILNSPSSLAVAPDGTIYI 1489
Cdd:cd14951 206 DGSVYVADTYNHKIKRVdpatGEVST------LAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLYV 263
|
90
....*....|.
gi 2462603315 1490 ADLGNIRIRAV 1500
Cdd:cd14951 264 ADTNNHRIRRL 274
|
|
| I-EGF_1 | pfam18372 | Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in ... | 541-558 | 9.88e-03 |
|
Integrin beta epidermal growth factor like domain 1; This is the I-EGF 1 domain found in several integrin betas such as integrin beta 1-7. Structural analysis reveals epidermal growth factor-like (I-EGF) domains 1 and 2. EGF1 lacks one disulfide (C2-C4) relative to the integrin EGF 2, 3, and 4 domains; this allows the C-terminal end of EGF1 to flex remarkably relative to its N-terminal end.
Pssm-ID: 465729 Cd Length: 29 Bit Score: 35.93 E-value: 9.88e-03
|
|