|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
36-332 |
2.43e-163 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 509.13 E-value: 2.43e-163
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 36 SLT-RRRDAERRYTSSSADSEEGKTP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGANFTLRELGLGEV 113
Cdd:pfam06484 1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 114 TSPHGTLYRTDIGLPHCGYSMGASSDADMEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTETDHPG 193
Cdd:pfam06484 81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 194 ----------------------------------------------------------------------GLQNHARLRT 203
Cdd:pfam06484 161 ppippsssssspveqhsppppslnenqrpllgnnashpildsdpdeefspnsylvrtgsgpqsapseqppNFQNHSRLRT 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 204 PPPPLSHTHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPASgaQEPAHAQDNWLLNSNIPLETRHFLFKPG- 282
Cdd:pfam06484 241 PPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQST--QESVQLQDSWVLNSNVPLETRHFLFKTGt 317
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 1622865629 283 GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFARPAFNLKKPSKYCNWK 332
Cdd:pfam06484 318 GTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1214-1555 |
5.24e-46 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 169.63 E-value: 5.24e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1214 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRHSHSPAHKYY----LTTDPmSGA 1283
Cdd:cd14953 11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1284 VFLSDTNSRRVFKIKSTVVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1361
Cdd:cd14953 90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1362 RRIDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1439
Cdd:cd14953 156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1440 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1519
Cdd:cd14953 226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
|
330 340 350
....*....|....*....|....*....|....*.
gi 1622865629 1520 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1555
Cdd:cd14953 288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2677-2754 |
3.22e-38 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 138.13 E-value: 3.22e-38
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865629 2677 EEKARVLELARQRAVRQAWAREQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNIHFMR 2754
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1581-2453 |
7.28e-34 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 143.74 E-value: 7.28e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1581 FDTTGKHLYTQSLPTGDYLYNFTYTGDGDVTLITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALKSVTTQGH 1660
Cdd:COG3209 184 VGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTT 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1661 ELAMMTYHGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAF 1740
Cdd:COG3209 264 GAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTG 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1741 YTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFG 1820
Cdd:COG3209 344 GTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAA 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1821 RRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSlWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYD 1900
Cdd:COG3209 424 GALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGA-ATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTL 502
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1901 QAGRIISRIFADGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASV 1980
Cdd:COG3209 503 DDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTT 582
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1981 IQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTINLQNEGFTCTIRYRQIGPLIDRQI 2060
Cdd:COG3209 583 GTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGT 662
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2061 FRFTEEGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAY 2140
Cdd:COG3209 663 TGTGTGVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2141 GRMKEVQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSP 2220
Cdd:COG3209 742 GTLTTTSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVIT 820
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2221 GNSARLTPL-----RYDLRDRITRLGDVQykmdEDGFLRQRggdiFEYNSAGLLIKAynRAGGWSVRYRYDGLGRRVSSK 2295
Cdd:COG3209 821 VGSGGGTDLqdrtyTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTSRT 890
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2296 SSHSHHLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYT 2375
Cdd:COG3209 891 DGGTTTYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYD 949
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865629 2376 AYGEIYMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhELWQHLSssnimpFNLYMFKNNNPISNS 2453
Cdd:COG3209 950 PFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-PIGLAGG------LNLYAYVGNNPVNYV 1020
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2374-2453 |
4.17e-11 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 60.98 E-value: 4.17e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2374 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwqhlsssnimPF------NLYMFKNN 2447
Cdd:TIGR03696 1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66
|
....*.
gi 1622865629 2448 NPISNS 2453
Cdd:TIGR03696 67 NPVNWV 72
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
825-855 |
2.16e-09 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 54.83 E-value: 2.16e-09
10 20 30
....*....|....*....|....*....|.
gi 1622865629 825 SMETACGDSKDNDGDGLVDCMDPDCCLQPLC 855
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1224-1555 |
1.32e-08 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 58.49 E-value: 1.32e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1224 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilELRNKDFRHSHSpahkyyLTTDPmSGAVFLSDTNSRRVFKI--K 1298
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIdpK 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1299 STVVvkdlvknsEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRID-QNGIISTLLG 1375
Cdd:COG4257 89 TGEI--------TTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1376 snDLTSARplscdsvmdisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAI 1455
Cdd:COG4257 141 --PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYAL 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1456 HATLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLA 1535
Cdd:COG4257 184 PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVA 237
|
330 340
....*....|....*....|
gi 1622865629 1536 VCADGELYVADLGNIRIRFI 1555
Cdd:COG4257 238 VDGDGRVWFAESGANRIVRF 257
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
1872-2290 |
2.30e-08 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 60.02 E-value: 2.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1872 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRIISRIFADGKTWSYTYLEKS---MVLLLHSQRQYIFEFDK 1941
Cdd:NF041261 401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYSLNVVSgdiTDITTPDGRETKFYYND 478
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1942 NDRLSSVTMPNvarqTLETIRsvgyyrniyqppegnasviqDFTEDGHLLhtfylgtgrrviykygklsklAETLYDTTK 2021
Cdd:NF041261 479 GNQLTSVTSPD----GLESRR--------------------EYDEPGRLV---------------------SETSRSGET 513
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2022 VSFTYDETAGMLKTINLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR 2101
Cdd:NF041261 514 TRYRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYR 581
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2102 -YDD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKK 2171
Cdd:NF041261 582 rYDNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTL 654
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2172 ELKvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYK 2246
Cdd:NF041261 655 TNE-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQ 727
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1622865629 2247 MDEDGFLRQrggdiFEYNSAGLLIkaynraggwSVRYRYDGLGR 2290
Cdd:NF041261 728 YDEHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
2100-2436 |
4.42e-06 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 52.70 E-value: 4.42e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2100 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2161
Cdd:NF041261 367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2162 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2232
Cdd:NF041261 443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2233 lrDRITRLGDVqyKMDEDGFLRQrggdiFEYNSAGLLIkAYNRAGGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2312
Cdd:NF041261 520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2313 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2392
Cdd:NF041261 587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1622865629 2393 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWQHLSSSNI 2436
Cdd:NF041261 659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
|
|
| PLN02919 |
PLN02919 |
haloacid dehalogenase-like hydrolase family protein |
1259-1559 |
1.42e-05 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 51.01 E-value: 1.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1259 RNKDFRHSHSPAhKY--YLTTDPMSGAVFLSDTNSRRVfkikstvVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1332
Cdd:PLN02919 556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1333 kateATLTNPRGITVDKFGLIYFVDGT---MIRRID-QNGIISTLLGS----NDLTSARPLScdsvmdiSQVrLEWPTDL 1404
Cdd:PLN02919 621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1405 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1452
Cdd:PLN02919 689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1453 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1504
Cdd:PLN02919 769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1622865629 1505 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1559
Cdd:PLN02919 847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
556-714 |
2.18e-05 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 48.85 E-value: 2.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 556 DNCPSNCYGNGDCISGTCH-----------------CFLGFLGPD---CGRASCpvlcsGNGQ----------YMKGRCL 605
Cdd:pfam19232 10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTPKascCGGVTC-----GAGQtcdaktntcvYVKGYCS 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 606 C-HSGWKGAECDVPTNQCIDVACSNHGT---CIMG-----------------TCIC-NPG-YK-GESC-----------E 650
Cdd:pfam19232 85 AdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGfgawiweldpatnsgvwRCRCaNGSlYNsAHECspladqtlcaaE 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 651 EVD-----------------------CMDPTCSGRGVC--VRGECHCSVGWGGTNCETPRAtcldqCSGHGTFLPDTGLC 705
Cdd:pfam19232 165 NLDpnalvpassvpafaaygwgnqpvLINKSTAGAAVPspLAGVCPCKPGWAGGSCTEDRT-----CNGRGTWNETTGQC 239
|
....*....
gi 1622865629 706 SCDPSWTGH 714
Cdd:pfam19232 240 ACNIDFSGH 248
|
|
| DSL |
pfam01414 |
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... |
738-781 |
4.99e-05 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 42.61 E-value: 4.99e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 1622865629 738 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 781
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
627-650 |
5.88e-04 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 39.54 E-value: 5.88e-04
10 20
....*....|....*....|....*...
gi 1622865629 627 CSNHGTCIMG----TCICNPGYKGESCE 650
Cdd:cd00054 11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1672-1704 |
5.96e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 39.50 E-value: 5.96e-04
10 20 30
....*....|....*....|....*....|...
gi 1622865629 1672 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1704
Cdd:pfam05593 5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| Ten_N |
pfam06484 |
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of ... |
36-332 |
2.43e-163 |
|
Teneurin Intracellular Region; This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
Pssm-ID: 461932 [Multi-domain] Cd Length: 367 Bit Score: 509.13 E-value: 2.43e-163
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 36 SLT-RRRDAERRYTSSSADSEEGKTP-QKSYSSSETLKAYDQDARLAYGSRVKDMVPQEAEEFCRTGANFTLRELGLGEV 113
Cdd:pfam06484 1 SLTkRRRDKERRYTSSSADSEECRVPtQKSYSSSETLKAFDHDSRMLYGNRVKDMVHKEADEFSRQGQNFSLRELGICEP 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 114 TSPHGTLYRTDIGLPHCGYSMGASSDADMEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENTETDHPG 193
Cdd:pfam06484 81 SPRHGLAYCTEMGLPHRGYSISTGSDADTETDGPMSPEHAVRLWGRGTKSGRSSCLSSRSNSALTLTDTEHENKSDNENG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 194 ----------------------------------------------------------------------GLQNHARLRT 203
Cdd:pfam06484 161 ppippsssssspveqhsppppslnenqrpllgnnashpildsdpdeefspnsylvrtgsgpqsapseqppNFQNHSRLRT 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 204 PPPPLSHTHTPNQHHAASINSLNRGNFTPRSNPSPAPTdHSLSGEPPASgaQEPAHAQDNWLLNSNIPLETRHFLFKPG- 282
Cdd:pfam06484 241 PPPPLPPPHKQNQHHHPSINSLNRSSLTNRRNPSPAPT-ASLPAELQST--QESVQLQDSWVLNSNVPLETRHFLFKTGt 317
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|
gi 1622865629 283 GTSPLFCTTSPGYPLTSSTVYSPPPRPLPRSTFARPAFNLKKPSKYCNWK 332
Cdd:pfam06484 318 GTTPLFCTASPGYPLTSGTVYSPPPRPLPRNTFSRPAFKLKKPYKYCSWK 367
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1214-1555 |
5.24e-46 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 169.63 E-value: 5.24e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1214 GLADGNKLLA----PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRHSHSPAHKYY----LTTDPmSGA 1283
Cdd:cd14953 11 GFSGGGGTAArfnsPSGVAVDAAGNLYVADRgnHRIRKITPDGVVTTVAGTGTAGFADGGGAAAQFNtpsgVAVDA-AGN 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1284 VFLSDTNSRRVFKIKSTVVVKdlvknseVVAGTGDQclpfddtRCGDGGKATEATLTNPRGITVDKFGLIYFVDGT--MI 1361
Cdd:cd14953 90 LYVADTGNHRIRKITPDGVVS-------TLAGTGTA-------GFSDDGGATAAQFNYPTGVAVDAAGNLYVADTGnhRI 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1362 RRIDQNGIISTLLGsndlTSARPLSCDSVMdiSQVRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGRpmh 1439
Cdd:cd14953 156 RKITPDGVVTTVAG----TGGAGYAGDGPA--TAAQFNNPTGVAVDAAGN-LYVADrgNHRIRKITPDGVVTTVAGT--- 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1440 cqvpGIDHFLLSKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcFSGD 1519
Cdd:cd14953 226 ----GTAGFSGDGGATAAQLNNPTGVAVDAAGNLYVADSGN---HRIRKITPAGVVTTVAGGGAG-----------FSGD 287
|
330 340 350
....*....|....*....|....*....|....*.
gi 1622865629 1520 DGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1555
Cdd:cd14953 288 GGPATSAQFNNPTGVAVDAAGNLYVADTGNNRIRKI 323
|
|
| Tox-GHH |
pfam15636 |
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH ... |
2677-2754 |
3.22e-38 |
|
GHH signature containing HNH/Endo VII superfamily nuclease toxin; A predicted toxin of the HNH/Endonuclease VII fold present in bacterial polymorphic toxin systems with a characteriztic sG[HQ]H signature motif. In bacterial polymorphic toxin systems, the toxin is exported by the type 2, type 6, type 7 or TcdB/TcaC-type secretion system. The metazoan teneurin proteins possess an inactive of this domain at their C-terminus.
Pssm-ID: 464783 Cd Length: 78 Bit Score: 138.13 E-value: 3.22e-38
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865629 2677 EEKARVLELARQRAVRQAWAREQQRLREGEEGLRAWTEGEKQQVLSTGRVQGYDGFFVISVEQYPELSDSANNIHFMR 2754
Cdd:pfam15636 1 EERKRLLEHAKKRAVREAWHRERQLLRNGLPGSRDWTDEEKEELLSTGSVPGYDGEYIHPVEQYPELADDPSNIRFRK 78
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... |
1581-2453 |
7.28e-34 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 143.74 E-value: 7.28e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1581 FDTTGKHLYTQSLPTGDYLYNFTYTGDGDVTLITDNNGNMVNVRRDSTGMPLWLVVPDGQVYWVTMGTNSALKSVTTQGH 1660
Cdd:COG3209 184 VGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTT 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1661 ELAMMTYHGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAF 1740
Cdd:COG3209 264 GAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTG 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1741 YTLLQDQVRNSYYIGADGSLRLLLANGMEVALQTEPHLLAGTVNPTVGKRNVTLPIDNGLNLVEWRQRKEQARGQVTVFG 1820
Cdd:COG3209 344 GTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAA 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1821 RRLRVHNRNLLSLDFDRVTRTEKIYDDHRKFTLRILYDQAGRPSlWSPSSRLNGVNVTYSPGGHIAGIQRGIMSERMEYD 1900
Cdd:COG3209 424 GALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGA-ATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTL 502
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1901 QAGRIISRIFADGKTWSYTYLEKSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASV 1980
Cdd:COG3209 503 DDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTT 582
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1981 IQDFTEDGHLLHTFYLGTGRRVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTINLQNEGFTCTIRYRQIGPLIDRQI 2060
Cdd:COG3209 583 GTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGT 662
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2061 FRFTEEGMVNARFDYNYDNSFRVTSmQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAY 2140
Cdd:COG3209 663 TGTGTGVTAGLTTLATGGTTVGGGT-GTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTT 741
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2141 GRMKEVQYEIFRSLMYWmTVQYDNMGRVVKKELKVGPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSP 2220
Cdd:COG3209 742 GTLTTTSTTTTTTAGAL-TYTYDALGRLTSETTPGGVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVIT 820
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2221 GNSARLTPL-----RYDLRDRITRLGDVQykmdEDGFLRQRggdiFEYNSAGLLIKAynRAGGWSVRYRYDGLGRRVSSK 2295
Cdd:COG3209 821 VGSGGGTDLqdrtyTYDAAGNITSITDAL----RAGTLTQT----YTYDALGRLTSA--TDPGTTESYTYDANGNLTSRT 890
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2296 SSHSHHLQFFYADLtnPTKVTHlynhSSSEITSLYYDLQGHlfamelssgdefyiaCDNIGTPLAVFSGTGLMIKQILYT 2375
Cdd:COG3209 891 DGGTTTYTYDALGR--LVSVTK----PDGTTTTYTYDALGH---------------TDHLGSVRALTDASGQVVWRYDYD 949
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865629 2376 AYGEIYMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhELWQHLSssnimpFNLYMFKNNNPISNS 2453
Cdd:COG3209 950 PFGNLLAETSGAAANPLRFTGQEYDAETGLYYNGARYYDPALGRFLSPD-PIGLAGG------LNLYAYVGNNPVNYV 1020
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1219-1555 |
4.08e-18 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 86.99 E-value: 4.08e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1219 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRHSHSPAHkyyLTTDPmSGAVFLSDTNSRRVFK 1296
Cdd:cd05819 5 GELNNPQGIAVDSSGNIYVADTgnNRIQVFDPDGNFITSFGSFGSGDGQFNEPAG---VAVDS-DGNLYVADTGNHRIQK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1297 IKStvvvkdlvkNSEVVAGTGdqclpfddtrcGDGGKATEatLTNPRGITVDKFGLIYFVDgTM---IRRIDQNGIISTL 1373
Cdd:cd05819 81 FDP---------DGNFLASFG-----------GSGDGDGE--FNGPRGIAVDSSGNIYVAD-TGnhrIQKFDPDGEFLTT 137
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1374 LGSNDLTSARplscdsvmdisqvrLEWPTDLAINPmDNSLYVLDnnvvlqiSENHQVRIVAgrpmhcqvPGiDHFLL--- 1450
Cdd:cd05819 138 FGSGGSGPGQ--------------FNGPTGVAVDS-DGNIYVAD-------TGNHRIQVFD--------PD-GNFLTtfg 186
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1451 SKVAIHATLESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGAPSGcdckndancdcfsgddgyaKDAKLNT 1530
Cdd:cd05819 187 STGTGPGQFNYPTGIAVDSDGNIYVADSGN---NRVQVFDPDGAGFGGNGNFLG-------------------SDGQFNR 244
|
330 340
....*....|....*....|....*
gi 1622865629 1531 PSSLAVCADGELYVADLGNIRIRFI 1555
Cdd:cd05819 245 PSGLAVDSDGNLYVADTGNNRIQVF 269
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1339-1623 |
6.97e-18 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 86.22 E-value: 6.97e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1339 LTNPRGITVDKFGLIYFVDGTM--IRRIDQNGIISTLLGSNDltsarplscdsvmdISQVRLEWPTDLAINPmDNSLYVL 1416
Cdd:cd05819 7 LNNPQGIAVDSSGNIYVADTGNnrIQVFDPDGNFITSFGSFG--------------SGDGQFNEPAGVAVDS-DGNLYVA 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1417 D--NNVVLQISENHQVRIVAGRPmhcqvpGIDHFLLSkvaihatleSATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1494
Cdd:cd05819 72 DtgNHRIQKFDPDGNFLASFGGS------GDGDGEFN---------GPRGIAVDSSGNIYVADTGN---HRIQKFDPDGE 133
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1495 ISLVAGAPSGCDckndancdcfsgddgyakdAKLNTPSSLAVCADGELYVADLGNIRIRFIrknkpflntqnmyelsSPI 1574
Cdd:cd05819 134 FLTTFGSGGSGP-------------------GQFNGPTGVAVDSDGNIYVADTGNHRIQVF----------------DPD 178
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1622865629 1575 DQELYLFDTTGKHLYTQSLPTGDylynfTYTGDGDVtLITDNNGNMVNV 1623
Cdd:cd05819 179 GNFLTTFGSTGTGPGQFNYPTGI-----AVDSDGNI-YVADSGNNRVQV 221
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1211-1424 |
2.05e-14 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 77.19 E-value: 2.05e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1211 SCNGLADGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNilelrnkdfrhshspahkyylttdpmsgavflsd 1288
Cdd:cd14953 176 AGDGPATAAQFNNPTGVAVDAAGNLYVADRgnHRIRKITPDGVVTT---------------------------------- 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1289 tnsrrvfkikstvvvkdlvknsevVAGTGDQclPFddtrcGDGGKATEATLTNPRGITVDKFGLIYFVD---GTmIRRID 1365
Cdd:cd14953 222 ------------------------VAGTGTA--GF-----SGDGGATAAQLNNPTGVAVDAAGNLYVADsgnHR-IRKIT 269
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622865629 1366 QNGIISTLLGSndlTSARPLSCDSVmdiSQVRLEWPTDLAINPmDNSLYVLD--NNVVLQI 1424
Cdd:cd14953 270 PAGVVTTVAGG---GAGFSGDGGPA---TSAQFNNPTGVAVDA-AGNLYVADtgNNRIRKI 323
|
|
| NHL |
cd05819 |
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in ... |
1217-1486 |
2.16e-13 |
|
NHL repeat unit of beta-propeller proteins; The NHL(NCL-1, HT2A and LIN-41)-repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures. The repeats have a catalytic activity in Peptidyl-glycine alpha-amidating monooxygenase; proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localized to the repeats. Tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is mediated by the NHL repeats.
Pssm-ID: 271320 [Multi-domain] Cd Length: 269 Bit Score: 73.12 E-value: 2.16e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1217 DGNKLLAPVALTCGSDGSLYVGDF--NYIRRIFPSGNVTN---ILELRNKDFRHshsPahkYYLTTDPmSGAVFLSDTNS 1291
Cdd:cd05819 50 GDGQFNEPAGVAVDSDGNLYVADTgnHRIQKFDPDGNFLAsfgGSGDGDGEFNG---P---RGIAVDS-SGNIYVADTGN 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1292 RRVFKIKStvvvkdlvkNSEVVAGTGdqclpfddtrcgdGGKATEATLTNPRGITVDKFGLIYFVDGT--MIRRIDQNGI 1369
Cdd:cd05819 123 HRIQKFDP---------DGEFLTTFG-------------SGGSGPGQFNGPTGVAVDSDGNIYVADTGnhRIQVFDPDGN 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1370 ISTLLGSNDLTSArplscdsvmdisqvRLEWPTDLAINPMDNsLYVLD--NNVVLQISENHQVRIVAGrpmhcqvpgidh 1447
Cdd:cd05819 181 FLTTFGSTGTGPG--------------QFNYPTGIAVDSDGN-IYVADsgNNRVQVFDPDGAGFGGNG------------ 233
|
250 260 270
....*....|....*....|....*....|....*....
gi 1622865629 1448 fllSKVAIHATLESATALAVSHNGVLYIAETDEKKINRI 1486
Cdd:cd05819 234 ---NFLGSDGQFNRPSGLAVDSDGNLYVADTGNNRIQVF 269
|
|
| Rhs_assc_core |
TIGR03696 |
RHS repeat-associated core domain; This model represents a conserved unique core sequence ... |
2374-2453 |
4.17e-11 |
|
RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.
Pssm-ID: 274730 [Multi-domain] Cd Length: 77 Bit Score: 60.98 E-value: 4.17e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2374 YTAYGEIyMDTNPNFQIIIGYHGGLYDPLTKLVHMGRRDYDVLAGRWTSPDhelwqhlsssnimPF------NLYMFKNN 2447
Cdd:TIGR03696 1 YDPYGEV-LSESGAAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPD-------------PIglggglNLYAYVGN 66
|
....*.
gi 1622865629 2448 NPISNS 2453
Cdd:TIGR03696 67 NPVNWV 72
|
|
| acid_disulf_rpt |
NF033662 |
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with ... |
825-855 |
2.16e-09 |
|
acidic double-disulfide repeat; The acidic double-disulfide repeat is an Asp-rich repeat with four nearly invariant Cys residues in a repeat length of about 35 amino acids.
Pssm-ID: 411265 [Multi-domain] Cd Length: 32 Bit Score: 54.83 E-value: 2.16e-09
10 20 30
....*....|....*....|....*....|.
gi 1622865629 825 SMETACGDSKDNDGDGLVDCMDPDCCLQPLC 855
Cdd:NF033662 2 ATDTTCSDGIDNDGDGLTDCADPDCAGNPVC 32
|
|
| NHL_PKND_like |
cd14952 |
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein ... |
1281-1552 |
5.06e-09 |
|
NHL repeat domain of the protein kinase PknD; PknD is a mycobacterial transmembrane protein with a cytosolic kinase domain and an extracellular sensor domain that contains NHL repeats. It plays a key role in the development of central nervous system tuberculosis, by mediating the invasion of host brain endothelia. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271322 [Multi-domain] Cd Length: 247 Bit Score: 59.53 E-value: 5.06e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1281 SGAVFLSDTNSRRVFKikstvvvkdlvknseVVAGTGDQC-LPFDDtrcgdggkateatLTNPRGITVDKFGLIYFVDGt 1359
Cdd:cd14952 20 AGNVYVADSGNNRVLK---------------LAAGSTTQTvLPFTG-------------LYQPQGVAVDAAGTVYVTDF- 70
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1360 mirriDQNGIISTLLGSNDLTsarPLSCDSvmdisqvrLEWPTDLAINPMDNsLYVLD--NNVVLqisenhqvRIVAGRP 1437
Cdd:cd14952 71 -----GNNRVLKLAAGSTTQT---VLPFTG--------LNDPTGVAVDAAGN-VYVADtgNNRVL--------KLAAGSN 125
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1438 MHCQVPgidhFllskvaihATLESATALAVSHNGVLYIAETDEkkiNRIRQvttsgeisLVAGA------Psgcdcknda 1511
Cdd:cd14952 126 TQTVLP----F--------TGLSNPDGVAVDGAGNVYVTDTGN---NRVLK--------LAAGSttqtvlP--------- 173
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1622865629 1512 ncdcFSGddgyakdakLNTPSSLAVCADGELYVADLGNIRI 1552
Cdd:cd14952 174 ----FTG---------LNSPSGVAVDTAGNVYVTDHGNNRV 201
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1224-1555 |
1.32e-08 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 58.49 E-value: 1.32e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1224 PVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTnilELRNKDFRHSHSpahkyyLTTDPmSGAVFLSDTNSRRVFKI--K 1298
Cdd:COG4257 19 PRDVAVDPDGAVWFTDQggGRIGRLDPaTGEFT---EYPLGGGSGPHG------IAVDP-DGNLWFTDNGNNRIGRIdpK 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1299 STVVvkdlvknsEVVAGTGDQCLPFddtrcgdggkateatltnprGITVDKFGLIYFVDGT--MIRRID-QNGIISTLLG 1375
Cdd:COG4257 89 TGEI--------TTFALPGGGSNPH--------------------GIAFDPDGNLWFTDQGgnRIGRLDpATGEVTEFPL 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1376 snDLTSARplscdsvmdisqvrlewPTDLAINPmDNSLYVLDNnvvlqisENHQVRIVAGRPMHcqvpgidhflLSKVAI 1455
Cdd:COG4257 141 --PTGGAG-----------------PYGIAVDP-DGNLWVTDF-------GANAIGRIDPDTGT----------LTEYAL 183
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1456 HATLESATALAVSHNGVLYIAETDEKKINRIRqvTTSGEISLVAGAPSGCDckndancdcfsgddgyakdaklntPSSLA 1535
Cdd:COG4257 184 PTPGAGPRGLAVDPDGNLWVADTGSGRIGRFD--PKTGTVTEYPLPGGGAR------------------------PYGVA 237
|
330 340
....*....|....*....|
gi 1622865629 1536 VCADGELYVADLGNIRIRFI 1555
Cdd:COG4257 238 VDGDGRVWFAESGANRIVRF 257
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
1872-2290 |
2.30e-08 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 60.02 E-value: 2.30e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1872 LNGVNVTYSPGGhiAGIQRGIMSE-------RMEYDQAGRIISRIFADGKTWSYTYLEKS---MVLLLHSQRQYIFEFDK 1941
Cdd:NF041261 401 LNRREVLHTEGE--GGLKRVVKKEhadgsvtRSGYDAAGRLTAQTDAAGRRTEYSLNVVSgdiTDITTPDGRETKFYYND 478
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1942 NDRLSSVTMPNvarqTLETIRsvgyyrniyqppegnasviqDFTEDGHLLhtfylgtgrrviykygklsklAETLYDTTK 2021
Cdd:NF041261 479 GNQLTSVTSPD----GLESRR--------------------EYDEPGRLV---------------------SETSRSGET 513
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2022 VSFTYDETAGMLKTINLQNEGFTCTIRYRQIGplidrQIFRFTEEGMVNARFDYNydnsfRVTSMQAVINETPlpIDLYR 2101
Cdd:NF041261 514 TRYRYDDPHSELPATTTDATGSTKQMTWSRYG-----QLLAFTDCSGYQTRYEYD-----RFGQMTAVHREEG--ISTYR 581
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2102 -YDD----VSGKTEQfGKFGVIYY----DINQIITTAVMTHTKHFDAYGR-MKEVQYEIFRSLmywmtvQYDNMGRVVKK 2171
Cdd:NF041261 582 rYDNrgqlTSVKDAQ-GRETRYEYnaagDLTAVITPDGNRSETQYDAWGKaVSTTQGGLTRSM------EYDAAGRITTL 654
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2172 ELKvgpyaNTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNlhLLSPGNSARLTPLRYDLRDRITRL---GDV--QYK 2246
Cdd:NF041261 655 TNE-----NGSHSTFLYDALDRLVQQRGFDGRTQRYHYDLTGK--LTQSEDEGLVTLWHYDESDRITHRtvnGEPaeQWQ 727
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1622865629 2247 MDEDGFLRQrggdiFEYNSAGLLIkaynraggwSVRYRYDGLGR 2290
Cdd:NF041261 728 YDEHGWLTD-----ISHLSEGHRV---------AVHYGYDDKGR 757
|
|
| NHL_like_3 |
cd14956 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1341-1552 |
4.03e-07 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271326 [Multi-domain] Cd Length: 274 Bit Score: 54.21 E-value: 4.03e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1341 NPRGITVDKFGLIYFVD--GTMIRRIDQNGIISTLLGSndlTSARPLScdsvmdisqvrLEWPTDLAINPmDNSLYVLDn 1418
Cdd:cd14956 108 APRGVAVDADGNLYVADfgNQRIQKFDPDGSFLRQWGG---TGIEPGS-----------FNYPRGVAVDP-DGTLYVAD- 171
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1419 nvvlqiSENHQVrivagrpmhcQVPGIDHFLLSKVAIHAT----LESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGE 1494
Cdd:cd14956 172 ------TYNDRI----------QVFDNDGAFLRKWGGRGTgpgqFNYPYGIAIDPDGNVFVADFGN---NRIQKFTADGT 232
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865629 1495 ISLVAGAPSGcdckndancdcfsgddgyaKDAKLNTPSSLAVCADGELYVADLGNIRI 1552
Cdd:cd14956 233 FLTSWGSPGT-------------------GPGQFKNPWGVVVDADGTVYVADSNNNRV 271
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
2100-2436 |
4.42e-06 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 52.70 E-value: 4.42e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2100 YRYDDVSGKTEQFGKFGVIY---YDINQIITTAVMT-----HT----------KHFDAYGRMKEVQYEIFRSLmywmTVQ 2161
Cdd:NF041261 367 YRYDDTGRVTEQLNPAGLSYryqYEQDRITITDSLNrrevlHTegegglkrvvKKEHADGSVTRSGYDAAGRL----TAQ 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2162 YDNMGRVVKKELKV---------GPYANTTRYSYeyDADGQLQTVSINDKPLWRYSYDLNGNLhLLSPGNSARLTPLRYD 2232
Cdd:NF041261 443 TDAAGRRTEYSLNVvsgditditTPDGRETKFYY--NDGNQLTSVTSPDGLESRREYDEPGRL-VSETSRSGETTRYRYD 519
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2233 lrDRITRLGDVqyKMDEDGFLRQrggdiFEYNSAGLLIkAYNRAGGWSVRYRYDGLGRRVSSKSSHSHHLqffYADLTNP 2312
Cdd:NF041261 520 --DPHSELPAT--TTDATGSTKQ-----MTWSRYGQLL-AFTDCSGYQTRYEYDRFGQMTAVHREEGIST---YRRYDNR 586
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 2313 TKVTHLYNHSSSEiTSLYYDLQGHLFAMELSSGDEFYIACDNIGTPLAVFSGtGLMiKQILYTAYGEIYMDTNPNfqiii 2392
Cdd:NF041261 587 GQLTSVKDAQGRE-TRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLT-RSMEYDAAGRITTLTNEN----- 658
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1622865629 2393 GYHGG-LYDPLTKLVHMG-------RRDYDvLAGRWTSPDHE----LWQHLSSSNI 2436
Cdd:NF041261 659 GSHSTfLYDALDRLVQQRgfdgrtqRYHYD-LTGKLTQSEDEglvtLWHYDESDRI 713
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1337-1555 |
5.85e-06 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 50.40 E-value: 5.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1337 ATLTNPRGITVDKFGLIYFVD--GTMIRRID-QNGIISTllgsndltsarplscdsvmdISQVRLEWPTDLAINPmDNSL 1413
Cdd:COG4257 14 APGSGPRDVAVDPDGAVWFTDqgGGRIGRLDpATGEFTE--------------------YPLGGGSGPHGIAVDP-DGNL 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1414 YVLD--NNVVLQIS-ENHQVRIVAGrpmhcqvPGIDHFLlskvaihatlesaTALAVSHNGVLYIAETDekkINRIRQVT 1490
Cdd:COG4257 73 WFTDngNNRIGRIDpKTGEITTFAL-------PGGGSNP-------------HGIAFDPDGNLWFTDQG---GNRIGRLD 129
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1491 T-SGEISLV-----AGAPSGCDCKND---------ANC-DCFSGDDG----YAKDAKLNTPSSLAVCADGELYVADLGNI 1550
Cdd:COG4257 130 PaTGEVTEFplptgGAGPYGIAVDPDgnlwvtdfgANAiGRIDPDTGtlteYALPTPGAGPRGLAVDPDGNLWVADTGSG 209
|
....*
gi 1622865629 1551 RIRFI 1555
Cdd:COG4257 210 RIGRF 214
|
|
| PLN02919 |
PLN02919 |
haloacid dehalogenase-like hydrolase family protein |
1259-1559 |
1.42e-05 |
|
haloacid dehalogenase-like hydrolase family protein
Pssm-ID: 215497 [Multi-domain] Cd Length: 1057 Bit Score: 51.01 E-value: 1.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1259 RNKDFRHSHSPAhKY--YLTTDPMSGAVFLSDTNSRRVfkikstvVVKDLVKNSEV-VAGTGDQCL---PFDDtrcgdgg 1332
Cdd:PLN02919 556 KDNDPRLLTSPL-KFpgKLAIDLLNNRLFISDSNHNRI-------VVTDLDGNFIVqIGSTGEEGLrdgSFED------- 620
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1333 kateATLTNPRGITVDKFGLIYFVDGT---MIRRID-QNGIISTLLGS----NDLTSARPLScdsvmdiSQVrLEWPTDL 1404
Cdd:PLN02919 621 ----ATFNRPQGLAYNAKKNLLYVADTenhALREIDfVNETVRTLAGNgtkgSDYQGGKKGT-------SQV-LNSPWDV 688
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1405 AINPMDNSLYV------------LDNNVVLQISENHQVRIVAGR----PMHCQVPGI------DHFLLSK---------- 1452
Cdd:PLN02919 689 CFEPVNEKVYIamagqhqiweynISDGVTRVFSGDGYERNLNGSsgtsTSFAQPSGIslspdlKELYIADsesssirald 768
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1453 ----------------------------VAIHATLESATALAVSHNGVLYIAETDEKKINRIRQVTtsGEISLVAGAPSG 1504
Cdd:PLN02919 769 lktggsrllaggdptfsdnlfkfgdhdgVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPAT--KRVTTLAGTGKA 846
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*
gi 1622865629 1505 cdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNK 1559
Cdd:PLN02919 847 ------------GFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNK 889
|
|
| DUF5885 |
pfam19232 |
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown ... |
556-714 |
2.18e-05 |
|
Family of unknown function (DUF5885); This is a family of uncharacterized proteins of unknown function found in viruses.
Pssm-ID: 437064 Cd Length: 265 Bit Score: 48.85 E-value: 2.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 556 DNCPSNCYGNGDCISGTCH-----------------CFLGFLGPD---CGRASCpvlcsGNGQ----------YMKGRCL 605
Cdd:pfam19232 10 DDCTPPCGGTQVCIDRQCKdntlacttdaqcgtcmtCVAGACTPKascCGGVTC-----GAGQtcdaktntcvYVKGYCS 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 606 C-HSGWKGAECDVPTNQCIDVACSNHGT---CIMG-----------------TCIC-NPG-YK-GESC-----------E 650
Cdd:pfam19232 85 AdHPCPSGSACDTAKNACIAQPPYGPDSgkgCVRGfgawiweldpatnsgvwRCRCaNGSlYNsAHECspladqtlcaaE 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 651 EVD-----------------------CMDPTCSGRGVC--VRGECHCSVGWGGTNCETPRAtcldqCSGHGTFLPDTGLC 705
Cdd:pfam19232 165 NLDpnalvpassvpafaaygwgnqpvLINKSTAGAAVPspLAGVCPCKPGWAGGSCTEDRT-----CNGRGTWNETTGQC 239
|
....*....
gi 1622865629 706 SCDPSWTGH 714
Cdd:pfam19232 240 ACNIDFSGH 248
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1342-1575 |
2.76e-05 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 48.42 E-value: 2.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1342 PRGITVDKFGLIYFVD--GTMIRRIDQNGIISTLLGSNDltsarplscdsvmdISQVRLEWPTDLAINPMDNsLYVLDnn 1419
Cdd:cd14957 20 PRGIAVDSAGNIYVADtgNNRIQVFTSSGVYSYSIGSGG--------------TGSGQFNSPYGIAVDSNGN-IYVAD-- 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1420 vvlqiSENHQVRIvagrpmhcqvpgidhFLLSKVAIHA---------TLESATALAVSHNGVLYIAETDEkkiNRIrQVT 1490
Cdd:cd14957 83 -----TDNNRIQV---------------FNSSGVYQYSigtggsgdgQFNGPYGIAVDSNGNIYVADTGN---HRI-QVF 138
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1491 TSgeislvAGAPsgcdckndancdCFSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFIRKNKPFLNT-----Q 1565
Cdd:cd14957 139 TS------SGTF------------SYSIGSGGTGPGQFNGPQGIAVDSDGNIYVADTGNHRIQVFTSSGTFQYTfgssgS 200
|
250
....*....|
gi 1622865629 1566 NMYELSSPID 1575
Cdd:cd14957 201 GPGQFSDPYG 210
|
|
| NHL_like_3 |
cd14956 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1331-1552 |
3.86e-05 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271326 [Multi-domain] Cd Length: 274 Bit Score: 48.05 E-value: 3.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1331 GGKATEA-TLTNPRGITVDKFGLIYFVDGT--MIRRIDQNGIISTLLGSNdltSARPLSCDSvmdisqvrlewPTDLAIN 1407
Cdd:cd14956 50 GTTGDGPgQFGRPRGLAVDKDGWLYVADYWgdRIQVFTLTGELQTIGGSS---GSGPGQFNA-----------PRGVAVD 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1408 PmDNSLYVLD--NNVVLQISENHQ-VRIVAGRPmhcQVPGidHFLlskvaihatleSATALAVSHNGVLYIAETdekKIN 1484
Cdd:cd14956 116 A-DGNLYVADfgNQRIQKFDPDGSfLRQWGGTG---IEPG--SFN-----------YPRGVAVDPDGTLYVADT---YND 175
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1622865629 1485 RIRQVTTSGEISLVAGAPSGcdckndancdcFSGDdgyakdakLNTPSSLAVCADGELYVADLGNIRI 1552
Cdd:cd14956 176 RIQVFDNDGAFLRKWGGRGT-----------GPGQ--------FNYPYGIAIDPDGNVFVADFGNNRI 224
|
|
| NHL_like_2 |
cd14957 |
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and ... |
1460-1619 |
4.32e-05 |
|
Uncharacterized NHL-repeat domain in bacterial and archaeal proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271327 [Multi-domain] Cd Length: 280 Bit Score: 48.03 E-value: 4.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1460 ESATALAVSHNGVLYIAETDEkkiNRIRQVTTSGEISLVAGapsgcdckndancdcfSGDDGyakDAKLNTPSSLAVCAD 1539
Cdd:cd14957 65 NSPYGIAVDSNGNIYVADTDN---NRIQVFNSSGVYQYSIG----------------TGGSG---DGQFNGPYGIAVDSN 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1540 GELYVADLGNIRIRFIRKNKPFLNT-----QNMYELSSP----IDQE--LYLFDTTGK--HLYTqslPTGDYLYNFTYTG 1606
Cdd:cd14957 123 GNIYVADTGNHRIQVFTSSGTFSYSigsggTGPGQFNGPqgiaVDSDgnIYVADTGNHriQVFT---SSGTFQYTFGSSG 199
|
170
....*....|....*....
gi 1622865629 1607 DGDVTLIT------DNNGN 1619
Cdd:cd14957 200 SGPGQFSDpygiavDSDGN 218
|
|
| DSL |
pfam01414 |
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain ... |
738-781 |
4.99e-05 |
|
Delta serrate ligand; This family has been redefined to correspond to the EGF-like domain defined by structure.
Pssm-ID: 460202 Cd Length: 46 Bit Score: 42.61 E-value: 4.99e-05
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 1622865629 738 CEDGWMGAACDqRACHPRCAE--HGTC-RDGKCECSPGWNGEHCTIA 781
Cdd:pfam01414 1 CDENYYGSTCS-KFCRPRDDKfgHYTCdANGNKVCLPGWTGPYCDKP 46
|
|
| NHL_like_1 |
cd14953 |
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat ... |
1495-1555 |
7.03e-05 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; This bacterial family of NHL-repeat domains is found in a variety of domain architectures. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271323 [Multi-domain] Cd Length: 323 Bit Score: 47.52 E-value: 7.03e-05
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622865629 1495 ISLVAGAPSGcdckndancdcfSGDDGYAKDAKLNTPSSLAVCADGELYVADLGNIRIRFI 1555
Cdd:cd14953 1 VSTVAGSGTA------------GFSGGGGTAARFNSPSGVAVDAAGNLYVADRGNHRIRKI 49
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1224-1365 |
7.59e-05 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 47.32 E-value: 7.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1224 PVALTCGSDGSLYVGDF--NYIRRIFPSGNVTNILELRNKDFRhshsPahkYYLTTDPmSGAVFLSDTNSRRVFKIkstv 1301
Cdd:COG4257 147 PYGIAVDPDGNLWVTDFgaNAIGRIDPDTGTLTEYALPTPGAG----P---RGLAVDP-DGNLWVADTGSGRIGRF---- 214
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622865629 1302 vvkdlvknsevvagtgdqclpfdDTRCGDGGK-ATEATLTNPRGITVDKFGLIYFVDGT--MIRRID 1365
Cdd:COG4257 215 -----------------------DPKTGTVTEyPLPGGGARPYGVAVDGDGRVWFAESGanRIVRFD 258
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
1668-1708 |
2.88e-04 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 40.65 E-value: 2.88e-04
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1622865629 1668 HGNSGLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQVSSF 1708
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
|
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1466-1556 |
3.26e-04 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 45.65 E-value: 3.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1466 AVSHNGVLYIAETdekKINRIRQV-TTSGEISLVAGapsgcdckndancdcfSGDDGYA-KDAKLNTPSSLAVCADGELY 1543
Cdd:cd14951 202 AALPDGSVYVADT---YNHKIKRVdPATGEVSTLAG----------------TGKAGYKdLEAQFSEPSGLVVDGDGRLY 262
|
90
....*....|...
gi 1622865629 1544 VADLGNIRIRFIR 1556
Cdd:cd14951 263 VADTNNHRIRRLD 275
|
|
| NHL_like_5 |
cd14963 |
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) ... |
1220-1478 |
5.40e-04 |
|
Uncharacterized NHL-repeat domain in bacterial proteins; The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271333 [Multi-domain] Cd Length: 268 Bit Score: 44.59 E-value: 5.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1220 KLLAPVALTCGSDGSLYVGDFnYIRRI--F-PSGNVTNILElRNKDFRHSHSPAHkyyLTTDpmSGAVFLSDTNsrrvfk 1296
Cdd:cd14963 54 EFKYPYGIAVDSDGNIYVADL-YNGRIqvFdPDGKFLKYFP-EKKDRVKLISPAG---LAID--DGKLYVSDVK------ 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1297 iKSTVVVKDLvknsevvagTGDQCLPFddtrcGDGGKAtEATLTNPRGITVDKFGLIYFVDgTMIRRI---DQNG-IIST 1372
Cdd:cd14963 121 -KHKVIVFDL---------EGKLLLEF-----GKPGSE-PGELSYPNGIAVDEDGNIYVAD-SGNGRIqvfDKNGkFIKE 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1373 LLGSNDLTSArplscdsvmdisqvrLEWPTDLAINPmDNSLYVLDN--NVVLQISENHQVRIVAGRpmhcqvPGIDhfll 1450
Cdd:cd14963 184 LNGSPDGKSG---------------FVNPRGIAVDP-DGNLYVVDNlsHRVYVFDEQGKELFTFGG------RGKD---- 237
|
250 260
....*....|....*....|....*...
gi 1622865629 1451 skvaiHATLESATALAVSHNGVLYIAET 1478
Cdd:cd14963 238 -----DGQFNLPNGLFIDDDGRLYVTDR 260
|
|
| EGF_CA |
cd00054 |
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ... |
627-650 |
5.88e-04 |
|
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Pssm-ID: 238011 Cd Length: 38 Bit Score: 39.54 E-value: 5.88e-04
10 20
....*....|....*....|....*...
gi 1622865629 627 CSNHGTCIMG----TCICNPGYKGESCE 650
Cdd:cd00054 11 CQNGGTCVNTvgsyRCSCPPGYTGRNCE 38
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1672-1704 |
5.96e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 39.50 E-value: 5.96e-04
10 20 30
....*....|....*....|....*....|...
gi 1622865629 1672 GLLATKSNENGWTTFYEYDSFGRLTNVTFPTGQ 1704
Cdd:pfam05593 5 GRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
626-649 |
9.75e-04 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 38.48 E-value: 9.75e-04
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
756-778 |
3.89e-03 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 36.94 E-value: 3.89e-03
|
| NHL-2_like |
cd14951 |
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL ... |
1332-1434 |
3.99e-03 |
|
NHL repeat domain of NHL repeat-containing protein 2 and similar proteins; NHL repeat-containing protein 2 (NHLRC2) and related bacterial proteins; members of this eukaryotic and bacterial family are uncharacterized, the NHL repeat domain is found C-terminally of a thioredoxin domain. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies, typically as 6 instances. It is about 40 residues long and resembles the WD repeat and other beta-propeller structures.
Pssm-ID: 271321 [Multi-domain] Cd Length: 334 Bit Score: 42.18 E-value: 3.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1332 GKATEATLTNPRGITVDKFGLIYFVDgTM---IRRID-QNGIISTLLGSNDLTSarplscdsvmDISQVRLEWPTDLAIN 1407
Cdd:cd14951 188 GPGAEALLQHPLGVAALPDGSVYVAD-TYnhkIKRVDpATGEVSTLAGTGKAGY----------KDLEAQFSEPSGLVVD 256
|
90 100
....*....|....*....|....*..
gi 1622865629 1408 PmDNSLYVLDNNvvlqiseNHQVRIVA 1434
Cdd:cd14951 257 G-DGRLYVADTN-------NHRIRRLD 275
|
|
| Vgb |
COG4257 |
Streptogramin lyase [Defense mechanisms]; |
1219-1297 |
4.16e-03 |
|
Streptogramin lyase [Defense mechanisms];
Pssm-ID: 443399 [Multi-domain] Cd Length: 270 Bit Score: 41.54 E-value: 4.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622865629 1219 NKLLAPVALTCGSDGSLYVGDF--NYIRRIFP-SGNVTNilelrnkdFRHSHSPAHKYYLTTDPmSGAVFLSDTNSRRVF 1295
Cdd:COG4257 185 TPGAGPRGLAVDPDGNLWVADTgsGRIGRFDPkTGTVTE--------YPLPGGGARPYGVAVDG-DGRVWFAESGANRIV 255
|
..
gi 1622865629 1296 KI 1297
Cdd:COG4257 256 RF 257
|
|
| EGF |
pfam00008 |
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ... |
627-647 |
6.23e-03 |
|
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
Pssm-ID: 394967 Cd Length: 31 Bit Score: 36.59 E-value: 6.23e-03
|
| EGF_2 |
pfam07974 |
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins. |
692-716 |
6.42e-03 |
|
EGF-like domain; This family contains EGF domains found in a variety of extracellular proteins.
Pssm-ID: 400365 Cd Length: 26 Bit Score: 36.17 E-value: 6.42e-03
|
|