Conserved domains on  [gi|446425364|ref|WP_000503219|]

RHS repeat-associated core domain-containing protein [Salmonella enterica]

Protein Classification

Graphical summary


List of domain hits

Name Accession Description Interval E-value
RHS_core super family cl49306
RHS element core protein;
386-1445 4.53e-110

RHS element core protein;


The actual alignment was detected with superfamily member NF041261:

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 380.12  E-value: 4.53e-110
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  386 GINMMVQKAGSALNRPVNAATGAKYLAGDDDVdfSLPGHFPLEWQRTYSSRDERTE---GMFGRGWSVLYEVCLERTpdn 462
Cdd:NF041261   33 GVACSVCPGGMTSGNPVNPLLGAKVLPGETDI--ALPGPLPFILSRTYSSYRTRTPapvGVFGPGWKAPSDIRLQLR--- 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  463 pdENCMTYVAPMGRRIDLQAVEPGSGFYSPGEGLAVRR----------------------------------SEQGHWLI 508
Cdd:NF041261  108 --DDGLILNDNGGRSIHFEPLFPGEAVYSRSESLWLVRggvaaqpdghtlaalwqalpedirlsphlylatnSAQGPWWI 185
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  509 SSddGVYRLFEAD-----PFSPQRRrLKMLGDRNSNCQHLTYDNHGRLV-EISGDRQRPCIRLHYELAAHPQRVTRIFRH 582
Cdd:NF041261  186 LG--WSERVPGADevlpaPLPPYRV-LTGMVDRFGRTLTFHREAAGDLAgEITGVTDGAGREFRLVLTTQAQRAEEARKQ 262
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  583 H-----------------------------------------------PEGEPEL-LRRYRYDEAGRLNGVVDNAGQYQR 614
Cdd:NF041261  263 RtsslsspdgprplsssafpdtlpggteygpdngirlsavwlthdpayPESLPAApLVRYTYTEAGELLAVYDRSNTQVR 342
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  615 EFAYDDNDC--MTMHREPGGERYYYTWawfegpdDAAWRVTGHHTDSGEQYRLDWNlaERSLCVTDSLGRTRC-HWWDAQ 691
Cdd:NF041261  343 AFTYDAQHPgrMVAHRYAGRPEMCYRY-------DDTGRVTEQLNPAGLSYRYQYE--QDRITITDSLNRREVlHTEGEG 413
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  692 GLVTAYRDE-AGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRL-GHLTETHDPLGRvEQTQWHPVWHQPETEVDAAGAA 769
Cdd:NF041261  414 GLKRVVKKEhADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVsGDITDITTPDGR-ETKFYYNDGNQLTSVTSPDGLE 492
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  770 WRYEYDERGNLQAVSDPLHQRTVYGYDR-HGQV-VRITDARGGDKYLQWNEDGQLMRHTDCSGSQTAWFYDERTRLERVT 847
Cdd:NF041261  493 SRREYDEPGRLVSETSRSGETTRYRYDDpHSELpATTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVH 572
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  848 DAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDAtGRRTAYEYDAYGRL 927
Cdd:NF041261  573 REEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRI 651
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  928 TTLTNENGESYRFRYDVLDRLTEQTDPGGSRRVYGYNalnaVTAVIYGGERGGEIRHgLERDAAGRLTAK-ITPETRTKY 1006
Cdd:NF041261  652 TTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYD----LTGKLTQSEDEGLVTL-WHYDESDRITHRtVNGEPAEQW 726
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1007 RYDAADRLLEIRRRQHdaaegGEPEVIRFSYDSAGNLLSE-------ETAQGVLQHR----YDVQG--NRtetQMPDG-R 1072
Cdd:NF041261  727 QYDEHGWLTDISHLSE-----GHRVAVHYGYDDKGRLTGErqtvenpETGELLWQHEtghaYNEQGlaNR---VTPDSlP 798
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1073 TLRYLYYGSGHLQQINLGRDVISEFTRDHLHREVQRSQGRLDTRRMYDRTGRLTR--KLTCKGMRGVVpetfIDREYAYS 1150
Cdd:NF041261  799 PVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGGAGSNAAYELTTAYTPagQLQSQHLNSLV----YDRDYTWN 874
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1151 GQDELLKKRHSRQgVTDYFYDTTGRITACRNEAY-LD---SWQYDAAANLLDrrQGETAQAGAGSVVPFNRITSYRGLHY 1226
Cdd:NF041261  875 DNGDLVRISGPRQ-TREYGYSATGRLTGVHTTAAnLDiriPYATDPAGNRLP--DPELHPDSTLTAWPDNRIAEDAHYVY 951
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1227 RYDEYGRVVEKRGR----------NGTQHYRWDAEHRLteVAVTR---GSTVRRYGYVYDAPGRRVEK----HELDAEG- 1288
Cdd:NF041261  952 RYDEYGRLTEKTDRipegvirtddERTHHYHYDSQHRL--VFYTRiqhGEPLVESRYLYDPLGRRMAKrvwrRERDLTGw 1029
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1289 -----KPyNRTTFLWDGMRLAQ-ECRLGRSSSLYiysDQGSHEPLARVDRAA---------------------------- 1334
Cdd:NF041261 1030 mslsrKP-EVTWYGWDGDRLTTvQTDTTRIQTVY---QPGSFTPLIRVETENgerakaqrrslaetlqqegsenghgvvf 1105
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1335 PGE---------------------------------------------ADEVLYYHTDVNGAPEEMTDGGGNIVWEAGYQ 1369
Cdd:NF041261 1106 PAElvrmldrleeeiradrvseesrawlaqcgltveqmarqvepeytpARKLHLYHCDHRGLPLALISEEGNTAWQGEYD 1185
                        1210      1220      1230      1240      1250      1260      1270
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 446425364 1370 VWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISLKGGINLYAYAPNPLSWIDPLGL 1445
Cdd:NF041261 1186 EWGNLLNEENPHHLQQPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
CdiA-CT_Ec-like cd20692
C-terminal (CT) domain of the contact-dependent growth inhibition (CDI) system (CdiA-CT) ...
1444-1537 7.73e-39

C-terminal (CT) domain of the contact-dependent growth inhibition (CDI) system (CdiA-CT) protein CdiA of Escherichia coli A0 34/86, and similar proteins; CDI toxins are expressed by gram-negative bacteria as part of a mechanism to inhibit the growth of neighboring cells. CdiA secretion is dependent on the outer membrane protein CdiB. Upon binding to a receptor on the surface of target bacteria, the CDI toxin is delivered via the C-terminal domain. A wide variety of C-terminal toxin domains appear to exist; this particular model contains the C-terminal (CT) domain of Escherichia coli A0 34/86 CdiA. The activity of this E. coli CdiA-CT is as yet unknown. CDI(+) bacteria also produce a CDI immunity protein (CdiI) that specifically neutralizes the CdiA-CT toxins to prevent auto-inhibition. This CdiA-CT binds its cognate CdiI with high affinity.



Pssm-ID: 411005  Cd Length: 99  Bit Score: 140.01  E-value: 7.73e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1444 GLKCGSSYEQARNKALKWLEERGFKAERVNIGKFGSTRGKPVGMTTADGKTGFRIEYDERSGAHINVFSGKDKGE---HF 1520
Cdd:cd20692     3 ILPKFKSYEQARNKALELLGDLGFKDSKPYIGRLGTGYGKVIGRQSADGKKGWRLDYDPEKGAHINVWDGKGDKAkkpAI 82
                          90
                  ....*....|....*..
gi 446425364 1521 LFDASESIVTKLQKLFD 1537
Cdd:cd20692    83 PFEGTEKTVKKLLKRLN 99
PAAR_like super family cl21497
proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR ...
266-342 4.88e-10

proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat superfamily, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. The PAAR-repeat proteins form a diverse superfamily with several subgroups extended both N- and C-terminally by domains with various predicted functions; the termini are exposed to solution, and do not distort the VgrG binding site. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


The actual alignment was detected with superfamily member cd14742:

Pssm-ID: 451275  Cd Length: 86  Bit Score: 57.60  E-value: 4.88e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 446425364  266 DVLETGFQAASALIGSVSnlfkGDDEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVVDEpengvhvSGDVRIG 342
Cdd:cd14742    21 NVFINGKPAARAADSTVA----CSKHPPPPQLIAEGSETVFINGQPAARKGDKTTCSAVISEG-------SPNVFIG 86
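Each hit above carries a one-line summary of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...". As a minimal sketch, assuming that layout holds for every hit, the line can be parsed into typed fields as follows. The names SUMMARY_RE and parse_summary are hypothetical helpers introduced here for illustration; they are not part of any NCBI tool.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears on this page.
# The "[Multi-domain]" flag is optional; it is absent for single-domain models.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    # Summary line copied from the RHS_core hit listed above.
    hit = parse_summary(
        "Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  "
        "Bit Score: 380.12  E-value: 4.53e-110"
    )
    print(hit)
```

Parsing the E-value as a float keeps hits sortable by significance; lines that do not match return None rather than raising, so the function can be run over every line of the page.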
 
Name Accession Description Interval E-value
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
594-1517 1.52e-47

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 186.50  E-value: 1.52e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  594 YRYDEAGRLNGVVDNAGQYQREFAYDDNDCMTMHREPGGERYYYTWAWFEGPDDAAWRVTGHHTDSGEQYRLDWNLAERS 673
Cdd:COG3209   319 GTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGS 398
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  674 LCVTDSLGRTRCHWWDAQGLVTAYRDEAGQMTTFRWSDEERLLLGMTDAQ--GGKWRYVYDRLGHLTETHDPLGRVEQTQ 751
Cdd:COG3209   399 STTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDAttTTGGAGASGTLTTTGGAATGATTGGGTE 478
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  752 WHPVWHQPETEVDAAGAAWRYEYDERGNLQAVSDPLHQRTVYGYDRHGQVVRITDARGGDKYLQWNEDGQLMRHTDCSGS 831
Cdd:COG3209   479 AGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGT 558
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  832 QTAWFYDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQTD 911
Cdd:COG3209   559 STGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTG 638
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  912 ATGRRTAYEYDAYGRLTTLTNENGESYRFRYDVLDRLTEQTDPGGSRRVYGYNALNAVTAVIYGGERGGEIRHGleRDAA 991
Cdd:COG3209   639 STTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLA--GGTT 716
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  992 GRLTAKITPETRTKYRYDAADRLLEIRRRQHDAAEGGEPEVIRFSYDSAGNLLSEETAQGV------LQHRYDVQGNRTE 1065
Cdd:COG3209   717 TRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVtqgtytTRYTYDALGRLTS 796
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1066 TQMPDGRTLRYLYYGSGHLQQInlgrdviseftrdhLHREVQRSQGRLDTRRMYDRTGRLTRKltckgmrgvvpetfidR 1145
Cdd:COG3209   797 VTYPDGETVTYTYDALGRLTSV--------------ITVGSGGGTDLQDRTYTYDAAGNITSI----------------T 846
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1146 EYAYSGQDellkkrhsrqgVTDYFYDTTGRITACRNEAYLDSWQYDAAANLLdrrqgetaqagagsvvpfnritsyrglh 1225
Cdd:COG3209   847 DALRAGTL-----------TQTYTYDALGRLTSATDPGTTESYTYDANGNLT---------------------------- 887
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1226 yrydeygrvveKRGRNGTQHYRWDAEHRLTEVAVTRGSTVRrygYVYDAPGrrvekheldaegkpynrttflwdgmrlaq 1305
Cdd:COG3209   888 -----------SRTDGGTTTYTYDALGRLVSVTKPDGTTTT---YTYDALG----------------------------- 924
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1306 ecrlgrssslyiysdqgsheplarvdraapgeadevlyyHTDVNGAPEEMTDGGGNIVWEAGYQVWGNLTHEKETrPVQQ 1385
Cdd:COG3209   925 ---------------------------------------HTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSG-AAAN 964
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1386 NLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISLKGGINLYAYA-PNPLSWIDPLGLKCGSSYEQARNKALKWLEE 1464
Cdd:COG3209   965 PLRFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAALLGTTGLGGGAGVGAGA 1044
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|...
gi 446425364 1465 RGFKAERVNIGKFGSTRGKPVGMTTADGKTGFRIEYDERSGAHINVFSGKDKG 1517
Cdd:COG3209  1045 AGGGAAAAGGSAGAGAAGGGAGGAGAGGAGGGAGAGAGAAAGAAGGAGGGAGA 1097
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1368-1445 3.89e-32

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 119.91  E-value: 3.89e-32
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 446425364  1368 YQVWGNLTHEKEtrPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISLKGGINLYAYAP-NPLSWIDPLGL 1445
Cdd:TIGR03696    1 YDPYGEVLSESG--AAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
401-478 1.44e-20

Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.


Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 87.20  E-value: 1.44e-20
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 446425364   401 PVNAATGAKYLagdDDVDFSLPGHFPLEWQRTYSSRDERTeGMFGRGWSVLYEVCLERTpdnpDENCMTYVAPMGRRI 478
Cdd:pfam20148    3 PVNVATGNKVL---EETDFSLPGPLPLVWTRTYNSSSERD-GPLGPGWSHPYDQRLELE----GDGGVVYIDADGREV 72
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
266-342 4.88e-10

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminus, and domains with various predicted functions in the N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269827  Cd Length: 86  Bit Score: 57.60  E-value: 4.88e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 446425364  266 DVLETGFQAASALIGSVSnlfkGDDEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVVDEpengvhvSGDVRIG 342
Cdd:cd14742    21 NVFINGKPAARAADSTVA----CSKHPPPPQLIAEGSETVFINGQPAARKGDKTTCSAVISEG-------SPNVFIG 86
Ntox47 pfam15540
Bacterial toxin 47; A predicted RNase toxin found in bacterial polymorphic toxin systems that ...
1430-1510 1.25e-04

Bacterial toxin 47; A predicted RNase toxin found in bacterial polymorphic toxin systems that is proposed to adopt the BECR (Barnase-EndoU-ColicinE5/D-RelE) fold, and contains two conserved aspartates, a glutamate, a histidine and an arginine residue and an RT motif. In bacterial polymorphic toxin systems, the toxin is usually exported by the type 2, type 6 or type 7 secretion system.


Pssm-ID: 406082  Cd Length: 111  Bit Score: 43.04  E-value: 1.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  1430 YAYAPNPLSWIDPLGLkcGSSYEQARNKALKwleERGFKAERVNIGKFGStrgkpvgmtTADGKTgFRIEYDERSGAHIN 1509
Cdd:pfam15540    1 YQYKFNPIRDIDPRGL--GIEYQSALDEAFR---RTGVPKEDFTVTKWGK---------DVDGKS-TPVEFKGPNGAKVN 65

                   .
gi 446425364  1510 V 1510
Cdd:pfam15540   66 Y 66
PAAR_motif pfam05488
PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. ...
274-326 6.05e-04

PAAR motif; This motif is usually found in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in another family of hypothetical proteins.


Pssm-ID: 428491  Cd Length: 71  Bit Score: 39.86  E-value: 6.05e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 446425364   274 AASALIGSVSNLFKGD----DEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVV 326
Cdd:pfam05488   15 SPTVLIGGKPAARVGDlvvcPPCGGGGPIAEGSPTVLINGKPAAREGDKTACGATLI 71
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
298-343 2.69e-03

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 38.26  E-value: 2.69e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 446425364  298 IAEGTRDVRINSQPAARSGVRCTCEAKVVDepenGvhvSGDVRIGG 343
Cdd:COG4104    49 IAEGSPTVLINGKPAARVGDKTACGGTIIS----G---SPTVLIGG 87
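The alignment blocks above pair a query row ("gi ...") with a domain-model row ("Cdd:..."), each carrying a start coordinate, a gapped sequence, and an end coordinate. The sketch below, which rests on several assumptions, shows one way to read such a pair and derive two rough statistics: the fraction of the model covered by the hit, and percent identity over aligned columns. It assumes that lowercase letters mark unaligned residues and "-" marks gaps, so only columns where both rows are uppercase are counted. The function names are hypothetical and the identity figure is not a value reported by CD-Search.

```python
import re

# Hypothetical pattern for one alignment row: label, start, gapped sequence, end.
ROW_RE = re.compile(
    r"^(?P<label>gi\s+\d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)$"
)


def parse_row(line):
    """Split an alignment row into (label, start, sequence, end)."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError("not an alignment row: %r" % line)
    return m.group("label"), int(m.group("start")), m.group("seq"), int(m.group("end"))


def percent_identity(query, subject):
    """Percent identity over columns where both rows are uppercase residues.

    Assumption: lowercase letters are unaligned residues and '-' is a gap,
    so both are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("rows must have the same number of columns")
    aligned = identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


def model_coverage(start, end, cd_length):
    """Fraction of the domain model (Cd Length) spanned by the hit."""
    return (end - start + 1) / cd_length


if __name__ == "__main__":
    # Rows and Cd Length copied from the COG4104 (PAAR) hit above.
    _, _, q_seq, _ = parse_row(
        "gi 446425364  298 IAEGTRDVRINSQPAARSGVRCTCEAKVVDepenGvhvSGDVRIGG 343"
    )
    _, s_start, s_seq, s_end = parse_row(
        "Cdd:COG4104    49 IAEGSPTVLINGKPAARVGDKTACGGTIIS----G---SPTVLIGG 87"
    )
    print("identity over aligned columns: %.1f%%" % percent_identity(q_seq, s_seq))
    print("model coverage: %.2f" % model_coverage(s_start, s_end, 87))
```

Model coverage helps separate partial hits such as this one, which spans only model positions 49 to 87 of 87, from hits that match a model end to end. For alignments that wrap across several blocks, the sequence strings of successive rows would need to be concatenated before calling percent_identity.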
 
gi 446425364  848 DAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDAtGRRTAYEYDAYGRL 927
Cdd:NF041261  573 REEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRI 651
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  928 TTLTNENGESYRFRYDVLDRLTEQTDPGGSRRVYGYNalnaVTAVIYGGERGGEIRHgLERDAAGRLTAK-ITPETRTKY 1006
Cdd:NF041261  652 TTLTNENGSHSTFLYDALDRLVQQRGFDGRTQRYHYD----LTGKLTQSEDEGLVTL-WHYDESDRITHRtVNGEPAEQW 726
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1007 RYDAADRLLEIRRRQHdaaegGEPEVIRFSYDSAGNLLSE-------ETAQGVLQHR----YDVQG--NRtetQMPDG-R 1072
Cdd:NF041261  727 QYDEHGWLTDISHLSE-----GHRVAVHYGYDDKGRLTGErqtvenpETGELLWQHEtghaYNEQGlaNR---VTPDSlP 798
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1073 TLRYLYYGSGHLQQINLGRDVISEFTRDHLHREVQRSQGRLDTRRMYDRTGRLTR--KLTCKGMRGVVpetfIDREYAYS 1150
Cdd:NF041261  799 PVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGGAGSNAAYELTTAYTPagQLQSQHLNSLV----YDRDYTWN 874
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1151 GQDELLKKRHSRQgVTDYFYDTTGRITACRNEAY-LD---SWQYDAAANLLDrrQGETAQAGAGSVVPFNRITSYRGLHY 1226
Cdd:NF041261  875 DNGDLVRISGPRQ-TREYGYSATGRLTGVHTTAAnLDiriPYATDPAGNRLP--DPELHPDSTLTAWPDNRIAEDAHYVY 951
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1227 RYDEYGRVVEKRGR----------NGTQHYRWDAEHRLteVAVTR---GSTVRRYGYVYDAPGRRVEK----HELDAEG- 1288
Cdd:NF041261  952 RYDEYGRLTEKTDRipegvirtddERTHHYHYDSQHRL--VFYTRiqhGEPLVESRYLYDPLGRRMAKrvwrRERDLTGw 1029
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1289 -----KPyNRTTFLWDGMRLAQ-ECRLGRSSSLYiysDQGSHEPLARVDRAA---------------------------- 1334
Cdd:NF041261 1030 mslsrKP-EVTWYGWDGDRLTTvQTDTTRIQTVY---QPGSFTPLIRVETENgerakaqrrslaetlqqegsenghgvvf 1105
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1335 PGE---------------------------------------------ADEVLYYHTDVNGAPEEMTDGGGNIVWEAGYQ 1369
Cdd:NF041261 1106 PAElvrmldrleeeiradrvseesrawlaqcgltveqmarqvepeytpARKLHLYHCDHRGLPLALISEEGNTAWQGEYD 1185
                        1210      1220      1230      1240      1250      1260      1270
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 446425364 1370 VWGNLTHEKETRPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISLKGGINLYAYAPNPLSWIDPLGL 1445
Cdd:NF041261 1186 EWGNLLNEENPHHLQQPYRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
594-1517 1.52e-47

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 186.50  E-value: 1.52e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  594 YRYDEAGRLNGVVDNAGQYQREFAYDDNDCMTMHREPGGERYYYTWAWFEGPDDAAWRVTGHHTDSGEQYRLDWNLAERS 673
Cdd:COG3209   319 GTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGS 398
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  674 LCVTDSLGRTRCHWWDAQGLVTAYRDEAGQMTTFRWSDEERLLLGMTDAQ--GGKWRYVYDRLGHLTETHDPLGRVEQTQ 751
Cdd:COG3209   399 STTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDAttTTGGAGASGTLTTTGGAATGATTGGGTE 478
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  752 WHPVWHQPETEVDAAGAAWRYEYDERGNLQAVSDPLHQRTVYGYDRHGQVVRITDARGGDKYLQWNEDGQLMRHTDCSGS 831
Cdd:COG3209   479 AGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGT 558
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  832 QTAWFYDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQRDGQGRVRRQTD 911
Cdd:COG3209   559 STGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTG 638
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  912 ATGRRTAYEYDAYGRLTTLTNENGESYRFRYDVLDRLTEQTDPGGSRRVYGYNALNAVTAVIYGGERGGEIRHGleRDAA 991
Cdd:COG3209   639 STTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLA--GGTT 716
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  992 GRLTAKITPETRTKYRYDAADRLLEIRRRQHDAAEGGEPEVIRFSYDSAGNLLSEETAQGV------LQHRYDVQGNRTE 1065
Cdd:COG3209   717 TRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVtqgtytTRYTYDALGRLTS 796
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1066 TQMPDGRTLRYLYYGSGHLQQInlgrdviseftrdhLHREVQRSQGRLDTRRMYDRTGRLTRKltckgmrgvvpetfidR 1145
Cdd:COG3209   797 VTYPDGETVTYTYDALGRLTSV--------------ITVGSGGGTDLQDRTYTYDAAGNITSI----------------T 846
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1146 EYAYSGQDellkkrhsrqgVTDYFYDTTGRITACRNEAYLDSWQYDAAANLLdrrqgetaqagagsvvpfnritsyrglh 1225
Cdd:COG3209   847 DALRAGTL-----------TQTYTYDALGRLTSATDPGTTESYTYDANGNLT---------------------------- 887
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1226 yrydeygrvveKRGRNGTQHYRWDAEHRLTEVAVTRGSTVRrygYVYDAPGrrvekheldaegkpynrttflwdgmrlaq 1305
Cdd:COG3209   888 -----------SRTDGGTTTYTYDALGRLVSVTKPDGTTTT---YTYDALG----------------------------- 924
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1306 ecrlgrssslyiysdqgsheplarvdraapgeadevlyyHTDVNGAPEEMTDGGGNIVWEAGYQVWGNLTHEKETrPVQQ 1385
Cdd:COG3209   925 ---------------------------------------HTDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSG-AAAN 964
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1386 NLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISLKGGINLYAYA-PNPLSWIDPLGLKCGSSYEQARNKALKWLEE 1464
Cdd:COG3209   965 PLRFTGQEYDAETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAALLGTTGLGGGAGVGAGA 1044
                         890       900       910       920       930
                  ....*....|....*....|....*....|....*....|....*....|...
gi 446425364 1465 RGFKAERVNIGKFGSTRGKPVGMTTADGKTGFRIEYDERSGAHINVFSGKDKG 1517
Cdd:COG3209  1045 AGGGAAAAGGSAGAGAAGGGAGGAGAGGAGGGAGAGAGAAAGAAGGAGGGAGA 1097
CdiA-CT_Ec-like cd20692
C-terminal (CT) domain of the contact-dependent growth inhibition (CDI) system (CdiA-CT) ...
1444-1537 7.73e-39

C-terminal (CT) domain of the contact-dependent growth inhibition (CDI) system (CdiA-CT) protein CdiA of Escherichia coli A0 34/86, and similar proteins; CDI toxins are expressed by Gram-negative bacteria as part of a mechanism to inhibit the growth of neighboring cells. CdiA secretion is dependent on the outer membrane protein CdiB. Upon binding to a receptor on the surface of target bacteria, the CDI toxin is delivered via the C-terminal domain. A wide variety of C-terminal toxin domains appear to exist; this particular model contains the C-terminal (CT) domain of Escherichia coli A0 34/86 CdiA. The activity of this E. coli CdiA-CT is as yet unknown. CDI(+) bacteria also produce a CDI immunity protein (CdiI) that specifically neutralizes the CdiA-CT toxins and so prevents auto-inhibition. This CdiA-CT binds its cognate CdiI with high affinity.


Pssm-ID: 411005  Cd Length: 99  Bit Score: 140.01  E-value: 7.73e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364 1444 GLKCGSSYEQARNKALKWLEERGFKAERVNIGKFGSTRGKPVGMTTADGKTGFRIEYDERSGAHINVFSGKDKGE---HF 1520
Cdd:cd20692     3 ILPKFKSYEQARNKALELLGDLGFKDSKPYIGRLGTGYGKVIGRQSADGKKGWRLDYDPEKGAHINVWDGKGDKAkkpAI 82
                          90
                  ....*....|....*..
gi 446425364 1521 LFDASESIVTKLQKLFD 1537
Cdd:cd20692    83 PFEGTEKTVKKLLKRLN 99
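Each hit in this report carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a rough sketch of how such a line could be turned into a record, the snippet below uses a regular expression; the pattern and the function name parse_summary are assumptions based only on the lines visible in this report, not an official NCBI format specification.

```python
import re

# Assumed layout, inferred from the summary lines in this report:
#   Pssm-ID: <int> [optional flag]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)

def parse_summary(line: str) -> dict:
    """Parse one hit summary line into typed fields."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "evalue": float(m.group("evalue")),
    }

# Two summary lines copied from this report.
cdia = parse_summary(
    "Pssm-ID: 411005  Cd Length: 99  Bit Score: 140.01  E-value: 7.73e-39"
)
rhs_core = parse_summary(
    "Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 380.12  E-value: 4.53e-110"
)
print(cdia)
print(rhs_core)
```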
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1368-1445 3.89e-32

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 119.91  E-value: 3.89e-32
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 446425364  1368 YQVWGNLTHEKEtrPVQQNLRFQGQYLDRETGLHYNLYRFYDPDIGKFISGDPISLKGGINLYAYAP-NPLSWIDPLGL 1445
Cdd:TIGR03696    1 YDPYGEVLSESG--AAPNPLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
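The model row above runs from position 1 to 77 of a model whose reported Cd Length is 77, which suggests the hit spans the whole domain model. A small sketch of that calculation follows, assuming "Cd Length" is the length of the domain model in residues; model_coverage is a hypothetical helper name.

```python
def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the alignment (1-based, inclusive)."""
    if not 1 <= model_start <= model_end <= cd_length:
        raise ValueError("model coordinates must lie within 1..cd_length")
    return (model_end - model_start + 1) / cd_length

# TIGR03696 hit in this report: model positions 1-77, Cd Length 77.
full = model_coverage(1, 77, 77)
# pfam03527 hit in this report: model positions 3-37, Cd Length 38.
partial = model_coverage(3, 37, 38)
print(f"TIGR03696 coverage: {full:.2f}")
print(f"pfam03527 coverage: {partial:.2f}")
```

The span is measured from the first to the last model position shown, so internal gaps in the model row are not subtracted.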
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
401-478 1.44e-20



Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 87.20  E-value: 1.44e-20
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 446425364   401 PVNAATGAKYLagdDDVDFSLPGHFPLEWQRTYSSRDERTeGMFGRGWSVLYEVCLERTpdnpDENCMTYVAPMGRRI 478
Cdd:pfam20148    3 PVNVATGNKVL---EETDFSLPGPLPLVWTRTYNSSSERD-GPLGPGWSHPYDQRLELE----GDGGVVYIDADGREV 72
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
593-1008 2.83e-20

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 98.29  E-value: 2.83e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  593 RYRYDEAGRLNGVVDNAGQYQREFAYDDNDCMTMHREPGGERYYYTWAWFEGPDDAAWRVTGHHTDSGEQYRLDWNLAER 672
Cdd:COG3209   554 VGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERA 633
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  673 SLCVTDSLGRTRCHWWDAQGLVTAYRDEAGQMTTFRWSDEERLLLGMTDAQGGKWRYVYDRLGHLTETHDPLGRVEQTQW 752
Cdd:COG3209   634 TASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAG 713
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  753 HPVWHQPETEVDAAGAAWRYEYDERGNLQAVSDPLHQRT------VYGYDRHGQVVRITDARGgdkylqwnedgqlmrhT 826
Cdd:COG3209   714 GTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTttagalTYTYDALGRLTSETTPGG----------------V 777
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  827 DCSGSQTAWFYDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTE------RYQPDAAGRLVKYTSPAGQITRWQR 900
Cdd:COG3209   778 TQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGtdlqdrTYTYDAAGNITSITDALRAGTLTQT 857
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  901 ---DGQGRVRRQTDATGRRTaYEYDAYGRLTTLTNENGESYrfRYDVLDRLTEQTDPGGSRRVYGYNALN------AVTA 971
Cdd:COG3209   858 ytyDALGRLTSATDPGTTES-YTYDANGNLTSRTDGGTTTY--TYDALGRLVSVTKPDGTTTTYTYDALGhtdhlgSVRA 934
                         410       420       430
                  ....*....|....*....|....*....|....*..
gi 446425364  972 VIyggERGGEIRHGLERDAAGRLTAKITPETRTKYRY 1008
Cdd:COG3209   935 LT---DASGQVVWRYDYDPFGNLLAETSGAAANPLRF 968
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
564-942 1.10e-19

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 96.36  E-value: 1.10e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  564 RLHYELAAHPQRVTRIFRHHPEGEPELLRRYRYDEAGRLNGVVDNAGQYQREFAYDDNDCMTMHREPGGERYYYTWAWFE 643
Cdd:COG3209   595 TTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLT 674
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  644 GPDDAAWRVTGHHTDSGEQYRLDWNLAERSLCVTDSLGRTRCHWWDAQGLVTAYRDEAGQMTTFRWSDEERLLLGMTDAQ 723
Cdd:COG3209   675 TLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTT 754
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  724 GGKWRYVYDRLGHLTETHDPLGRVEQTQWHpvwhqpetevdaagaawRYEYDERGNLQAVSDPLHQRTVYGYDRHGQVVR 803
Cdd:COG3209   755 AGALTYTYDALGRLTSETTPGGVTQGTYTT-----------------RYTYDALGRLTSVTYPDGETVTYTYDALGRLTS 817
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  804 ITDARGGDKY------LQWNEDGQLMRHTD---CSGSQTAWFYDERTRLERVTDAeSNSTRYSYDGNGHLTEVMFADGRT 874
Cdd:COG3209   818 VITVGSGGGTdlqdrtYTYDAAGNITSITDalrAGTLTQTYTYDALGRLTSATDP-GTTESYTYDANGNLTSRTDGGTTT 896
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 446425364  875 ERYqpDAAGRLVKYTSPAGQITRWQRDG------QGRVRRQTDATGRRTA-YEYDAYGRLTTLTNENGEsYRFRY 942
Cdd:COG3209   897 YTY--DALGRLVSVTKPDGTTTTYTYDAlghtdhLGSVRALTDASGQVVWrYDYDPFGNLLAETSGAAA-NPLRF 968
RHS pfam03527
RHS protein;
1343-1377 1.77e-11



Pssm-ID: 427349 [Multi-domain]  Cd Length: 38  Bit Score: 60.01  E-value: 1.77e-11
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 446425364  1343 YYHTDVNGAPEEMTDGGGNIVWEAGYQVWGNLTHE 1377
Cdd:pfam03527    3 YYHTDHLGTPEELTDEAGEIVWSAEYDAWGNVTEE 37
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
266-342 4.88e-10

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminus, and various predicted functions at N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269827  Cd Length: 86  Bit Score: 57.60  E-value: 4.88e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 446425364  266 DVLETGFQAASALIGSVSnlfkGDDEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVVDEpengvhvSGDVRIG 342
Cdd:cd14742    21 NVFINGKPAARAADSTVA----CSKHPPPPQLIAEGSETVFINGQPAARKGDKTTCSAVISEG-------SPNVFIG 86
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
921-957 2.72e-08

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 51.06  E-value: 2.72e-08
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425364   921 YDAYGRLTTLTNENGESYRFRYDVLDRLTEQTDPGGS 957
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
901-936 6.07e-08

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 49.90  E-value: 6.07e-08
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 446425364   901 DGQGRVRRQTDATGRRTAYEYDAYGRLTTLTNENGE 936
Cdd:pfam05593    2 DAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
880-920 3.30e-07

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 47.97  E-value: 3.30e-07
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 446425364   880 DAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDATGRRTAYE 920
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
858-894 5.29e-07

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 47.21  E-value: 5.29e-07
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425364   858 YDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQ 894
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
921-959 4.48e-06

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 44.89  E-value: 4.48e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 446425364   921 YDAYGRLTTLTNENGESYRFRYDVLDRLTEQTDPGGSRR 959
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGST 39
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
774-809 4.53e-06

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 44.90  E-value: 4.53e-06
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 446425364   774 YDERGNLQAVSDPLHQRTVYGYDRHGQVVRITDARG 809
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
880-915 2.57e-05

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 42.59  E-value: 2.57e-05
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 446425364   880 DAAGRLVKYTSPAGQITRWQRDGQGRVRRQTDATGR 915
Cdd:pfam05593    2 DAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
837-878 7.43e-05

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 41.42  E-value: 7.43e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 446425364   837 YDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGRTERYQ 878
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
RhsA COG3209
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ...
511-785 8.07e-05

Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 47.44  E-value: 8.07e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  511 DDGVYRLFEADPFSPQRRRLKMLGDRNSNCQHLTYDNHGRLVEISGDRQRPC----IRLHYELAAHPQRVTrifrhHPEG 586
Cdd:COG3209   728 GGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVTQgtytTRYTYDALGRLTSVT-----YPDG 802
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  587 EpelLRRYRYDEAGRLNGVVDNAGQ-----YQREFAYDDND---CMTMHREPGGERYYYTWawfegpdDAAWRVTGHHTD 658
Cdd:COG3209   803 E---TVTYTYDALGRLTSVITVGSGggtdlQDRTYTYDAAGnitSITDALRAGTLTQTYTY-------DALGRLTSATDP 872
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  659 SG-EQYRldwnlaerslcvtdslgrtrchwWDAQGLVTayRDEAGQMTTFRWsDEERLLLGMTDAQGGKWRYVYDRLGHl 737
Cdd:COG3209   873 GTtESYT-----------------------YDANGNLT--SRTDGGTTTYTY-DALGRLVSVTKPDGTTTTYTYDALGH- 925
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*....
gi 446425364  738 tetHDPLGrveqtqwhpvwhQPETEVDAAGA-AWRYEYDERGNLQAVSD 785
Cdd:COG3209   926 ---TDHLG------------SVRALTDASGQvVWRYDYDPFGNLLAETS 959
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
837-873 1.17e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 40.66  E-value: 1.17e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425364   837 YDERTRLERVTDAESNSTRYSYDGNGHLTEVMFADGR 873
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Ntox47 pfam15540
Bacterial toxin 47; A predicted RNase toxin found in bacterial polymorphic toxin systems that ...
1430-1510 1.25e-04

Bacterial toxin 47; A predicted RNase toxin found in bacterial polymorphic toxin systems that is proposed to adopt the BECR (Barnase-EndoU-ColicinE5/D-RelE) fold, and contains two conserved aspartates, a glutamate, a histidine and an arginine residue and an RT motif. In bacterial polymorphic toxin systems, the toxin is usually exported by the type 2, type 6 or type 7 secretion system.


Pssm-ID: 406082  Cd Length: 111  Bit Score: 43.04  E-value: 1.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446425364  1430 YAYAPNPLSWIDPLGLkcGSSYEQARNKALKwleERGFKAERVNIGKFGStrgkpvgmtTADGKTgFRIEYDERSGAHIN 1509
Cdd:pfam15540    1 YQYKFNPIRDIDPRGL--GIEYQSALDEAFR---RTGVPKEDFTVTKWGK---------DVDGKS-TPVEFKGPNGAKVN 65

                   .
gi 446425364  1510 V 1510
Cdd:pfam15540   66 Y 66
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
816-850 4.58e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.12  E-value: 4.58e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 446425364   816 WNEDGQLMRHTDCSGSQTAWFYDERTRLERVTDAE 850
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPD 35
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
942-976 5.47e-04

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 38.73  E-value: 5.47e-04
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 446425364   942 YDVLDRLTEQTDPGGSRRVYGYNALNAVTAVIYGG 976
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPD 35
PAAR_motif pfam05488
PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. ...
274-326 6.05e-04

PAAR motif; This motif is usually found in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in another family of hypothetical proteins.


Pssm-ID: 428491  Cd Length: 71  Bit Score: 39.86  E-value: 6.05e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 446425364   274 AASALIGSVSNLFKGD----DEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVV 326
Cdd:pfam05488   15 SPTVLIGGKPAARVGDlvvcPPCGGGGPIAEGSPTVLINGKPAAREGDKTACGATLI 71
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
647-684 7.21e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.73  E-value: 7.21e-04
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 446425364   647 DAAWRVTGHHTDSGEQYRLDWNLAERSLCVTDSLGRTR 684
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGST 39
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
858-899 7.50e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.73  E-value: 7.50e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 446425364   858 YDGNGHLTEVMFADGRTERYQPDAAGRLVKYTSPAGQITRWQ 899
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
774-810 8.44e-04

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.34  E-value: 8.44e-04
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425364   774 YDERGNLQAVSDPLHQRTVYGYDRHGQVVRITDARGG 810
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGG 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
715-746 1.07e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.96  E-value: 1.07e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 446425364   715 LLLGMTDAQGGKWRYVYDRLGHLTETHDPLGR 746
Cdd:pfam05593    6 RLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
1037-1072 1.46e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.58  E-value: 1.46e-03
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 446425364  1037 YDSAGNLLSEETAQG-VLQHRYDVQGNRTETQMPDGR 1072
Cdd:pfam05593    1 YDAAGRLTSVTDPDGrVTTYTYDAAGRLTAVTDPDGT 37
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
731-786 2.08e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.19  E-value: 2.08e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 446425364   731 YDRLGHLTETHDPLGRVeqtqwhpvwhqpetevdaagaaWRYEYDERGNLQAVSDP 786
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRV----------------------TTYTYDAAGRLTAVTDP 34
PAAR_like cd14671
proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR ...
290-326 2.65e-03

proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat superfamily, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. The PAAR-repeat proteins form a diverse superfamily with several subgroups extended both N- and C-terminally by domains with various predicted functions; the termini are exposed to solution, and do not distort the VgrG binding site. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269821  Cd Length: 77  Bit Score: 38.07  E-value: 2.65e-03
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 446425364  290 DEPPAAEYIAEGTRDVRINSQPAARSGVRCTCEAKVV 326
Cdd:cd14671    39 DHPGGGNAIVSGSGTVFINGKPAARVGDRTSCGGVIV 75
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
298-343 2.69e-03

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 38.26  E-value: 2.69e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 446425364  298 IAEGTRDVRINSQPAARSGVRCTCEAKVVDepenGvhvSGDVRIGG 343
Cdd:COG4104    49 IAEGSPTVLINGKPAARVGDKTACGGTIIS----G---SPTVLIGG 87
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
711-751 3.73e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 36.80  E-value: 3.73e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 446425364   711 DEERLLLGMTDAQGGKWRYVYDRLGHLTETHDPLGRVEQTQ 751
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
901-940 3.92e-03

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 36.41  E-value: 3.92e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 446425364   901 DGQGRVRRQTDATGRRTAYEYDAYGRLTTLTNENGESYRF 940
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
989-1018 5.52e-03

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.04  E-value: 5.52e-03
                           10        20        30
                   ....*....|....*....|....*....|.
gi 446425364   989 DAAGRLTAKITPE-TRTKYRYDAADRLLEIR 1018
Cdd:pfam05593    2 DAAGRLTSVTDPDgRVTTYTYDAAGRLTAVT 32
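
The pairwise alignments above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that counts identical residues between a query row and a model row. It assumes that lowercase letters and dashes mark unaligned or gapped columns, which is how the rows above appear to be laid out; the function name `identity` is hypothetical. The two rows are copied from the last RHS_repeat hit (query residues 989-1018).

```python
from typing import Tuple


def identity(query: str, model: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: a column counts as aligned only when both characters are
    uppercase letters; dashes and lowercase residues are skipped.
    """
    if len(query) != len(model):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, m in zip(query, model):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# Rows copied from the pfam05593 hit covering query residues 989-1018.
query_row = "DAAGRLTAKITPE-TRTKYRYDAADRLLEIR"
model_row = "DAAGRLTSVTDPDgRVTTYTYDAAGRLTAVT"

identical, aligned = identity(query_row, model_row)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

Percent identity computed this way is only a rough descriptor; the reported bit score and E-value come from the position-specific scoring model, not from simple identity.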
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
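
As a hedged illustration of how the per-hit statistics lines relate to the E-value threshold of 0.01, the sketch below parses lines of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." and keeps hits at or below the threshold. The names `HitStats` and `parse_stats` are hypothetical, the regular expression is fitted only to the lines shown on this page, and treating the threshold as inclusive is an assumption.

```python
import re
from typing import NamedTuple, Optional


class HitStats(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern fitted to the statistics lines shown on this page (an assumption,
# not an official format specification).
_STATS_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*([\d.eE+-]+)"
)


def parse_stats(line: str) -> Optional[HitStats]:
    """Parse one statistics line; return None if it does not match."""
    m = _STATS_RE.search(line)
    if m is None:
        return None
    return HitStats(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


E_VALUE_THRESHOLD = 0.01  # value listed under "Preset Options"

# Three statistics lines copied from the hit list above.
lines = [
    "Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 40.66  E-value: 1.17e-04",
    "Pssm-ID: 406082  Cd Length: 111  Bit Score: 43.04  E-value: 1.25e-04",
    "Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.04  E-value: 5.52e-03",
]

hits = [h for h in map(parse_stats, lines) if h is not None]
passing = [h for h in hits if h.e_value <= E_VALUE_THRESHOLD]
for h in passing:
    print(h.pssm_id, h.cd_length, h.bit_score, h.e_value)
```

Every hit listed on this page has an E-value below 0.01, so all three sample lines pass the filter.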
