Conserved domains on [gi|15829859|ref|NP_308632|]

type IV secretion protein Rhs [Escherichia coli O157:H7 str. Sakai]



List of domain hits

Name Accession Description Interval E-value
RHS_core super family cl49306
RHS element core protein;
607-1461 2.26e-50

RHS element core protein;


The actual alignment was detected with superfamily member NF041261:

Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 196.38  E-value: 2.26e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   607 YEYDA--ADRIIRWSDNDQTWSRFTYDAQGRCVTVTGAEGYyNATLDYGDGCTTVTDGKGIHRYYYDPDG----NILREE 680
Cdd:NF041261  344 FTYDAqhPGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGL-SYRYQYEQDRITITDSLNRREVLHTEGEgglkRVVKKE 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   681 APDGSTTTYEWDEFHHLLARHSPAGRVEKFEYNAAHGQLSRYTAADGADWQYCYDERGLLSNITAPAGQTWTQQCDERGL 760
Cdd:NF041261  423 HADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVSGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGR 502
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   761 PVSLVSPQGEETRLAYtpqgllsgifrqderrlgiEYDHHNWPETLTDVMgrehhteysghdlpvkmrgpgGQSVRLQWQ 840
Cdd:NF041261  503 LVSETSRSGETTRYRY-------------------DDPHSELPATTTDAT---------------------GSTKQMTWS 542
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   841 QHHKLSGLERAGTGAEGFRYDRHGNLLAYTDGNGVVWTMEYGPFDLPVARTDGEGHRWQYRYDKDTlQLTEVINPQGESY 920
Cdd:NF041261  543 RYGQLLAFTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAG-DLTAVITPDGNRS 621
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   921 LYILDNCGRVTEERDwGGVVCRYRYDADGLCTARV--NGLEETILYsrDAAGRLAEVITPEGKTQ-YAYDKSGRLT---- 993
Cdd:NF041261  622 ETQYDAWGKAVSTTQ-GGLTRSMEYDAAGRITTLTneNGSHSTFLY--DALDRLVQQRGFDGRTQrYHYDLTGKLTqsed 698
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   994 -GIFS------PDGTSQRT---------GYDERG---RVNVTTQGRR-AIEYHYPDEHTvirciLPPEDERDRHPD-GSL 1052
Cdd:NF041261  699 eGLVTlwhydeSDRITHRTvngepaeqwQYDEHGwltDISHLSEGHRvAVHYGYDDKGR-----LTGERQTVENPEtGEL 773
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1053 L---KTTYRYNAAGELTEVI---LPGDETLT------------------FSRDEAGREVLR-----HSNRGFACEQGWNA 1103
Cdd:NF041261  774 LwqhETGHAYNEQGLANRVTpdsLPPVEWLTygsgylagmklggtplveYTRDRLHRETVRsfggaGSNAAYELTTAYTP 853
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1104 AGQPVSQRAGlfpaeatwgglLPSLLREYRYDSAGNVSGVTSredyGREThREYRLDRNGQVTAVTASGTGLgygEGDET 1183
Cdd:NF041261  854 AGQLQSQHLN-----------SLVYDRDYTWNDNGDLVRISG----PRQT-REYGYSATGRLTGVHTTAANL---DIRIP 914
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1184 YGYDSCGYlKAQSAGRHRISGETdqYAAGHRLKQAGNTQYDYDAAGRMVSRTK-------HRDGYRpeTERFRWDSRDQL 1256
Cdd:NF041261  915 YATDPAGN-RLPDPELHPDSTLT--AWPDNRIAEDAHYVYRYDEYGRLTEKTDripegviRTDDER--THHYHYDSQHRL 989
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1257 TGY-RSAQGE---QWEYRHDASGRRTEKRCDRKKIRFT-------------YLWDGD----------------------- 1296
Cdd:NF041261  990 VFYtRIQHGEplvESRYLYDPLGRRMAKRVWRRERDLTgwmslsrkpevtwYGWDGDrlttvqtdttriqtvyqpgsftp 1069
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1297 -----------------SIAEI--------------------------REYRDDKLYSVRHLVFNGFELISQQFSrvRQP 1333
Cdd:NF041261 1070 lirvetengerakaqrrSLAETlqqegsenghgvvfpaelvrmldrleEEIRADRVSEESRAWLAQCGLTVEQMA--RQV 1147
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1334 HPSVAPQWVTRTNHAvsDLTGRPLMLFNSEGKTVWRpGQTSLWGLALslpadtdypdprGERDPEADPGLL-YAGQWQDA 1412
Cdd:NF041261 1148 EPEYTPARKLHLYHC--DHRGLPLALISEEGNTAWQ-GEYDEWGNLL------------NEENPHHLQQPYrLPGQQYDE 1212
                         970       980       990      1000
                  ....*....|....*....|....*....|....*....|....*....
gi 15829859  1413 ESGLCYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYVPNPCGYIDPLGL 1461
Cdd:NF041261 1213 ESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
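
The statistics line printed with each hit (Pssm-ID, Cd Length, Bit Score, E-value) has a regular layout, so it can be read programmatically. The following is a minimal Python sketch, assuming the line looks exactly as it does on this page; the name `parse_hit_stats` and the dictionary keys are illustrative and do not belong to any NCBI tool.

```python
import re

# Matches lines such as:
#   Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 196.38  E-value: 2.26e-50
# The bracketed tag is optional; some hits on this page do not carry it.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s+\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)

def parse_hit_stats(line):
    """Return the fields of one statistics line as a dict, or None if it does not match."""
    m = STATS_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "tag": m.group("tag"),              # e.g. "Multi-domain", or None when absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }

if __name__ == "__main__":
    line = "Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 196.38  E-value: 2.26e-50"
    print(parse_hit_stats(line))
```
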
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
252-305 2.36e-24

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminus, and various predicted functions at N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.



Pssm-ID: 269827  Cd Length: 86  Bit Score: 98.43  E-value: 2.36e-24
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 15829859  252 AGEDTALCDKENKPP-RIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIG 305
Cdd:cd14742   32 AADSTVACSKHPPPPqLIAEGSETVFINGQPAARKGDKTTCSAVISEGSPNVFIG 86
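
A pairwise block like the PAAR_RHS alignment above can be summarized by counting identical columns and gap columns. The sketch below is illustrative only: the function name `alignment_stats` is hypothetical, letters are compared case-insensitively (an assumption, since lowercase letters in this display may mark residues outside the aligned blocks), and percent identity is taken over all alignment columns, which is one of several possible conventions.

```python
def alignment_stats(query_row, model_row):
    """Count identities and gap columns between two equally long aligned rows."""
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have the same number of columns")
    identities = 0
    gap_columns = 0
    for q, m in zip(query_row, model_row):
        if q == "-" or m == "-":
            gap_columns += 1            # a dash in either row marks a gap column
        elif q.upper() == m.upper():
            identities += 1             # same residue, ignoring letter case
    columns = len(query_row)
    percent = 100.0 * identities / columns if columns else 0.0
    return {
        "columns": columns,
        "identities": identities,
        "gap_columns": gap_columns,
        "percent_identity": percent,
    }

# Rows copied from the PAAR_RHS (cd14742) alignment on this page.
QUERY = "AGEDTALCDKENKPP-RIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIG"
MODEL = "AADSTVACSKHPPPPqLIAEGSETVFINGQPAARKGDKTTCSAVISEGSPNVFIG"

if __name__ == "__main__":
    print(alignment_stats(QUERY, MODEL))
```
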
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
393-454 2.78e-13

Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.



Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 66.40  E-value: 2.78e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15829859    393 DPVDPVTGAYCDERTDFTLGQTLPLSFTRFHSSVLPLHGLTGVGWSDSWSEY---------AWVREQGNRV 454
Cdd:pfam20148    2 DPVNVATGNKVLEETDFSLPGPLPLVWTRTYNSSSERDGPLGPGWSHPYDQRlelegdggvVYIDADGREV 72
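
The numbers at the ends of the `Cdd:` row, read together with Cd Length, indicate how much of the domain model the hit spans. A minimal sketch, assuming both end positions are 1-based and inclusive as displayed; `model_coverage` is a hypothetical helper name, and the values are copied from the hits above.

```python
def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the aligned region (inclusive ends)."""
    if not (1 <= model_start <= model_end <= cd_length):
        raise ValueError("expected 1 <= start <= end <= cd_length")
    return (model_end - model_start + 1) / cd_length

# (accession, first model position, last model position, Cd Length) from this page
FIRST_LIST_HITS = [
    ("NF041261", 344, 1261, 1261),
    ("cd14742", 32, 86, 86),
    ("pfam20148", 2, 72, 74),
]

if __name__ == "__main__":
    for accession, start, end, length in FIRST_LIST_HITS:
        print(f"{accession}: {model_coverage(start, end, length):.1%} of the model")
```

A span that starts well inside the model, as with NF041261 starting at position 344, means the N-terminal part of that model has no counterpart in this particular alignment.
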
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
553-593 6.57e-05

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.



Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 41.81  E-value: 6.57e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 15829859    553 DTQYRITGVSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLA 593
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
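
Because the repeat is named for its YD dipeptide, it can be useful to check what the query carries in the columns where the `Cdd:` row shows YD. The sketch below is illustrative: `motif_columns` is a hypothetical helper, positions are derived by counting non-gap characters from the row start numbers, and motifs interrupted by gaps in the model row are not handled.

```python
import re

def motif_columns(query_row, model_row, query_start, model_start, motif="YD"):
    """For each motif occurrence in the model row, report the model position,
    the query position, and the query characters in the same columns."""
    hits = []
    for m in re.finditer(re.escape(motif), model_row.upper()):
        i = m.start()
        # Sequence positions advance only on non-gap characters.
        model_pos = model_start + sum(c != "-" for c in model_row[:i])
        query_pos = query_start + sum(c != "-" for c in query_row[:i])
        hits.append((model_pos, query_pos, query_row[i:i + len(motif)]))
    return hits

# Rows copied from the YD_repeat_2x (TIGR01643) alignment on this page.
QUERY_ROW = "DTQYRITGVSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLA"   # query starts at residue 553
MODEL_ROW = "DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE"   # model starts at position 2

if __name__ == "__main__":
    for model_pos, query_pos, residues in motif_columns(QUERY_ROW, MODEL_ROW, 553, 2):
        print(f"model {model_pos}: query {query_pos} has {residues}")
```
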
 
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
384-1464 6.04e-34

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.59  E-value: 6.04e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  384 ANKVIRWVTDPVDPVTGAYCDERTDFTLGQTLPLSFTRFHSSVLPLHGLTGVGWSDSWSEYAWVREQGNRVDVISLGATL 463
Cdd:COG3209   52 AATLTARSASTTDVVGTLTGAGGTSAGGVTALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTA 131
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  464 NFAFDGESDTAVNPYHAQYILRRRDDYLELFDRDALSSRFFYDAFPGmrlRHPVTDDTSDDRLAHSPADRMYMLGGMSDT 543
Cdd:COG3209  132 ATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAA---GPATGVGTGAVTLATGLAGSALLALGSGAI 208
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  544 ASNRITFERDTQYRITGVSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLATYEQDARLDYHLFYEYDAADRIIRWSDNDQ 623
Cdd:COG3209  209 LGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNA 288
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  624 TWSRFTYDAQG------RCVTVTGAEGYYNATLDYGDGCTTVTDGKGIHRYYYDPDGNILREEAPDGSTTTYEWDEFHHL 697
Cdd:COG3209  289 AATAGGLGGAGlgsggaGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTT 368
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  698 LARHSPAGRVEKFEYNAAHGQLSRYTAADGADWQYCYDERGLLSNITAPAGQTWTQQCDERGLPVSLVSPQGEETRLAYT 777
Cdd:COG3209  369 SVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDA 448
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  778 PQGLLSGIFRQDERRLGIEYDHHNWPETLTDVMGREHHTEYSGHDLPVKMRGPGGQSVRLQWQQHHKLSGLERAGTGAEG 857
Cdd:COG3209  449 TTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLT 528
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  858 FRYDRHGNLLAYTDGNGVVWTMEYGPFDLPVARTDGEGHRWQYRYDKDTLQLTEVINPQGESYLYILDNCGRVTEERDWG 937
Cdd:COG3209  529 LGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTT 608
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  938 GVVCRYRYDADGLCTARVNGLEETILYSRDAAGRLAEVitpegkTQYAYDKSGRLTGIFSPDGTSQRTGYDERGRVNVTT 1017
Cdd:COG3209  609 TTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTG------TGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTT 682
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1018 QGRRAIEYHYPDEHTVIRCILPPEDERDRHPDGSLLKTTYRYNAAGELTEVilpgDETLTFSRDEAGREVLRHSNRGFAC 1097
Cdd:COG3209  683 VGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTD----GTGTGGTTGTLTTTSTTTTTTAGAL 758
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1098 EQGWNAAGQPVSQRAGLFPAEATWggllpslLREYRYDSAGNVSGVTSREdyGRETHREYrlDRNGQVTAVTASGTGLGY 1177
Cdd:COG3209  759 TYTYDALGRLTSETTPGGVTQGTY-------TTRYTYDALGRLTSVTYPD--GETVTYTY--DALGRLTSVITVGSGGGT 827
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1178 GEGDETYGYDSCGYLKAQSAGRHRISGETD-QYAAGHRLKQA----GNTQYDYDAAGRMVSRTkhrdgyRPETERFRWDS 1252
Cdd:COG3209  828 DLQDRTYTYDAAGNITSITDALRAGTLTQTyTYDALGRLTSAtdpgTTESYTYDANGNLTSRT------DGGTTTYTYDA 901
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1253 RDQLTGYRSAQGEQWEYRHDASGrrtekrcdrkkirftylwdgdsiaeireyrddklysvrhlvfngfelisqqfsrvrq 1332
Cdd:COG3209  902 LGRLVSVTKPDGTTTTYTYDALG--------------------------------------------------------- 924
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1333 phpsvapqwvtrtnhaVSDLTGRPLMLFNSEGKTVWRpGQTSLWGLalslpadtdypdPRGERDPEADPGLLYAGQWQDA 1412
Cdd:COG3209  925 ----------------HTDHLGSVRALTDASGQVVWR-YDYDPFGN------------LLAETSGAAANPLRFTGQEYDA 975
                       1050      1060      1070      1080      1090
                 ....*....|....*....|....*....|....*....|....*....|...
gi 15829859 1413 ESGLCYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYV-PNPCGYIDPLGLAIC 1464
Cdd:COG3209  976 ETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAAL 1028
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1390-1461 4.77e-21

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 88.71  E-value: 4.77e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 15829859   1390 DPRGERDPEADPG---LLYAGQWQDAESGLCYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYVP-NPCGYIDPLGL 1461
Cdd:TIGR03696    2 DPYGEVLSESGAApnpLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
RHS_core NF041261
RHS element core protein;
869-1279 9.50e-18

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 90.06  E-value: 9.50e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   869 YTDGNGV----VWTM---EYgPFDLPVARTdgegHRWQYRYDKDTLQLTEVINPQGESYLYILDNCGRVTEERDWGGVVC 941
Cdd:NF041261  291 YGPDNGIrlsaVWLThdpAY-PESLPAAPL----VRYTYTEAGELLAVYDRSNTQVRAFTYDAQHPGRMVAHRYAGRPEM 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   942 RYRYDADGLCTARVN--GLEETILYSRD-----------------AAGRLAEVITPE----GKTQYAYDKSGRLTGifSP 998
Cdd:NF041261  366 CYRYDDTGRVTEQLNpaGLSYRYQYEQDrititdslnrrevlhteGEGGLKRVVKKEhadgSVTRSGYDAAGRLTA--QT 443
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   999 DGTSQRTGYD---ERGRV-NVTTQGRRAIEYHYPDEHTVIRCIlppederdrHPDGslLKTTYRYNAAGELTEVILPGDE 1074
Cdd:NF041261  444 DAAGRRTEYSlnvVSGDItDITTPDGRETKFYYNDGNQLTSVT---------SPDG--LESRREYDEPGRLVSETSRSGE 512
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1075 TLTFSRDEAgrevlrHSNRGFACEQGWNAAGQPVSQRAGLFPAEATWGGLLPsllrEYRYDSAGNVSGVTSREdyGRETH 1154
Cdd:NF041261  513 TTRYRYDDP------HSELPATTTDATGSTKQMTWSRYGQLLAFTDCSGYQT----RYEYDRFGQMTAVHREE--GISTY 580
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1155 REYrlDRNGQVTAVTASgtglgygEGDET-YGYDSCGYLKAQ-SAGRHRISGETDQYAAGHRLKQAGNTQ-YDYDAAGRM 1231
Cdd:NF041261  581 RRY--DNRGQLTSVKDA-------QGRETrYEYNAAGDLTAViTPDGNRSETQYDAWGKAVSTTQGGLTRsMEYDAAGRI 651
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*...
gi 15829859  1232 VSRTkHRDGYRPEterFRWDSRDQLTGYRSAQGEQWEYRHDASGRRTE 1279
Cdd:NF041261  652 TTLT-NENGSHST---FLYDALDRLVQQRGFDGRTQRYHYDLTGKLTQ 695
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
224-306 1.24e-15

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 73.31  E-value: 1.24e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  224 FPAGPVLMEFATM-VGGRgeikkdvdfPEAGE-DTALCDKeNKPPRIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPD 301
Cdd:COG4104   13 SHGGPVISGSPTVlIGGR---------PAARVgDKVSCPK-HGPDTIAEGSPTVLINGKPAARVGDKTACGGTIISGSPT 82

                 ....*
gi 15829859  302 VFIGG 306
Cdd:COG4104   83 VLIGG 87
PAAR_motif pfam05488
PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. ...
267-309 3.91e-09

PAAR motif; This motif is usually found in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in another family of hypothetical proteins.


Pssm-ID: 428491  Cd Length: 71  Bit Score: 54.50  E-value: 3.91e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 15829859    267 RIAQGSSNVFINNQPAARKGDKLEC-----SAAIVEGSPDVFIGGEQV 309
Cdd:pfam05488   10 VVITGSPTVLIGGKPAARVGDLVVCppcggGGPIAEGSPTVLINGKPA 57
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
670-706 8.36e-06

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 44.13  E-value: 8.36e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 15829859    670 YDPDGNILREEAPDGSTTTYEWDEFHHLLARHSPAGR 706
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
Bacuni_01323_like cd12871
Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded ...
625-744 7.27e-05

Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded beta barrels resembling outer membrane porins. The interior of the barrels is mostly occupied by an insert with partially helical structure.


Pssm-ID: 214015 [Multi-domain]  Cd Length: 231  Bit Score: 45.87  E-value: 7.27e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  625 WSRFTYDAQGRCVTVTGAEGYYNA------TLDYGDGCTTVTD--GKGIHRYYYDPDGNILREEAPD-GSTTTYEWDefh 695
Cdd:cd12871   18 EYTFEYDADGRLTSITTTQEGEAEeityttTITYEPNVITVTDdgGKTVSTYTLNEKGYVTSCTETEyGKGQLRTYT--- 94
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 15829859  696 hllarhspagrvekFEYNAAhGQLSRYTAADGADWQYC---YDERGLLSNIT 744
Cdd:cd12871   95 --------------FTYNAD-GQLTKIVESIGTEYSTItitWNNGDIVSIST 131
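
The hits listed above span E-values from 2.26e-50 down to weak matches near 7e-05, so a reader may want to keep only the stronger ones. A minimal sketch, with the values copied from this page; `filter_hits` and the cutoff of 1e-10 used in the example are arbitrary illustrative choices, not a CD-Search setting.

```python
# (name, accession, (query_start, query_end), e_value), copied from the hits on this page
HITS = [
    ("RHS_core", "NF041261", (607, 1461), 2.26e-50),
    ("RhsA", "COG3209", (384, 1464), 6.04e-34),
    ("PAAR_RHS", "cd14742", (252, 305), 2.36e-24),
    ("Rhs_assc_core", "TIGR03696", (1390, 1461), 4.77e-21),
    ("RHS_core", "NF041261", (869, 1279), 9.50e-18),
    ("PAAR", "COG4104", (224, 306), 1.24e-15),
    ("DUF6531", "pfam20148", (393, 454), 2.78e-13),
    ("PAAR_motif", "pfam05488", (267, 309), 3.91e-09),
    ("RHS_repeat", "pfam05593", (670, 706), 8.36e-06),
    ("YD_repeat_2x", "TIGR01643", (553, 593), 6.57e-05),
    ("Bacuni_01323_like", "cd12871", (625, 744), 7.27e-05),
]

def filter_hits(hits, max_e_value):
    """Keep hits whose E-value is at or below the cutoff, strongest first."""
    kept = [h for h in hits if h[3] <= max_e_value]
    return sorted(kept, key=lambda h: h[3])

if __name__ == "__main__":
    for name, accession, (start, end), e_value in filter_hits(HITS, 1e-10):
        print(f"{name:15s} {accession:10s} {start}-{end}  {e_value:.2e}")
```
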
 
Name Accession Description Interval E-value
RHS_core NF041261
RHS element core protein;
607-1461 2.26e-50

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 196.38  E-value: 2.26e-50
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   607 YEYDA--ADRIIRWSDNDQTWSRFTYDAQGRCVTVTGAEGYyNATLDYGDGCTTVTDGKGIHRYYYDPDG----NILREE 680
Cdd:NF041261  344 FTYDAqhPGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGL-SYRYQYEQDRITITDSLNRREVLHTEGEgglkRVVKKE 422
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   681 APDGSTTTYEWDEFHHLLARHSPAGRVEKFEYNAAHGQLSRYTAADGADWQYCYDERGLLSNITAPAGQTWTQQCDERGL 760
Cdd:NF041261  423 HADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNVVSGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGR 502
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   761 PVSLVSPQGEETRLAYtpqgllsgifrqderrlgiEYDHHNWPETLTDVMgrehhteysghdlpvkmrgpgGQSVRLQWQ 840
Cdd:NF041261  503 LVSETSRSGETTRYRY-------------------DDPHSELPATTTDAT---------------------GSTKQMTWS 542
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   841 QHHKLSGLERAGTGAEGFRYDRHGNLLAYTDGNGVVWTMEYGPFDLPVARTDGEGHRWQYRYDKDTlQLTEVINPQGESY 920
Cdd:NF041261  543 RYGQLLAFTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAG-DLTAVITPDGNRS 621
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   921 LYILDNCGRVTEERDwGGVVCRYRYDADGLCTARV--NGLEETILYsrDAAGRLAEVITPEGKTQ-YAYDKSGRLT---- 993
Cdd:NF041261  622 ETQYDAWGKAVSTTQ-GGLTRSMEYDAAGRITTLTneNGSHSTFLY--DALDRLVQQRGFDGRTQrYHYDLTGKLTqsed 698
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   994 -GIFS------PDGTSQRT---------GYDERG---RVNVTTQGRR-AIEYHYPDEHTvirciLPPEDERDRHPD-GSL 1052
Cdd:NF041261  699 eGLVTlwhydeSDRITHRTvngepaeqwQYDEHGwltDISHLSEGHRvAVHYGYDDKGR-----LTGERQTVENPEtGEL 773
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1053 L---KTTYRYNAAGELTEVI---LPGDETLT------------------FSRDEAGREVLR-----HSNRGFACEQGWNA 1103
Cdd:NF041261  774 LwqhETGHAYNEQGLANRVTpdsLPPVEWLTygsgylagmklggtplveYTRDRLHRETVRsfggaGSNAAYELTTAYTP 853
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1104 AGQPVSQRAGlfpaeatwgglLPSLLREYRYDSAGNVSGVTSredyGREThREYRLDRNGQVTAVTASGTGLgygEGDET 1183
Cdd:NF041261  854 AGQLQSQHLN-----------SLVYDRDYTWNDNGDLVRISG----PRQT-REYGYSATGRLTGVHTTAANL---DIRIP 914
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1184 YGYDSCGYlKAQSAGRHRISGETdqYAAGHRLKQAGNTQYDYDAAGRMVSRTK-------HRDGYRpeTERFRWDSRDQL 1256
Cdd:NF041261  915 YATDPAGN-RLPDPELHPDSTLT--AWPDNRIAEDAHYVYRYDEYGRLTEKTDripegviRTDDER--THHYHYDSQHRL 989
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1257 TGY-RSAQGE---QWEYRHDASGRRTEKRCDRKKIRFT-------------YLWDGD----------------------- 1296
Cdd:NF041261  990 VFYtRIQHGEplvESRYLYDPLGRRMAKRVWRRERDLTgwmslsrkpevtwYGWDGDrlttvqtdttriqtvyqpgsftp 1069
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1297 -----------------SIAEI--------------------------REYRDDKLYSVRHLVFNGFELISQQFSrvRQP 1333
Cdd:NF041261 1070 lirvetengerakaqrrSLAETlqqegsenghgvvfpaelvrmldrleEEIRADRVSEESRAWLAQCGLTVEQMA--RQV 1147
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1334 HPSVAPQWVTRTNHAvsDLTGRPLMLFNSEGKTVWRpGQTSLWGLALslpadtdypdprGERDPEADPGLL-YAGQWQDA 1412
Cdd:NF041261 1148 EPEYTPARKLHLYHC--DHRGLPLALISEEGNTAWQ-GEYDEWGNLL------------NEENPHHLQQPYrLPGQQYDE 1212
                         970       980       990      1000
                  ....*....|....*....|....*....|....*....|....*....
gi 15829859  1413 ESGLCYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYVPNPCGYIDPLGL 1461
Cdd:NF041261 1213 ESGLYYNRNRYYDPLQGRYITQDPIGLKGGWNLYQYPLNPIRFIDPLGL 1261
RhsA COG3209
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ...
384-1464 6.04e-34

Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];


Pssm-ID: 442442 [Multi-domain]  Cd Length: 1103  Bit Score: 142.59  E-value: 6.04e-34
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  384 ANKVIRWVTDPVDPVTGAYCDERTDFTLGQTLPLSFTRFHSSVLPLHGLTGVGWSDSWSEYAWVREQGNRVDVISLGATL 463
Cdd:COG3209   52 AATLTARSASTTDVVGTLTGAGGTSAGGVTALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTA 131
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  464 NFAFDGESDTAVNPYHAQYILRRRDDYLELFDRDALSSRFFYDAFPGmrlRHPVTDDTSDDRLAHSPADRMYMLGGMSDT 543
Cdd:COG3209  132 ATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAA---GPATGVGTGAVTLATGLAGSALLALGSGAI 208
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  544 ASNRITFERDTQYRITGVSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLATYEQDARLDYHLFYEYDAADRIIRWSDNDQ 623
Cdd:COG3209  209 LGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNA 288
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  624 TWSRFTYDAQG------RCVTVTGAEGYYNATLDYGDGCTTVTDGKGIHRYYYDPDGNILREEAPDGSTTTYEWDEFHHL 697
Cdd:COG3209  289 AATAGGLGGAGlgsggaGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTT 368
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  698 LARHSPAGRVEKFEYNAAHGQLSRYTAADGADWQYCYDERGLLSNITAPAGQTWTQQCDERGLPVSLVSPQGEETRLAYT 777
Cdd:COG3209  369 SVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDA 448
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  778 PQGLLSGIFRQDERRLGIEYDHHNWPETLTDVMGREHHTEYSGHDLPVKMRGPGGQSVRLQWQQHHKLSGLERAGTGAEG 857
Cdd:COG3209  449 TTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLT 528
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  858 FRYDRHGNLLAYTDGNGVVWTMEYGPFDLPVARTDGEGHRWQYRYDKDTLQLTEVINPQGESYLYILDNCGRVTEERDWG 937
Cdd:COG3209  529 LGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTT 608
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  938 GVVCRYRYDADGLCTARVNGLEETILYSRDAAGRLAEVitpegkTQYAYDKSGRLTGIFSPDGTSQRTGYDERGRVNVTT 1017
Cdd:COG3209  609 TTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTG------TGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTT 682
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1018 QGRRAIEYHYPDEHTVIRCILPPEDERDRHPDGSLLKTTYRYNAAGELTEVilpgDETLTFSRDEAGREVLRHSNRGFAC 1097
Cdd:COG3209  683 VGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTD----GTGTGGTTGTLTTTSTTTTTTAGAL 758
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1098 EQGWNAAGQPVSQRAGLFPAEATWggllpslLREYRYDSAGNVSGVTSREdyGRETHREYrlDRNGQVTAVTASGTGLGY 1177
Cdd:COG3209  759 TYTYDALGRLTSETTPGGVTQGTY-------TTRYTYDALGRLTSVTYPD--GETVTYTY--DALGRLTSVITVGSGGGT 827
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1178 GEGDETYGYDSCGYLKAQSAGRHRISGETD-QYAAGHRLKQA----GNTQYDYDAAGRMVSRTkhrdgyRPETERFRWDS 1252
Cdd:COG3209  828 DLQDRTYTYDAAGNITSITDALRAGTLTQTyTYDALGRLTSAtdpgTTESYTYDANGNLTSRT------DGGTTTYTYDA 901
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1253 RDQLTGYRSAQGEQWEYRHDASGrrtekrcdrkkirftylwdgdsiaeireyrddklysvrhlvfngfelisqqfsrvrq 1332
Cdd:COG3209  902 LGRLVSVTKPDGTTTTYTYDALG--------------------------------------------------------- 924
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859 1333 phpsvapqwvtrtnhaVSDLTGRPLMLFNSEGKTVWRpGQTSLWGLalslpadtdypdPRGERDPEADPGLLYAGQWQDA 1412
Cdd:COG3209  925 ----------------HTDHLGSVRALTDASGQVVWR-YDYDPFGN------------LLAETSGAAANPLRFTGQEYDA 975
                       1050      1060      1070      1080      1090
                 ....*....|....*....|....*....|....*....|....*....|...
gi 15829859 1413 ESGLCYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYV-PNPCGYIDPLGLAIC 1464
Cdd:COG3209  976 ETGLYYNGARYYDPALGRFLSPDPIGLAGGLNLYAYVgNNPVNYVDPLGLAAL 1028
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
252-305 2.36e-24

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminus, and various predicted functions at N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269827  Cd Length: 86  Bit Score: 98.43  E-value: 2.36e-24
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 15829859  252 AGEDTALCDKENKPP-RIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIG 305
Cdd:cd14742   32 AADSTVACSKHPPPPqLIAEGSETVFINGQPAARKGDKTTCSAVISEGSPNVFIG 86
Rhs_assc_core TIGR03696
RHS repeat-associated core domain; This model represents a conserved unique core sequence ...
1390-1461 4.77e-21

RHS repeat-associated core domain; This model represents a conserved unique core sequence shared by large numbers of proteins. It is occasional in the Archaea (Methanosarcina barkeri) but common in bacteria and eukaryotes. Most fall into two large classes. One class consists of long proteins in which two classes of repeats are abundant: an FG-GAP repeat (pfam01839) class, and an RHS repeat (pfam05593) or YD repeat (TIGR01643). This class includes secreted bacterial insecticidal toxins and intercellular signalling proteins such as the teneurins in animals. The other class consists of uncharacterized proteins shorter than 400 amino acids, where this core domain of about 75 amino acids tends to occur in the N-terminal half. Over twenty such proteins are found in Pseudomonas putida alone; little sequence similarity or repeat structure is found among these proteins outside the region modeled by this domain.


Pssm-ID: 274730 [Multi-domain]  Cd Length: 77  Bit Score: 88.71  E-value: 4.77e-21
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 15829859   1390 DPRGERDPEADPG---LLYAGQWQDAESGLCYNRFRYYEPETGMYLVSDPLGLQGGEQTYRYVP-NPCGYIDPLGL 1461
Cdd:TIGR03696    2 DPYGEVLSESGAApnpLRFTGQYYDAETGLYYNGARYYDPELGRFLSPDPIGLGGGLNLYAYVGnNPVNWVDPLGL 77
RHS_core NF041261
RHS element core protein;
869-1279 9.50e-18

RHS element core protein;


Pssm-ID: 469161 [Multi-domain]  Cd Length: 1261  Bit Score: 90.06  E-value: 9.50e-18
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   869 YTDGNGV----VWTM---EYgPFDLPVARTdgegHRWQYRYDKDTLQLTEVINPQGESYLYILDNCGRVTEERDWGGVVC 941
Cdd:NF041261  291 YGPDNGIrlsaVWLThdpAY-PESLPAAPL----VRYTYTEAGELLAVYDRSNTQVRAFTYDAQHPGRMVAHRYAGRPEM 365
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   942 RYRYDADGLCTARVN--GLEETILYSRD-----------------AAGRLAEVITPE----GKTQYAYDKSGRLTGifSP 998
Cdd:NF041261  366 CYRYDDTGRVTEQLNpaGLSYRYQYEQDrititdslnrrevlhteGEGGLKRVVKKEhadgSVTRSGYDAAGRLTA--QT 443
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859   999 DGTSQRTGYD---ERGRV-NVTTQGRRAIEYHYPDEHTVIRCIlppederdrHPDGslLKTTYRYNAAGELTEVILPGDE 1074
Cdd:NF041261  444 DAAGRRTEYSlnvVSGDItDITTPDGRETKFYYNDGNQLTSVT---------SPDG--LESRREYDEPGRLVSETSRSGE 512
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1075 TLTFSRDEAgrevlrHSNRGFACEQGWNAAGQPVSQRAGLFPAEATWGGLLPsllrEYRYDSAGNVSGVTSREdyGRETH 1154
Cdd:NF041261  513 TTRYRYDDP------HSELPATTTDATGSTKQMTWSRYGQLLAFTDCSGYQT----RYEYDRFGQMTAVHREE--GISTY 580
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  1155 REYrlDRNGQVTAVTASgtglgygEGDET-YGYDSCGYLKAQ-SAGRHRISGETDQYAAGHRLKQAGNTQ-YDYDAAGRM 1231
Cdd:NF041261  581 RRY--DNRGQLTSVKDA-------QGRETrYEYNAAGDLTAViTPDGNRSETQYDAWGKAVSTTQGGLTRsMEYDAAGRI 651
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*...
gi 15829859  1232 VSRTkHRDGYRPEterFRWDSRDQLTGYRSAQGEQWEYRHDASGRRTE 1279
Cdd:NF041261  652 TTLT-NENGSHST---FLYDALDRLVQQRGFDGRTQRYHYDLTGKLTQ 695
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
224-306 1.24e-15

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 73.31  E-value: 1.24e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  224 FPAGPVLMEFATM-VGGRgeikkdvdfPEAGE-DTALCDKeNKPPRIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPD 301
Cdd:COG4104   13 SHGGPVISGSPTVlIGGR---------PAARVgDKVSCPK-HGPDTIAEGSPTVLINGKPAARVGDKTACGGTIISGSPT 82

                 ....*
gi 15829859  302 VFIGG 306
Cdd:COG4104   83 VLIGG 87
DUF6531 pfam20148
Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.
393-454 2.78e-13

Domain of unknown function (DUF6531); This putative domain is found in a range of RHS proteins.


Pssm-ID: 466309 [Multi-domain]  Cd Length: 74  Bit Score: 66.40  E-value: 2.78e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15829859    393 DPVDPVTGAYCDERTDFTLGQTLPLSFTRFHSSVLPLHGLTGVGWSDSWSEY---------AWVREQGNRV 454
Cdd:pfam20148    2 DPVNVATGNKVLEETDFSLPGPLPLVWTRTYNSSSERDGPLGPGWSHPYDQRlelegdggvVYIDADGREV 72
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
260-309 1.53e-11

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 61.75  E-value: 1.53e-11
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....
gi 15829859  260 DKENKPPRIAQGSSNVFINNQPAARKGDKLECS----AAIVEGSPDVFIGGEQV 309
Cdd:COG4104   10 DKTSHGGPVISGSPTVLIGGRPAARVGDKVSCPkhgpDTIAEGSPTVLINGKPA 63
PAAR_like cd14671
proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR ...
268-310 2.49e-09

proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat superfamily, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. The PAAR-repeat proteins form a diverse superfamily with several subgroups extended both N- and C-terminally by domains with various predicted functions; the termini are exposed to solution, and do not distort the VgrG binding site. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269821  Cd Length: 77  Bit Score: 55.41  E-value: 2.49e-09
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*.
gi 15829859  268 IAQGSSNVFINNQPAARKGDKLEC---SAAIVEGSPDVFIGGEQVT 310
Cdd:cd14671   17 VISGSPNVFINGRPAARVGDVGDHpggGNAIVSGSGTVFINGKPAA 62
PAAR_1 cd14737
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
260-305 2.62e-09

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269822  Cd Length: 94  Bit Score: 55.75  E-value: 2.62e-09
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*....
gi 15829859  260 DKENKPP---RIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIG 305
Cdd:cd14737   46 TCPKHPPhggVIASGSSTVFINGKPAARVGDPVSCGGTVAGGSPNVFIG 94
PAAR_like cd14671
proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR ...
265-298 3.37e-09

proline-alanine-alanine-arginine (PAAR) repeat superfamily; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat superfamily, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. The PAAR-repeat proteins form a diverse superfamily with several subgroups extended both N- and C-terminally by domains with various predicted functions; the termini are exposed to solution, and do not distort the VgrG binding site. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269821  Cd Length: 77  Bit Score: 55.02  E-value: 3.37e-09
                         10        20        30
                 ....*....|....*....|....*....|....
gi 15829859  265 PPRIAQGSSNVFINNQPAARKGDKLECSAAIVEG 298
Cdd:cd14671   44 GNAIVSGSGTVFINGKPAARVGDRTSCGGVIVSG 77
PAAR_motif pfam05488
PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. ...
267-309 3.91e-09

PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in another family of hypothetical proteins.


Pssm-ID: 428491  Cd Length: 71  Bit Score: 54.50  E-value: 3.91e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 15829859    267 RIAQGSSNVFINNQPAARKGDKLEC-----SAAIVEGSPDVFIGGEQV 309
Cdd:pfam05488   10 VVITGSPTVLIGGKPAARVGDLVVCppcggGGPIAEGSPTVLINGKPA 57
PAAR_2 cd14738
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
255-305 6.58e-09

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269823  Cd Length: 94  Bit Score: 54.56  E-value: 6.58e-09
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|..
gi 15829859  255 DTALCdkeNKPP-RIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIG 305
Cdd:cd14738   46 DMCVC---VGPPdTIVQGSSTVLIGGKPAARMGDSTAHGGVIVSGVPTVLIG 94
PAAR_motif pfam05488
PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. ...
255-296 1.70e-08

PAAR motif; This motif is found usually in pairs in a family of bacterial membrane proteins. It is also found as a triplet of tandem repeats comprising the entire length in another family of hypothetical proteins.


Pssm-ID: 428491  Cd Length: 71  Bit Score: 52.57  E-value: 1.70e-08
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 15829859    255 DTALCDKENKPPRIAQGSSNVFINNQPAARKGDKLECSAAIV 296
Cdd:pfam05488   30 DLVVCPPCGGGGPIAEGSPTVLINGKPAAREGDKTACGATLI 71
PAAR_CT_1 cd14743
proline-alanine-alanine-arginine (PAAR) domain with C-terminal extension; This domain is found ...
255-312 1.75e-08

proline-alanine-alanine-arginine (PAAR) domain with C-terminal extension; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family of mostly gamma-proteobacteria, and forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). Some members contain C-terminal domain extensions corresponding to Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences as well as uncharacterized domains. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269828  Cd Length: 78  Bit Score: 53.07  E-value: 1.75e-08
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 15829859  255 DTALCDKENKPPRIAQGSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIGGEQVTYL 312
Cdd:cd14743    7 DPHACPLPGHGSTPIGSSSADFFDGLPAARVGDKTSCGATIVSGSINVLINGKPAAVL 64
PAAR_2 cd14738
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
266-306 7.75e-07

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269823  Cd Length: 94  Bit Score: 48.78  E-value: 7.75e-07
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....
gi 15829859  266 PRIAQGSSNVFINNQPAARKGDKLECSA---AIVEGSPDVFIGG 306
Cdd:cd14738   25 PIVGPGPTTVLIGGLPAARVGDMCVCVGppdTIVQGSSTVLIGG 68
PAAR_1 cd14737
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
265-309 1.09e-06

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269822  Cd Length: 94  Bit Score: 48.43  E-value: 1.09e-06
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 15829859  265 PPR-IAQGSSNVFINNQPAARKGDKLE---C------SAAIVEGSPDVFIGGEQV 309
Cdd:cd14737   17 PPTpVIAGSPDVTVNGKPVLRQGDALAphtCpkhpphGGVIASGSSTVFINGKPA 71
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
670-711 1.20e-06

YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.


Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 46.43  E-value: 1.20e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 15829859    670 YDPDGNILREEAPDGSTTTYEWDEFHHLLARHSPAGRVEKFE 711
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
PAAR_5 cd14741
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
252-304 1.47e-06

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family in bacteria as well as some archaea, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269826  Cd Length: 95  Bit Score: 48.16  E-value: 1.47e-06
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 15829859  252 AGEDTALCDKENKPPR-----IAQGSSNVFINNQPAARKGDKLECSAA---IVEGSPDVFI 304
Cdd:cd14741   35 AGGDGHVCPLVTGPVPhvggvVAAGSTTVLINGLPAARMGDMIVEGGPpntIAMGAPTVLI 95
PAAR_3 cd14739
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
264-305 7.65e-06

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family, where it forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). The T6SS is responsible for translocation of a wide variety of toxic effector molecules, allowing predatory cells to kill prokaryotic as well as eukaryotic prey cells. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes. It has been shown that PAAR proteins are essential for T6SS-mediated secretion and target cell killing by Vibrio cholerae (encodes two PAAR proteins) and Acinetobacter baylyi (encodes three PAAR proteins); inactivation of all these PAAR genes results in inactivation of Hcp secretion as well as T6SS-dependent killing of E. coli.


Pssm-ID: 269824  Cd Length: 90  Bit Score: 45.81  E-value: 7.65e-06
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|
gi 15829859  264 KPPRIAQ--------GSSNVFINNQPAARKGDKLECSAAIVEGSPDVFIG 305
Cdd:cd14739   41 IPPPPAHppaspfppGSATVLIGGRPAARVGDACGCGATIVVGAPTVLIG 90
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
670-706 8.36e-06

RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.


Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 44.13  E-value: 8.36e-06
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 15829859    670 YDPDGNILREEAPDGSTTTYEWDEFHHLLARHSPAGR 706
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
553-593 6.57e-05

Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 41.81  E-value: 6.57e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 15829859    553 DTQYRITGVSHTDGIRLKLTYHASGYLKAIHRTDNGIQTLA 593
Cdd:TIGR01643    2 DAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
Bacuni_01323_like cd12871
Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded ...
625-744 7.27e-05

Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded beta barrels resembling outer membrane porins. The interior of the barrels is mostly occupied by an insert with partially helical structure.


Pssm-ID: 214015 [Multi-domain]  Cd Length: 231  Bit Score: 45.87  E-value: 7.27e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15829859  625 WSRFTYDAQGRCVTVTGAEGYYNA------TLDYGDGCTTVTD--GKGIHRYYYDPDGNILREEAPD-GSTTTYEWDefh 695
Cdd:cd12871   18 EYTFEYDADGRLTSITTTQEGEAEeityttTITYEPNVITVTDdgGKTVSTYTLNEKGYVTSCTETEyGKGQLRTYT--- 94
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 15829859  696 hllarhspagrvekFEYNAAhGQLSRYTAADGADWQYC---YDERGLLSNIT 744
Cdd:cd12871   95 --------------FTYNAD-GQLTKIVESIGTEYSTItitWNNGDIVSIST 131
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
967-1005 9.64e-05

Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 41.04  E-value: 9.64e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 15829859    967 DAAGRLAEVITPEGK-TQYAYDKSGRLTGIFSPDGTSQRT 1005
Cdd:TIGR01643    2 DAAGRLTGSTDADGTtTRYTYDAAGRLVEITDADGGSTRY 41
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
860-895 1.14e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 40.66  E-value: 1.14e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 15829859    860 YDRHGNLLAYTDGNGVVWTMEYGPFDLPVARTDGEG 895
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
609-644 2.38e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.89  E-value: 2.38e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 15829859    609 YDAADRIIRWSDNDQTWSRFTYDAQGRCVTVTGAEG 644
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
967-1001 2.48e-04

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 39.89  E-value: 2.48e-04
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 15829859    967 DAAGRLAEVITPEGK-TQYAYDKSGRLTGIFSPDGT 1001
Cdd:pfam05593    2 DAAGRLTSVTDPDGRvTTYTYDAAGRLTAVTDPDGT 37
PAAR_CT_2 cd14744
proline-alanine-alanine-arginine (PAAR) domain with uncharacterized C-terminal extension; This ...
267-309 3.81e-04

proline-alanine-alanine-arginine (PAAR) domain with uncharacterized C-terminal extension; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family of mostly beta- and gamma-proteobacteria, and forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). Most members contain C-terminal domain extensions corresponding to several uncharacterized domains such as S-type pyocin, DUF2235, DUF2345 and cytotoxic proteins. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269829  Cd Length: 78  Bit Score: 40.62  E-value: 3.81e-04
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....*...
gi 15829859  267 RIAQGSSNVFINNQPAARKGDKLECSA-----AIVEGSPDVFIGGEQV 309
Cdd:cd14744   15 VVISGSSTFTIDGRPVARVGDKVTCPKckgtgPIVEGGPTFTVDGRPV 62
PAAR_4 cd14740
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
265-306 6.67e-04

proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family of bacteria, and forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). A few members contain C-terminal domain extensions corresponding to Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences as well as uncharacterized domains such as DUF4150. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269825  Cd Length: 121  Bit Score: 41.26  E-value: 6.67e-04
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 15829859  265 PPRIAQGSSNVFINNQPAARKGDKLEC---------------SAAIVEGSPDVFIGG 306
Cdd:cd14740   34 GLIVGGLSPTVLIGGMPAATVGSTAGNtpggvpggpsvppanPGTIVMGSSTVFING 90
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
1250-1281 8.73e-04

Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 38.34  E-value: 8.73e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 15829859   1250 WDSRDQLTGYRSAQGEQWEYRHDASGRRTEKR 1281
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEIT 32
PAAR COG4104
Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular ...
280-306 1.12e-03

Zn-binding Pro-Ala-Ala-Arg (PAAR) domain, involved in Type VI secretion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 443280  Cd Length: 87  Bit Score: 39.41  E-value: 1.12e-03
                         10        20
                 ....*....|....*....|....*..
gi 15829859  280 QPAARKGDKLECSAAIVEGSPDVFIGG 306
Cdd:COG4104    3 KPAARLGDKTSHGGPVISGSPTVLIGG 29
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
860-900 1.29e-03

Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 37.95  E-value: 1.29e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 15829859    860 YDRHGNLLAYTDGNGVVWTMEYGPFDLPVARTDGEGHRWQY 900
Cdd:TIGR01643    1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRY 41
PAAR_CT_1 cd14743
proline-alanine-alanine-arginine (PAAR) domain with C-terminal extension; This domain is found ...
268-299 1.79e-03

proline-alanine-alanine-arginine (PAAR) domain with C-terminal extension; This domain is found in the PAAR (proline-alanine-alanine-arginine) repeat family of mostly gamma-proteobacteria, and forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS). Some members contain C-terminal domain extensions corresponding to Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences as well as uncharacterized domains. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269828  Cd Length: 78  Bit Score: 38.82  E-value: 1.79e-03
                         10        20        30
                 ....*....|....*....|....*....|..
gi 15829859  268 IAQGSSNVFINNQPAARKGDKLECSAAIVEGS 299
Cdd:cd14743   47 IVSGSINVLINGKPAAVLGSTTSHGGVVIGGS 78
PAAR_RHS cd14742
proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement ...
281-309 2.03e-03

proline-alanine-alanine-arginine (PAAR) domain, also containing C-terminal Rearrangement hotspot (Rhs) extensions; This PAAR (proline-alanine-alanine-arginine) repeat subfamily, which forms a sharp conical extension on the VgrG spike, a trimeric protein complex of the bacterial type VI secretion system (T6SS), contains C- and N-terminal domain extensions. These include Rearrangement hotspot (Rhs) protein repeats and conserved Rhs repeat-associated unique core sequences at the C-terminus, and various predicted functions at N- and C-terminal extensions. However, these terminal domains are exposed to solution, and do not distort the binding site of VgrG. Rhs and related YD-peptide repeat proteins are widely distributed in bacteria. Rhs shares similar architecture with distantly related WapA proteins of Bacillus and Listeria species, suggesting intercellular growth inhibition as its primary function. Additionally, a plasmid-encoded Rhs protein has been implicated in bacteriocin production in Pseudomonas savastanoi. The pointed tip of the PAAR domain is stabilized by a zinc atom positioned close to the cone's vertex and is likely to be important for its integrity during penetration of the target cell envelope. VgrG proteins are orthologous to the central baseplate spikes of bacteriophages with contractile tails, and genes encoding proteins with PAAR motifs have been frequently found immediately downstream from vgrG-like genes.


Pssm-ID: 269827  Cd Length: 86  Bit Score: 38.72  E-value: 2.03e-03
                         10        20
                 ....*....|....*....|....*....
gi 15829859  281 PAARKGDKLECSAAIVEGSPDVFIGGEQV 309
Cdd:cd14742    1 PAARVGDPIAHTGTITSGSPNVFINGKPA 29
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
986-1017 2.16e-03

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 37.19  E-value: 2.16e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 15829859    986 YDKSGRLTGIFSPDGTSQRTGYDERGRVNVTT 1017
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVT 32
PAAR_4 cd14740
proline-alanine-alanine-arginine (PAAR) domain; This domain is found in the PAAR ...
263-287 2.27e-03

Pssm-ID: 269825  Cd Length: 121  Bit Score: 39.72  E-value: 2.27e-03
                         10        20
                 ....*....|....*....|....*
gi 15829859  263 NKPPRIAQGSSNVFINNQPAARKGD 287
Cdd:cd14740   74 ANPGTIVMGSSTVFINGKPAARMGD 98
RHS_repeat pfam05593
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ...
945-980 3.04e-03

Pssm-ID: 461685 [Multi-domain]  Cd Length: 37  Bit Score: 36.81  E-value: 3.04e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 15829859    945 YDADGLCTARVNGLEETILYSRDAAGRLAEVITPEG 980
Cdd:pfam05593    1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
YD_repeat_2x TIGR01643
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ...
712-751 3.17e-03

Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 36.80  E-value: 3.17e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|
gi 15829859    712 YNAAhGQLSRYTAADGADWQYCYDERGLLSNITAPAGQTW 751
Cdd:TIGR01643    1 YDAA-GRLTGSTDADGTTTRYTYDAAGRLVEITDADGGST 39
PAAR_CT_2 cd14744
proline-alanine-alanine-arginine (PAAR) domain with uncharacterized C-terminal extension; This ...
255-291 7.15e-03

Pssm-ID: 269829  Cd Length: 78  Bit Score: 37.15  E-value: 7.15e-03
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 15829859  255 DTALCDKENKPPRIAQGSSNVFINNQPAARKGDKLEC 291
Cdd:cd14744   35 DKVTCPKCKGTGPIVEGGPTFTVDGRPVALDGDRVAC 71
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
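
The E-value threshold listed above can be applied to the per-hit statistics lines ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") if the page text is processed offline. The following Python sketch parses one such line and checks it against the threshold. The names STATS_RE, parse_stats and passes_threshold are hypothetical, the line layout is assumed from the hits shown on this page, and treating the threshold as inclusive is also an assumption.

```python
import re

# Assumed layout of a statistics line, as it appears in the hits on this page.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional tag such as [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line: str):
    """Parse one statistics line into a dict, or return None if it does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(stats: dict, threshold: float = 0.01) -> bool:
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return stats["e_value"] <= threshold


line = "Pssm-ID: 273728 [Multi-domain]  Cd Length: 42  Bit Score: 46.43  E-value: 1.20e-06"
stats = parse_stats(line)
print(stats, passes_threshold(stats))
```

A regular expression is used here only because the statistics line is flat text on this page; structured downloads from the service, where available, would be a more robust input.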
