NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|818522861|gb|KKR75634|]
View 

MAG: Transcription antitermination protein nusG [Parcubacteria group bacterium GW2011_GWB2_40_8]

Protein Classification

transcription termination/antitermination protein NusG( domain architecture ID 11415705)

transcription termination/antitermination protein NusG (N-Utilization Substance G) is involved in transcription elongation and termination in bacteria

Gene Symbol:  nusG
SCOP:  4002609

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NusG COG0250
Transcription termination/antitermination protein NusG [Transcription];
10-181 4.28e-69

Transcription termination/antitermination protein NusG [Transcription];


:

Pssm-ID: 440020 [Multi-domain]  Cd Length: 171  Bit Score: 207.37  E-value: 4.28e-69
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  10 RKWYAVHTYAGYEDTVVRNLKQRIEslGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNT 89
Cdd:COG0250    5 KRWYVVHTYSGYEKKVKENLERRIE--GIE-----VFVPTEEVVEIKNGKKKTVERPLFPGYVFVRMDLTDESWYLVRNT 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  90 PRVTGFVGAGTTPVPLDEKEIKILFDRM--GVSEPKYKLDVKLGEVVKITDGPFKDFEGNVSEIDEERGKIKILISMFGR 167
Cdd:COG0250   78 PGVTGFVGFGGKPAPLPDEEVERILARLeeGEEKPRPKVDFEVGDRVRITDGPFAGFEGTVEEVDPEKGRVKVLVSIFGR 157
                        170
                 ....*....|....
gi 818522861 168 ETPVEIDFLQVKKI 181
Cdd:COG0250  158 ETPVELDFSQVEKI 171
 
Name Accession Description Interval E-value
NusG COG0250
Transcription termination/antitermination protein NusG [Transcription];
10-181 4.28e-69

Transcription termination/antitermination protein NusG [Transcription];


Pssm-ID: 440020 [Multi-domain]  Cd Length: 171  Bit Score: 207.37  E-value: 4.28e-69
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  10 RKWYAVHTYAGYEDTVVRNLKQRIEslGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNT 89
Cdd:COG0250    5 KRWYVVHTYSGYEKKVKENLERRIE--GIE-----VFVPTEEVVEIKNGKKKTVERPLFPGYVFVRMDLTDESWYLVRNT 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  90 PRVTGFVGAGTTPVPLDEKEIKILFDRM--GVSEPKYKLDVKLGEVVKITDGPFKDFEGNVSEIDEERGKIKILISMFGR 167
Cdd:COG0250   78 PGVTGFVGFGGKPAPLPDEEVERILARLeeGEEKPRPKVDFEVGDRVRITDGPFAGFEGTVEEVDPEKGRVKVLVSIFGR 157
                        170
                 ....*....|....
gi 818522861 168 ETPVEIDFLQVKKI 181
Cdd:COG0250  158 ETPVELDFSQVEKI 171
nusG TIGR00922
transcription termination/antitermination factor NusG; NusG proteins are transcription factors ...
12-180 6.98e-68

transcription termination/antitermination factor NusG; NusG proteins are transcription factors which are aparrently universal in prokaryotes (archaea and eukaryotes have homologs that may have related functions). The essential components of these factors include an N-terminal RNP-like (ribonucleoprotein) domain and a C-terminal KOW motif (pfam00467) believed to be a nucleic acid binding domain. In E. coli, NusA has been shown to interact with RNA polymerase and termination factor Rho. This model covers a wide variety of bacterial species but excludes mycoplasmas which are covered by a separate model (TIGR01956).The function of all of these NusG proteins is likely to be the same at the level of interaction with RNA and other protein factors to affect termination; however different species may utilize NusG towards different processes and in combination with different suites of affector proteins.In E. coli, NusG promotes rho-dependent termination. It is an essential gene. In Streptomyces virginiae and related species, an additional N-terminal sequence is also present and is suggested to play a role in butyrolactone-mediated autoregulation. In Thermotoga maritima, NusG has a long insert, fails to substitute for E. coli NusG (with or without the long insert), is a large 0.7 % of total cellular protein, and has a general, sequence non-specific DNA and RNA binding activity that blocks ethidium staining, yet permits transcription.Archaeal proteins once termed NusG share the KOW domain but are actually a ribosomal protein corresponding to L24p in bacterial and L26e in eukaryotes (TIGR00405). [Transcription, Transcription factors]


Pssm-ID: 273341 [Multi-domain]  Cd Length: 172  Bit Score: 204.46  E-value: 6.98e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   12 WYAVHTYAGYEDTVVRNLKQRIESLGMEDVIFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTPR 91
Cdd:TIGR00922   1 WYVVQTYSGYEKKVKQNLEELIELLGMGDYIFEVIVPTEEVVEIKKGKKKVVERKIFPGYVLVKMDLTDVSWHLVKNTPG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   92 VTGFVGAGTTPVPLDEK-EIKILFDRM--GVSEPKYKLDVKLGEVVKITDGPFKDFEGNVSEIDEERGKIKILISMFGRE 168
Cdd:TIGR00922  81 VTGFVGSGGKPKALSEDeEVKNILNALeeGKDKPKPKIDFEVGEQVRVNDGPFANFTGTVEEVDYEKSKLKVSVSIFGRE 160
                         170
                  ....*....|..
gi 818522861  169 TPVEIDFLQVKK 180
Cdd:TIGR00922 161 TPVELEFSQVEK 172
NGN_Bact_1 cd09891
Bacterial N-Utilization Substance G (NusG) N-terminal (NGN) domain, subgroup 1; The ...
11-117 1.26e-48

Bacterial N-Utilization Substance G (NusG) N-terminal (NGN) domain, subgroup 1; The N-Utilization Substance G (NusG) protein is involved in transcription elongation and termination in bacteria. NusG is essential in Escherichia coli and associates with RNA polymerase elongation and Rho-termination. Homologs of the NusG gene exist in all bacteria. The NusG N-terminal domain (NGN) is similar in all NusG homologs, but its C-terminal domain and the linker that separates these two domains are different. The domain organization of NusG suggests that the common properties of NusG and its homologs are due to their similar NGN domains.


Pssm-ID: 193580 [Multi-domain]  Cd Length: 107  Bit Score: 153.02  E-value: 1.26e-48
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  11 KWYAVHTYAGYEDTVVRNLKQRIESLGMEDVIFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTP 90
Cdd:cd09891    1 KWYVVHTYSGYENKVKENLEKRIESEGLEDYIGEVLVPTEEVVEVKNGKKKVKERKLFPGYVLVEMDMNDDTWHLVRNTP 80
                         90       100
                 ....*....|....*....|....*..
gi 818522861  91 RVTGFVGAGTTPVPLDEKEIKILFDRM 117
Cdd:cd09891   81 GVTGFVGSGGKPVPLSEEEVERILGQV 107
NGN smart00738
In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, ...
11-114 2.25e-34

In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, this domain may confer affinity for Spt4p.Spt4p


Pssm-ID: 197850 [Multi-domain]  Cd Length: 106  Bit Score: 117.09  E-value: 2.25e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861    11 KWYAVHTYAGYEDTVVRNLKQRIESLGMEDVIFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTP 90
Cdd:smart00738   1 NWYAVRTTSGQEKRVAENLERKAEALGLEDKIVSILVPTEEVKEIRRGKKKVVERKLFPGYIFVEADLEDEVWTAIRGTP 80
                           90       100
                   ....*....|....*....|....
gi 818522861    91 RVTGFVGAGTTPVPLDEKEIKILF 114
Cdd:smart00738  81 GVRGFVGGGGKPTPVPDDEIEKIL 104
NusG pfam02357
Transcription termination factor nusG;
11-113 4.36e-32

Transcription termination factor nusG;


Pssm-ID: 426736 [Multi-domain]  Cd Length: 96  Bit Score: 110.78  E-value: 4.36e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   11 KWYAVHTYAGYEDTVvrnlKQRIESLGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTP 90
Cdd:pfam02357   2 KWYVLQTYSGKEKKV----KENLERQGIE-----VFLPTEEVVEIRNGKKKVVERPLFPGYVFVRMDMTDETWHLVRSTP 72
                          90       100
                  ....*....|....*....|...
gi 818522861   91 RVTGFVGAGTTPVPLDEKEIKIL 113
Cdd:pfam02357  73 GVTGFVGGSGKPTPIPDEEVERI 95
rfaH PRK09014
transcription/translation regulatory transformer protein RfaH;
54-181 4.30e-10

transcription/translation regulatory transformer protein RfaH;


Pssm-ID: 181611 [Multi-domain]  Cd Length: 162  Bit Score: 55.67  E-value: 4.30e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  54 KIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTPRVTGFVGAGTTP--VPLDekeikILFDRMGVSEPKYKLDV--K 129
Cdd:PRK09014  37 KIVRGKRTEVSEPLFPNYLFVEFDPEVIHTTTIRSTRGVSHFVRFGAQPaiVPSD-----VIYQLSVYKPEKIVDPEtpK 111
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 818522861 130 LGEVVKITDGPFKDFEGNVSEIDEERGKIkILISMFGRETPVEIDFLQVKKI 181
Cdd:PRK09014 112 PGDKVIITEGAFEGLQAIYTEPDGEARSI-LLLNLLNKQVKHSVDNTQFRKI 162
antiterm_UpxY NF033644
UpxY family transcription antiterminator; The UpxY family of NusG-related transcription ...
12-162 4.16e-08

UpxY family transcription antiterminator; The UpxY family of NusG-related transcription antiterminators was described originally from a paralogous family of eight members from Bacteriodes fragilis, UpaY to UphY, each of which was associated with a distinct capsular polysaccharide biosynthesis locus. There is no UpxY protein per se.


Pssm-ID: 468125 [Multi-domain]  Cd Length: 162  Bit Score: 50.16  E-value: 4.16e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  12 WYAVHTYAGYEDTVvrnlKQRIESLGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDmVVTDNSWYVVRNTPR 91
Cdd:NF033644   1 WYALYTRPRREKKV----AELLEKKGIE-----SFLPMQKEIRQWSDRKKRVEVPLIPNLVFVH-ITEKELDEVLEQTPG 70
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 818522861  92 VTGFVGA-GTTPVPLD--EKEIKILfdRMGVSEPK-----YKLDVKLGEVVKITDGPFKDFEGNVSEIdeeRGKIKILI 162
Cdd:NF033644  71 VVRYIRDdRGKSKPAIipDKQMERF--RLMLDPSDevvvyLEAPLKKGDKVRVIGGPLKGLEGELVRV---KGKKRVVV 144
 
Name Accession Description Interval E-value
NusG COG0250
Transcription termination/antitermination protein NusG [Transcription];
10-181 4.28e-69

Transcription termination/antitermination protein NusG [Transcription];


Pssm-ID: 440020 [Multi-domain]  Cd Length: 171  Bit Score: 207.37  E-value: 4.28e-69
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  10 RKWYAVHTYAGYEDTVVRNLKQRIEslGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNT 89
Cdd:COG0250    5 KRWYVVHTYSGYEKKVKENLERRIE--GIE-----VFVPTEEVVEIKNGKKKTVERPLFPGYVFVRMDLTDESWYLVRNT 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  90 PRVTGFVGAGTTPVPLDEKEIKILFDRM--GVSEPKYKLDVKLGEVVKITDGPFKDFEGNVSEIDEERGKIKILISMFGR 167
Cdd:COG0250   78 PGVTGFVGFGGKPAPLPDEEVERILARLeeGEEKPRPKVDFEVGDRVRITDGPFAGFEGTVEEVDPEKGRVKVLVSIFGR 157
                        170
                 ....*....|....
gi 818522861 168 ETPVEIDFLQVKKI 181
Cdd:COG0250  158 ETPVELDFSQVEKI 171
nusG TIGR00922
transcription termination/antitermination factor NusG; NusG proteins are transcription factors ...
12-180 6.98e-68

transcription termination/antitermination factor NusG; NusG proteins are transcription factors which are aparrently universal in prokaryotes (archaea and eukaryotes have homologs that may have related functions). The essential components of these factors include an N-terminal RNP-like (ribonucleoprotein) domain and a C-terminal KOW motif (pfam00467) believed to be a nucleic acid binding domain. In E. coli, NusA has been shown to interact with RNA polymerase and termination factor Rho. This model covers a wide variety of bacterial species but excludes mycoplasmas which are covered by a separate model (TIGR01956).The function of all of these NusG proteins is likely to be the same at the level of interaction with RNA and other protein factors to affect termination; however different species may utilize NusG towards different processes and in combination with different suites of affector proteins.In E. coli, NusG promotes rho-dependent termination. It is an essential gene. In Streptomyces virginiae and related species, an additional N-terminal sequence is also present and is suggested to play a role in butyrolactone-mediated autoregulation. In Thermotoga maritima, NusG has a long insert, fails to substitute for E. coli NusG (with or without the long insert), is a large 0.7 % of total cellular protein, and has a general, sequence non-specific DNA and RNA binding activity that blocks ethidium staining, yet permits transcription.Archaeal proteins once termed NusG share the KOW domain but are actually a ribosomal protein corresponding to L24p in bacterial and L26e in eukaryotes (TIGR00405). [Transcription, Transcription factors]


Pssm-ID: 273341 [Multi-domain]  Cd Length: 172  Bit Score: 204.46  E-value: 6.98e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   12 WYAVHTYAGYEDTVVRNLKQRIESLGMEDVIFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTPR 91
Cdd:TIGR00922   1 WYVVQTYSGYEKKVKQNLEELIELLGMGDYIFEVIVPTEEVVEIKKGKKKVVERKIFPGYVLVKMDLTDVSWHLVKNTPG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   92 VTGFVGAGTTPVPLDEK-EIKILFDRM--GVSEPKYKLDVKLGEVVKITDGPFKDFEGNVSEIDEERGKIKILISMFGRE 168
Cdd:TIGR00922  81 VTGFVGSGGKPKALSEDeEVKNILNALeeGKDKPKPKIDFEVGEQVRVNDGPFANFTGTVEEVDYEKSKLKVSVSIFGRE 160
                         170
                  ....*....|..
gi 818522861  169 TPVEIDFLQVKK 180
Cdd:TIGR00922 161 TPVELEFSQVEK 172
NGN_Bact_1 cd09891
Bacterial N-Utilization Substance G (NusG) N-terminal (NGN) domain, subgroup 1; The ...
11-117 1.26e-48

Bacterial N-Utilization Substance G (NusG) N-terminal (NGN) domain, subgroup 1; The N-Utilization Substance G (NusG) protein is involved in transcription elongation and termination in bacteria. NusG is essential in Escherichia coli and associates with RNA polymerase elongation and Rho-termination. Homologs of the NusG gene exist in all bacteria. The NusG N-terminal domain (NGN) is similar in all NusG homologs, but its C-terminal domain and the linker that separates these two domains are different. The domain organization of NusG suggests that the common properties of NusG and its homologs are due to their similar NGN domains.


Pssm-ID: 193580 [Multi-domain]  Cd Length: 107  Bit Score: 153.02  E-value: 1.26e-48
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  11 KWYAVHTYAGYEDTVVRNLKQRIESLGMEDVIFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTP 90
Cdd:cd09891    1 KWYVVHTYSGYENKVKENLEKRIESEGLEDYIGEVLVPTEEVVEVKNGKKKVKERKLFPGYVLVEMDMNDDTWHLVRNTP 80
                         90       100
                 ....*....|....*....|....*..
gi 818522861  91 RVTGFVGAGTTPVPLDEKEIKILFDRM 117
Cdd:cd09891   81 GVTGFVGSGGKPVPLSEEEVERILGQV 107
NGN smart00738
In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, ...
11-114 2.25e-34

In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, this domain may confer affinity for Spt4p.Spt4p


Pssm-ID: 197850 [Multi-domain]  Cd Length: 106  Bit Score: 117.09  E-value: 2.25e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861    11 KWYAVHTYAGYEDTVVRNLKQRIESLGMEDVIFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTP 90
Cdd:smart00738   1 NWYAVRTTSGQEKRVAENLERKAEALGLEDKIVSILVPTEEVKEIRRGKKKVVERKLFPGYIFVEADLEDEVWTAIRGTP 80
                           90       100
                   ....*....|....*....|....
gi 818522861    91 RVTGFVGAGTTPVPLDEKEIKILF 114
Cdd:smart00738  81 GVRGFVGGGGKPTPVPDDEIEKIL 104
NusG pfam02357
Transcription termination factor nusG;
11-113 4.36e-32

Transcription termination factor nusG;


Pssm-ID: 426736 [Multi-domain]  Cd Length: 96  Bit Score: 110.78  E-value: 4.36e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   11 KWYAVHTYAGYEDTVvrnlKQRIESLGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTP 90
Cdd:pfam02357   2 KWYVLQTYSGKEKKV----KENLERQGIE-----VFLPTEEVVEIRNGKKKVVERPLFPGYVFVRMDMTDETWHLVRSTP 72
                          90       100
                  ....*....|....*....|...
gi 818522861   91 RVTGFVGAGTTPVPLDEKEIKIL 113
Cdd:pfam02357  73 GVTGFVGGSGKPTPIPDEEVERI 95
KOW_NusG cd06091
NusG contains an NGN domain at its N-terminus and KOW motif at its C-terminus; KOW_NusG motif ...
125-180 5.99e-24

NusG contains an NGN domain at its N-terminus and KOW motif at its C-terminus; KOW_NusG motif is one of the two domains of N-Utilization Substance G (NusG) a transcription elongation and Rho-termination factor in bacteria and archaea. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The eukaryotic ortholog of NusG is Spt5 with multiple KOW motifs at its C-terminus.


Pssm-ID: 240515 [Multi-domain]  Cd Length: 56  Bit Score: 88.67  E-value: 5.99e-24
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 818522861 125 KLDVKLGEVVKITDGPFKDFEGNVSEIDEERGKIKILISMFGRETPVEIDFLQVKK 180
Cdd:cd06091    1 EVDFEVGDTVRIISGPFAGFEGKVEEIDEEKGKVKVLVEMFGRETPVELDFDQVEK 56
NusG_myco TIGR01956
NusG family protein; This model represents a family of Mycoplasma proteins orthologous to the ...
11-180 2.62e-19

NusG family protein; This model represents a family of Mycoplasma proteins orthologous to the bacterial transcription termination/antitermination factor NusG. These sequences from Mycoplasma are notably diverged (long branches in a Neighbor-joining phylogenetic tree) from the bacterial species. And although NusA and ribosomal protein S10 (NusE) appear to be present, NusB may be absent in Mycoplasmas calling into question whether these species have a functional Nus system including this family as a member.


Pssm-ID: 273895 [Multi-domain]  Cd Length: 258  Bit Score: 82.32  E-value: 2.62e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   11 KWYAVHTYAGYEDTVVRNLKQRIESLGMEDVIFNAVVPTEKKIKIQ----------------------------SGKRKV 62
Cdd:TIGR01956   1 QWYIATTINGNEDEVIENIKAKVRALGLENYISDFKILKEREIEEKvfepkngqaprsmkntattkwetldetkYKKTKI 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   63 VEEKIYPGYVLVDMVVTDNSWYVVRNTPRVTGFVGA---GTTPVPLDEKEIKILFDRMGVSEPKYKLDV----------- 128
Cdd:TIGR01956  81 SEKNKYNGYIYIKMIMTEDAWFLIRNTENVTGLVGSsgkGAKPIPISADADKLKMLKGISENTKKRVLVtntaivemeen 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  129 ----------------------------------------------KLGEVVKITDGPFKDFEGNVSEIDEERGKIKILI 162
Cdd:TIGR01956 161 kfdekcqyilkhkqvkpeaiaqvsesgeiideiveefqlvdnlskfRVGNFVKIVDGPFKGIVGKIKKIDQEKKKAIVEV 240
                         250
                  ....*....|....*...
gi 818522861  163 SMFGRETPVEIDFLQVKK 180
Cdd:TIGR01956 241 EILGKSVDVDLNFKHLKL 258
NGN cd08000
N-Utilization Substance G (NusG) N-terminal (NGN) domain Superfamily; The N-Utilization ...
11-114 2.60e-18

N-Utilization Substance G (NusG) N-terminal (NGN) domain Superfamily; The N-Utilization Substance G (NusG) and its eukaryotic homolog Spt5 are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides Ouzounis and Woese (KOW) repeats at its C-terminus in bacteria and archaea. The eukaryotic ortholog, Spt5, is a large protein composed of an acidic N-terminus, an NGN domain, and multiple KOW motifs at its C-terminus. Spt5 forms a Spt4-Spt5 complex that is an essential RNA Polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli and has a variety of functions, such as being involved in RNA polymerase elongation and Rho-termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but its functions and requirements are different. The diverse activities suggest that, after diverging from a common ancestor, NusG proteins became specialized in different bacteria.


Pssm-ID: 193574 [Multi-domain]  Cd Length: 99  Bit Score: 75.43  E-value: 2.60e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  11 KWYAVHTYAGYEDTVVRNLKQRIESlgmedVIFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTP 90
Cdd:cd08000    1 NWYVLFVKTGREEKVEKLLEKRFEA-----NDIEAFVPKKEVPERKRGKIEEVIKPLFPGYVFVETDLSPELYELIREVP 75
                         90       100
                 ....*....|....*....|....
gi 818522861  91 RVTGFVGAGTTPVPLDEKEIKILF 114
Cdd:cd08000   76 GVIGILGNGEEPSPVSDEEIEMIL 99
NGN_SP_RfaH cd09892
N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), RfaH; ...
12-110 1.07e-11

N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), RfaH; RfaH is an operon-specific virulence regulator, thought to have arisen from an early duplication of N-Utilization Substance G (NusG). Paralogs of eubacterial NusG, NusG SP (Specialized Paralog of NusG), are more diverse and often found as the first ORF in operons encoding secreted proteins and LPS biosynthesis genes. NusG SP family members are operon-specific transcriptional antitermination factors. NusG is essential in Escherichia coli and is associated with RNA polymerase elongation and Rho-termination in bacteria. In contrast, RfaH is a non-essential protein that controls expression of operons containing an ops (operon polarity suppressor) element in their transcribed DNA. RfaH and NusG are different in their response to Rho-dependent terminators and regulatory targets. The NusG N-terminal (NGN) domain is quite similar in all NusG orthologs, but its C-terminal domains and the linker that separate these two domains are different. The domain organization of NusG and its homologs suggest that the common properties of NusG and RfaH are due to their similar NGN domains.


Pssm-ID: 193581 [Multi-domain]  Cd Length: 96  Bit Score: 58.34  E-value: 1.07e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  12 WYAVHTYAGYEDTVVRNLKQRieslGmedviFNAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTPR 91
Cdd:cd09892    2 WYLLYTKPRQEERAAENLERQ----G-----FEVFLPMIRVEKRRRGKRTVVTEPLFPGYLFVRLDPEVQNWRPIRSTRG 72
                         90
                 ....*....|....*....
gi 818522861  92 VTGFVGAGTTPVPLDEKEI 110
Cdd:cd09892   73 VSRLVRFGGEPAPVPDALI 91
NGN_SP cd09886
N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP); The ...
11-110 6.64e-11

N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP); The N-Utilization Substance G (NusG) protein is involved in transcription elongation and termination. NusG is essential in Escherichia coli and is associated with RNA polymerase elongation and Rho-termination in bacteria. Paralogs of eubacterial NusG, NusG SP (Specialized Paralog of NusG), are more diverse and often found as the first ORF in operons encoding secreted proteins and LPS biosynthesis genes. NusG SP family members are operon-specific transcriptional antitermination factors. The NusG N-terminal (NGN) domain is quite similar in all NusG orthologs, but its C-terminal domains and the linker that separate these two domains are different. The domain organization of NusG and its orthologs suggest that the common properties of NusG and its orthologs and paralogs are due to their similar NGN domains.


Pssm-ID: 193575 [Multi-domain]  Cd Length: 97  Bit Score: 56.22  E-value: 6.64e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  11 KWYAVHTYAGYEDTVVRNLKQRieslGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDM-VVTDNSWYVVRNT 89
Cdd:cd09886    1 SWYALRTNPGCEQRAEEALEAR----GVE-----AFLPMLTEERKRRRKKFDVERPLFPGYVFARLdRSKGQDTSTIRAC 71
                         90       100
                 ....*....|....*....|.
gi 818522861  90 PRVTGFVGAGTTPVPLDEKEI 110
Cdd:cd09886   72 DGVLGVVGFDGRPAPVPEQEM 92
rfaH PRK09014
transcription/translation regulatory transformer protein RfaH;
54-181 4.30e-10

transcription/translation regulatory transformer protein RfaH;


Pssm-ID: 181611 [Multi-domain]  Cd Length: 162  Bit Score: 55.67  E-value: 4.30e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  54 KIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNTPRVTGFVGAGTTP--VPLDekeikILFDRMGVSEPKYKLDV--K 129
Cdd:PRK09014  37 KIVRGKRTEVSEPLFPNYLFVEFDPEVIHTTTIRSTRGVSHFVRFGAQPaiVPSD-----VIYQLSVYKPEKIVDPEtpK 111
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 818522861 130 LGEVVKITDGPFKDFEGNVSEIDEERGKIkILISMFGRETPVEIDFLQVKKI 181
Cdd:PRK09014 112 PGDKVIITEGAFEGLQAIYTEPDGEARSI-LLLNLLNKQVKHSVDNTQFRKI 162
NGN_SP_TaA cd09893
N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), TaA; ...
12-110 1.03e-09

N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), TaA; The N-Utilization Substance G (NusG) protein is involved in transcription elongation and termination. NusG is essential in Escherichia coli and is associated with RNA polymerase elongation and Rho-termination in bacteria. Paralogs of eubacterial NusG, NusG SP (Specialized Paralog of NusG), are more diverse and often found as the first ORF in operons encoding secreted proteins and LPS biosynthesis genes. NusG SP family members are operon-specific transcriptional antiterminationn factors. TaA is a NusG SP factor that is required for synthesis of a polyketide antibiotic TA in Myxococcus xanthus. Orthologs of the NusG gene exist in all bacteria, but its functions and requirements are different. The NusG N-terminal (NGN) domain is quite similar in all NusG orthologs, but its C-terminal domains and the linker that separate these two domains are different. The domain organization of NusG and its orthologs suggest that the common properties of NusG and its orthologs and paralogs are due to their similar NGN domains.


Pssm-ID: 193582 [Multi-domain]  Cd Length: 95  Bit Score: 53.08  E-value: 1.03e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  12 WYAVHTYAGYEDTVVRNLKQR-IESLgmedvifnavVPTEKKIKIQSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRnTP 90
Cdd:cd09893    2 WYALYTRSRHEKKVADRLAKKgIESF----------LPLYEVLSRWKDRKKKIKVPLFPGYLFVRFQLDPERLRILK-TP 70
                         90       100
                 ....*....|....*....|
gi 818522861  91 RVTGFVGAGTTPVPLDEKEI 110
Cdd:cd09893   71 GVVRIVGNSGGPIPIPDEEI 90
NGN_Bact_2 cd09889
Bacterial N-Utilization Substance G (NusG) N-terminal (NGN) domain, subgroup 2; The ...
11-113 2.88e-09

Bacterial N-Utilization Substance G (NusG) N-terminal (NGN) domain, subgroup 2; The N-Utilization Substance G (NusG) protein is involved in transcription elongation and termination. NusG is essential in Escherichia coli and associates with RNA polymerase elongation and Rho-termination. Paralogs of eubacterial NusG, NusG SP (Specialized Paralog of NusG), are more diverse and often found as the first ORF in operons encoding secreted proteins and LPS biosynthesis genes. NusG SP family members are operon-specific transcriptional antitermination factors. The NusG N-terminal domain (NGN) is quite similar in all NusG orthologs, but its C-terminal domain and the linker that separates these two domains are different. The domain organization of NusG and its orthologs suggests that the common properties of NusG and its orthologs and paralogs are due to their similar NGN domains.


Pssm-ID: 193578 [Multi-domain]  Cd Length: 100  Bit Score: 51.94  E-value: 2.88e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  11 KWYAVHTYAGYEDTVVrnlkQRIESLGMEDVIFNAVVPT-EKKIKIQsGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNT 89
Cdd:cd09889    1 MWYVVQVRTGREKAVL----ELLEKLVGPDVLQECFIPQyERKKRSQ-GVWRERKYTLFPGYVFVVTDDIDELYYELKRV 75
                         90       100
                 ....*....|....*....|....
gi 818522861  90 PRVTGFVGAGTTPVPLDEKEIKIL 113
Cdd:cd09889   76 PGFTRLLGNDGSFFPLTPEEADFI 99
NGN_plant cd09890
Plant N-Utilization Substance G (NusG) N-terminal (NGN) domain; The N-Utilization Substance G ...
12-110 5.11e-09

Plant N-Utilization Substance G (NusG) N-terminal (NGN) domain; The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. NusG contains a NGN domain at its N-terminus and Kyrpides Ouzounis and Woese (KOW) repeats at its C-terminus in bacteria and archaea. The eukaryotic ortholog, Spt5, is a large protein comprising an acidic N-terminus, an NGN domain, and multiple KOW motifs at its C-terminus. Spt5 forms an Spt4-Spt5 complex that is an essential RNA polymerase II elongation factor. The bacterial infected plants contain bacterial DNA, such as NGN sequences, that can be used to clone the DNA of uncultured organisms.


Pssm-ID: 193579  Cd Length: 113  Bit Score: 51.58  E-value: 5.11e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  12 WYAVHTYAGYEDTVVRNLKQRIESL--GMEDVIFNAVVPTEKKIKiqSGKRKVVEEKIYPGYVLVDMVVTDNSWYVVRNT 89
Cdd:cd09890    2 WYMLRVPAGRENQAAEALERALATEfpDREFEVWVPSIPVDRKLK--NGSISVKEKPLFPGYVLLRCVLNKEVYDFIRDN 79
                         90       100
                 ....*....|....*....|.
gi 818522861  90 PRVTGFVGAGTTPVPLDEKEI 110
Cdd:cd09890   80 DSVYGFVGSKVGKTGKRQIEI 100
antiterm_UpxY NF033644
UpxY family transcription antiterminator; The UpxY family of NusG-related transcription ...
12-162 4.16e-08

UpxY family transcription antiterminator; The UpxY family of NusG-related transcription antiterminators was described originally from a paralogous family of eight members from Bacteriodes fragilis, UpaY to UphY, each of which was associated with a distinct capsular polysaccharide biosynthesis locus. There is no UpxY protein per se.


Pssm-ID: 468125 [Multi-domain]  Cd Length: 162  Bit Score: 50.16  E-value: 4.16e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  12 WYAVHTYAGYEDTVvrnlKQRIESLGMEdvifnAVVPTEKKIKIQSGKRKVVEEKIYPGYVLVDmVVTDNSWYVVRNTPR 91
Cdd:NF033644   1 WYALYTRPRREKKV----AELLEKKGIE-----SFLPMQKEIRQWSDRKKRVEVPLIPNLVFVH-ITEKELDEVLEQTPG 70
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 818522861  92 VTGFVGA-GTTPVPLD--EKEIKILfdRMGVSEPK-----YKLDVKLGEVVKITDGPFKDFEGNVSEIdeeRGKIKILI 162
Cdd:NF033644  71 VVRYIRDdRGKSKPAIipDKQMERF--RLMLDPSDevvvyLEAPLKKGDKVRVIGGPLKGLEGELVRV---KGKKRVVV 144
nusG PRK08559
transcription antitermination protein NusG; Validated
7-158 3.10e-05

transcription antitermination protein NusG; Validated


Pssm-ID: 181467 [Multi-domain]  Cd Length: 153  Bit Score: 42.16  E-value: 3.10e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   7 QVGRKWYAVHTYAGYEDTVVRNLKQRIESLGMEdvIFNAVVPTEKKikiqsgkrkvveekiypGYVLVDMVVTDNSWYVV 86
Cdd:PRK08559   3 PEMSMIFAVKTTAGQERNVALMLAMRAKKENLP--IYAILAPPELK-----------------GYVLVEAESKGAVEEAI 63
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 818522861  87 RNTPRVTGFVgAGTTPVpldeKEIKILfdrmgVSEPKYKLDVKLGEVVKITDGPFKDFEGNVSEIDEERGKI 158
Cdd:PRK08559  64 RGIPHVRGVV-PGEISF----EEVEHF-----LKPKPIVEGIKEGDIVELIAGPFKGEKARVVRVDESKEEV 125
KOW_elon_Spt5 TIGR00405
transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial ...
13-181 1.93e-04

transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial NusG and the uL24 (previously L24p/L26e) family of ribosomal proteins. The most recent papers and crystal structures make this a transcription elongation factor rather than a ribosomal protein.


Pssm-ID: 129499 [Multi-domain]  Cd Length: 145  Bit Score: 39.87  E-value: 1.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   13 YAVHTYAGYEDTVVRNLKQRIESLGMEdvIFNAVVPTEKKikiqsgkrkvveekiypGYVLVDMVVTDNSWYVVRNTPRV 92
Cdd:TIGR00405   1 FAVKTSVGQEKNVARLMARKARKSGLE--VYSILAPESLK-----------------GYILVEAETKIDMRNPIIGVPHV 61
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861   93 TGFVgagttPVPLDEKEIKILFDRMGVSEpkyklDVKLGEVVKITDGPFKDFEGNVSEIDEERGKIKILISMFGRETPVE 172
Cdd:TIGR00405  62 RGVV-----EGEIDFEEIERFLTPKKIIE-----SIKKGDIVEIISGPFKGERAKVIRVDESKEEVTLELIEAAVPIPVT 131

                  ....*....
gi 818522861  173 IDFLQVKKI 181
Cdd:TIGR00405 132 VKGDQVRII 140
NGN_SP_UpxY cd09895
N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), UpxY; ...
11-74 6.01e-04

N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), UpxY; The N-Utilization Substance G (NusG) proteins are involved in transcription elongation and termination. NusG is essential in Escherichia coli and is associated with RNA polymerase elongation and Rho-termination. Paralogs of eubacterial NusG, NusG SP (Specialized Paralog of NusG), are more diverse and often found as the first ORF in operons encoding secreted proteins and LPS (lipopolysaccharide) biosynthesis genes. NusG SP family members are operon-specific transcriptional antitermination factors. UpxY proteins, UpxY proteins, where the x is replaced by the letter designation of the specific polysaccharide (UpaY to UphY), are a family of NusG SP factors that act specifically in transcriptional antitermination of operons from which they are encoded. UpxYs are necessary and specific for transcription regulation of the polysaccharide biosynthesis operon. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements are different. The NusG N-terminal (NGN) domain is similar in all NusG orthologs, but its C-terminal domain and the linker that separate these two domains are different. The domain organization of NusG and its orthologs suggests that the common properties of NusG and its orthologs and paralogs are due to their similar NGN domains.


Pssm-ID: 193584 [Multi-domain]  Cd Length: 95  Bit Score: 37.56  E-value: 6.01e-04
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 818522861  11 KWYAVHTYAGYEDTVVRNLKQR-IESLgmedvifnavVPTEKKIKIQSGKRKVVEEKIYPGYVLV 74
Cdd:cd09895    1 PWYALYTFPRREKKVAEYLEKKgIECF----------LPMQYEVRQWSGRKKRVEVPLFPNLVFV 55
KOW smart00739
KOW (Kyprides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
127-154 3.68e-03

KOW (Kyprides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 33.46  E-value: 3.68e-03
                           10        20
                   ....*....|....*....|....*...
gi 818522861   127 DVKLGEVVKITDGPFKDFEGNVSEIDEE 154
Cdd:smart00739   1 KFEVGDTVRVIAGPFKGKVGKVLEVDGE 28
NGN_SP_AnfA1 cd09894
N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), AnFA1; ...
10-110 7.33e-03

N-Utilization Substance G (NusG) N-terminal domain in the NusG Specialized Paralog (SP), AnFA1; Regulation of the afp, antifeeding prophage, gene cluster is mediated by AnFA1, a RfaH-like transcriptional antiterminator. RfaH is an operon-specific virulence regulator, thought to arisen from an early duplication of N-Utilization Substance G (NusG). NusG is essential in Escherichia coli and is associated with RNA polymerase elongation and Rho-termination in bacteria. Paralogs of eubacterial NusG, NusG SP (Specialized Paralog of NusG), are more diverse and often found as the first ORF in operons encoding secreted proteins and LPS biosynthesis genes. NusG SP family members are operon-specific transcriptional antitermination factors. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements are different. The NusG N-terminal domain (NGN) is similar in all NusG orthologs, but its C-terminal domain and the linker that separate these two domains are different. The domain organization of NusG and its orthologs suggests that the common properties of NusG and its orthologs and paralogs are due to their similar NGN domains.


Pssm-ID: 193583 [Multi-domain]  Cd Length: 99  Bit Score: 34.57  E-value: 7.33e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 818522861  10 RKWY-AVHTYagyedTVVRNLKQRIESLGMEdvIFNAVVPTEKKIKIQSGKRkVVEEKIYPGYVLVDMVVTDNSWYVVRN 88
Cdd:cd09894    1 KRWYlLRCKS-----GKIQSVIFSLERLGVE--VFCPMIRTRRKRTDCKSYR-EKIEPLFPGYLFVRFDPEVVHTSKITL 72
                         90       100
                 ....*....|....*....|..
gi 818522861  89 TPRVTGFVGAGTTPVPLDEKEI 110
Cdd:cd09894   73 ASGVSGFVRFGGEPCPVPDAVI 94
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
131-160 7.79e-03

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 32.74  E-value: 7.79e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 818522861  131 GEVVKITDGPFKDFEGNVSEIDEERGKIKI 160
Cdd:pfam00467   2 GDVVRVIAGPFKGKVGKVVEVDDKKNRVLV 31
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH