Conserved domains on  [gi|486615938|ref|WP_001618775|]

RNA-guided endonuclease TnpB family protein [Escherichia coli]

Protein Classification

RNA-guided endonuclease InsQ/TnpB family protein (domain architecture ID 11430747)

RNA-guided endonuclease InsQ/TnpB family proteins include the RNA-guided endonuclease TnpB from IS200/IS605 family elements and IS607 family elements. This protein is homologous to some CRISPR-associated (Cas) proteins, such as the type V CRISPR-associated protein C2c8. TnpB proteins were long described as accessory proteins in IS (insertion sequence) elements: one of only one or two proteins encoded in the element, but not necessary for transposition. The programmable RNA-guided endonuclease TnpB proteins may provide a CRISPR-like, widespread form of phage defense by RNA-guided DNA degradation.

List of domain hits

Name  Accession  Description  Interval  E-value
InsQ  COG0675  Transposase [Mobilome: prophages, transposons]  4-304  5.38e-70

Pssm-ID: 440439 [Multi-domain]  Cd Length: 381  Bit Score: 221.69  E-value: 5.38e-70
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938   4 KQAFKFLLEPNKTHMNDFLVFAGSCRFVYNKGLAFINENYDSGKKFLNYNQLASELVNWKNEecLAWLKMAPSQCLQQSL 83
Cdd:COG0675    2 LRTYKFRLYPTKEQEELLERTLGCCRFVYNYALAERRQAYKETGKSLSYYELQKLLTELKKE--YPWLKELPSQVLQQAL 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938  84 RDLDRAFKNFFS-----GKSQYPRFKKKGRNDSFRVPCQRVRLDQEKhlVSLPKLGWVKYRKSREI--TGVLKNVTISRK 156
Cdd:COG0675   80 KRLDEAFKSFFKrkkkgKKAGFPRFKKKGRYRSFTYPQSGFKLKDGR--LKLPKIGWVKIRLHRPLpdDGKIKSVTISRK 157
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938 157 L-DKWYISFNTE-EVVPEPLHPSfsktKIL-----LNNewlmqLTACeSLVEQFANM----EGNKKLRNLNNILGRKVKY 225
Cdd:COG0675  158 AaGKWYVSFVVEvEDVPELPPTG----KVVgidlgLKN-----FATL-SDGEKIDNPkflkKAERKLAKLQRRLSRKKKG 227
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938 226 SSNWLKTKKKIDGVKARSSRRRLDALHKITTAICKKHAIV-----ELVNLTDSlpDKNNGSVSM--TYEFVRQLMYKQEW 298
Cdd:COG0675  228 SKNRRKARKKLAKLHEKIANQRKDFLHKLARKLVKEADVIvvedlNVKGMKKN--KKLNKSISDagWGEFRRQLEYKAEK 305

                 ....*.
gi 486615938 299 LGGKVI 304
Cdd:COG0675  306 YGIKVV 311
 
IS200_TnpB  NF038281  IS200/IS605 family element RNA-guided endonuclease TnpB  6-307  9.44e-56

Pssm-ID: 468448 [Multi-domain]  Cd Length: 359  Bit Score: 184.23  E-value: 9.44e-56
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938   6 AFKFLLEPNKTHMNDFLVFAGSCRFVYNKGLAFINENYDSGKKFLNYNQLASELVNWKNEEclAWLKMAPSQCLQQSLRD 85
Cdd:NF038281   3 AYKFRIYPNKEQEILINKTIGCSRFVYNHFLAKWNEAYEETGKGLSYNACSKQLTQLKKEE--EWLKEVDSIALQNSLKN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938  86 LDRAFKNFFSGKSQYPRFK-KKGRNDSFRVPCQRVRLDQEKHLVSLPKLGWVKYRKSREITGVLKNVTISRK-LDKWYIS 163
Cdd:NF038281  81 LDDAFKRFFKKQNGFPRFKsKKNPVQSYTTKNTNGNIAIVGNKIKLPKLGWVKFAKSREVEGRILSATVRRNpSGKYFVS 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938 164 FNTEEVVPEPLHPSFS-------KTKILLNNEwlmqltaceslvEQFAN------MEgnKKLRNLNNILGRKVKYSSNWL 230
Cdd:NF038281 161 ILVETEVQLLPKTNSAvgidlglKDFAILSDG------------GKIENpkylrkLE--KKLAKLQRILSRRKKGSSNWQ 226
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938 231 KTKKKIDGVKARSSRRRLDALHKITTAICKKHAIVELVNLTDSLPDKNN------GSVSMtYEFVRQLMYKQEWLGGKVI 304
Cdd:NF038281 227 KQRIKVARLHEKIANQRKDFLHKLSTRLIKENQVICIEDLQVKNMLKNHklaksiSDVSW-SEFRTMLEYKAKWYGRTVV 305

                 ...
gi 486615938 305 RLG 307
Cdd:NF038281 306 KVG 308
guided_TnpB  NF040570  RNA-guided endonuclease TnpB family protein  7-305  3.83e-44

RNA-guided endonuclease TnpB family protein. This family includes RNA-guided endonuclease TnpB from IS200/IS605 family elements (NF038281) and IS607 family elements (NF038280), but also many additional proteins. It exhibits homology to, or actually includes, some CRISPR-associated (Cas) proteins such as the type V CRISPR-associated protein C2c8. For a long time, TnpB proteins were described as accessory proteins in IS (insertion sequence) elements, present as one of just one or two proteins encoded in the element but not necessary for transposition. The programmable RNA-guided endonuclease TnpB proteins may provide a CRISPR-like, widespread form of phage defense by RNA-guided DNA degradation.

Pssm-ID: 468544 [Multi-domain]  Cd Length: 384  Bit Score: 154.62  E-value: 3.83e-44
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938   7 FKFLLEPNKTHMNDFLVFAGSCRFVYNKGLAFINENYDSGKKFL-NYNQLASELVNWKNEECLAWLKMAPSQCLQQSLRD 85
Cdd:NF040570   1 YKYRLYPTKEQKRELAELFGAARFLYNAALAERKEAYEKNGKFLsYKALLKKLLTELKKEKELEWLKELSSQALQQALKR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938  86 LDRAFKNFFSGKSQ--YPRFKKKGRNDSFRVPCQRVRLD-------QEKHLVSLPKLGWVKYRKSREI-------TGVLK 149
Cdd:NF040570  81 LAKAFKNFFKKLKKagFPRFKSKKKKVPSYTPQSVNKRLrkkrnrkKKNGRLKLPKLGGVKLRLSRILpilldgkGGKIK 160
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938 150 NVTISR-KLDKWYISFNTEEVVPEPLHPSFSKTK--ILLNNEWLMQLTACESLVEQFANMEG-----NKKLRNLNNILGR 221
Cdd:NF040570 161 SVTISKpKKGKYYVSISVEVEVPEPPPKEVTGKVvgIDLGLKNFATLSDGGEKIENPRFLRKkekrlRRLQRKLSRKLQR 240
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 486615938 222 KVKYSSNWLKTKKKIDGVKARSSRRRLDALHKITTAICKKHAI--VELVNLTD------SLPDKNNGSVS--MTYEFVRQ 291
Cdd:NF040570 241 KGKGSSNRKKARKKVARLHRKIANQRKDFLHKLSKRLVKEADAnnVVVEDLEVkgmvknKKKKKLAKSIHdwAFGQLRRM 320
                        330
                 ....*....|....
gi 486615938 292 LMYKQEWLGGKVIR 305
Cdd:NF040570 321 LEYKAEWYGIKVVK 334
HTH_OrfB_IS605  pfam12323  Helix-turn-helix domain  1-48  2.42e-08

Helix-turn-helix domain. This is the N-terminal helix-turn-helix domain of Transposase_2 (pfam01385).

Pssm-ID: 432479 [Multi-domain]  Cd Length: 47  Bit Score: 49.49  E-value: 2.42e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 486615938    1 MIKKqAFKFLLEPNKTHMNDFLVFAGSCRFVYNKGLAFINENYDSGKK 48
Cdd:pfam12323   1 MVLK-AYKYRLYPTPEQEELLARTFGCARFVYNKALAERKEAYKEGGK 47
 
OrfB_IS605  pfam01385  Probable transposase  209-262  3.71e-06

Probable transposase. This family includes IS891, IS1136 and IS1341. DUF1225 (pfam06774) has now been merged into this family.

Pssm-ID: 396108 [Multi-domain]  Cd Length: 120  Bit Score: 45.37  E-value: 3.71e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 486615938  209 NKKLRNLNNILGRKVKYSSNWLKTKKKIDGVKARSSRRRLDALHKITTAICKKH 262
Cdd:pfam01385  50 YKYLAKRIARLQRKLKGSNNRKKASRKLARLHRKRSRRRKDFLHKLVRRLIEEL 103
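
The pairwise alignment blocks above can be summarized numerically. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical residues over the aligned columns of one query/model row pair. The function name `aligned_identity` is hypothetical, and the sketch assumes that lowercase letters mark residues left unaligned and that `-` marks a gap, so both are skipped. The example rows are the pfam12323 (HTH_OrfB_IS605) alignment copied from the hit list.

```python
def aligned_identity(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows.

    Assumption: a column counts as aligned only when both rows show an
    uppercase residue; gaps ('-') and lowercase residues are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if not (q.isupper() and m.isupper()):
            continue  # gap or (assumed) unaligned residue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Rows copied from the pfam12323 alignment (query residues 1-48).
QUERY = "MIKKqAFKFLLEPNKTHMNDFLVFAGSCRFVYNKGLAFINENYDSGKK"
MODEL = "MVLK-AYKYRLYPTPEQEELLARTFGCARFVYNKALAERKEAYKEGGK"

identical, aligned = aligned_identity(QUERY, MODEL)
print(f"{identical}/{aligned} identical aligned columns ({identical / aligned:.1%})")
```

For multi-block alignments such as COG0675, the same function could be applied to the concatenated rows of each block, since each block is column-aligned on its own.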
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
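
As a rough illustration of how the E-value threshold relates to the hit list, the sketch below records the five hits reported on this page and keeps those at or below a threshold, sorted from most to least significant. The `DomainHit` class and `passing_hits` function are hypothetical names introduced here, and whether CD-Search treats the threshold as inclusive is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    name: str
    accession: str
    start: int      # first query residue of the hit interval
    end: int        # last query residue of the hit interval
    e_value: float


# Values copied from the list of domain hits on this page.
HITS = [
    DomainHit("InsQ", "COG0675", 4, 304, 5.38e-70),
    DomainHit("IS200_TnpB", "NF038281", 6, 307, 9.44e-56),
    DomainHit("guided_TnpB", "NF040570", 7, 305, 3.83e-44),
    DomainHit("HTH_OrfB_IS605", "pfam12323", 1, 48, 2.42e-08),
    DomainHit("OrfB_IS605", "pfam01385", 209, 262, 3.71e-06),
]


def passing_hits(hits, threshold=0.01):
    """Return hits with E-value <= threshold (inclusive by assumption),
    ordered by ascending E-value."""
    kept = [h for h in hits if h.e_value <= threshold]
    return sorted(kept, key=lambda h: h.e_value)


for hit in passing_hits(HITS):
    span = hit.end - hit.start + 1  # number of query residues covered
    print(f"{hit.accession:10s} {hit.start}-{hit.end} ({span} aa)  E={hit.e_value:.2e}")
```

All five hits fall below the 0.01 threshold; tightening the threshold (for example to 1e-10) would leave only the three multi-domain family models.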
