Conserved domains on  [gi|1632590681|ref|WP_136759000|]

RNA-guided endonuclease TnpB family protein [Escherichia coli]

Protein Classification

RNA-guided endonuclease InsQ/TnpB family protein (domain architecture ID 11430747)

RNA-guided endonuclease InsQ/TnpB family protein, such as the RNA-guided endonuclease TnpB from IS200/IS605 family elements and IS607 family elements. This protein is homologous to some CRISPR-associated (Cas) proteins, such as the type V CRISPR-associated protein C2c8. TnpB proteins were described as accessory proteins in IS (insertion sequence) elements: each is one of just one or two proteins encoded in the element, yet it is not necessary for transposition. The programmable RNA-guided endonuclease TnpB proteins may provide a CRISPR-like, widespread form of phage defense by RNA-guided DNA degradation.

List of domain hits

Name | Accession | Description | Interval | E-value
InsQ | COG0675 | Transposase [Mobilome: prophages, transposons] | 1-373 | 5.71e-145

Pssm-ID: 440439 [Multi-domain]  Cd Length: 381  Bit Score: 416.22  E-value: 5.71e-145
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681   1 MLRATKVRIYPTPEQAEYLYAQFGAVRFAYNKALHIKKHAYQRHGVNLSpRNDLKPLLSVAKKsrRYAWLKEFDSMALQQ 80
Cdd:COG0675     1 MLRTYKFRLYPTKEQEELLERTLGCCRFVYNYALAERRQAYKETGKSLS-YYELQKLLTELKK--EYPWLKELPSQVLQQ 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681  81 AVINLDVAFSNFFNPK---RKARFPTFKRKHGKQS-SYHCVGVKVLDNAIKIPKLSPIEARLHRELH--GKLKSITITRS 154
Cdd:COG0675    78 ALKRLDEAFKSFFKRKkkgKKAGFPRFKKKGRYRSfTYPQSGFKLKDGRLKLPKIGWVKIRLHRPLPddGKIKSVTISRK 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 155 ATGKYYASILCDDGLEAPVKPTliSTVTGLDMGLEHYAIRSDGAKIANPRHLINASRNLRRKQKALSRKQKGSANRKKAR 234
Cdd:COG0675   158 AAGKWYVSFVVEVEDVPELPPT--GKVVGIDLGLKNFATLSDGEKIDNPKFLKKAERKLAKLQRRLSRKKKGSKNRRKAR 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 235 IRLAALHERVANARADFQHKLSRTIVDENQAVIVETLKTANMMKNHNLARVIGDAGWHSFITSLEYKAAEKGAHLVKLDQ 314
Cdd:COG0675   236 KKLAKLHEKIANQRKDFLHKLARKLVKEADVIVVEDLNVKGMKKNKKLNKSISDAGWGEFRRQLEYKAEKYGIKVVEVDP 315
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1632590681 315 WFaSSKTCHCCGYKMSEMPLHKRIWRCPECGIEHDRDINAALNIRQKGILELKAAGLVV 373
Cdd:COG0675   316 AY-TSQTCSSCGHVVKKLRLSVRTFVCPKCGTVHDRDVNAAINILRRGLRQLGLAGHSG 373
 
IS200_TnpB | NF038281 | IS200/IS605 family element RNA-guided endonuclease TnpB | 2-363 | 9.65e-143

Pssm-ID: 468448 [Multi-domain]  Cd Length: 359  Bit Score: 409.57  E-value: 9.65e-143
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681   2 LRATKVRIYPTPEQAEYLYAQFGAVRFAYNKALHIKKHAYQRHGVNLSpRNDLKPLLSVAKKsrRYAWLKEFDSMALQQA 81
Cdd:NF038281    1 HKAYKFRIYPNKEQEILINKTIGCSRFVYNHFLAKWNEAYEETGKGLS-YNACSKQLTQLKK--EEEWLKEVDSIALQNS 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681  82 VINLDVAFSNFFnpKRKARFPTFKRKHGKQSSYHCV----GVKVLDNAIKIPKLSPIEARLHRELHGKLKSITITRSATG 157
Cdd:NF038281   78 LKNLDDAFKRFF--KKQNGFPRFKSKKNPVQSYTTKntngNIAIVGNKIKLPKLGWVKFAKSREVEGRILSATVRRNPSG 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 158 KYYASILCDDGLEAPVKptlISTVTGLDMGLEHYAIRSDGAKIANPRHLINASRNLRRKQKALSRKQKGSANRKKARIRL 237
Cdd:NF038281  156 KYFVSILVETEVQLLPK---TNSAVGIDLGLKDFAILSDGGKIENPKYLRKLEKKLAKLQRILSRRKKGSSNWQKQRIKV 232
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 238 AALHERVANARADFQHKLSRTIVDENQAVIVETLKTANMMKNHNLARVIGDAGWHSFITSLEYKAAEKGAHLVKLDQWFA 317
Cdd:NF038281  233 ARLHEKIANQRKDFLHKLSTRLIKENQVICIEDLQVKNMLKNHKLAKSISDVSWSEFRTMLEYKAKWYGRTVVKVGKFFP 312
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*..
gi 1632590681 318 SSKTCHCCGYKMSEMP-LHKRIWRCPECGIEHDRDINAALNIRQKGI 363
Cdd:NF038281  313 SSQLCSCCGYKNKEVKnLALREWTCPSCGTHHDRDINASKNILNEGL 359
guided_TnpB | NF040570 | RNA-guided endonuclease TnpB family protein | 6-361 | 1.49e-109

This family includes RNA-guided endonuclease TnpB from IS200/IS605 family elements (NF038281) and IS607 family elements (NF038280), but also many additional proteins. It exhibits homology to, or actually includes, some CRISPR-associated (Cas) proteins such as the type V CRISPR-associated protein C2c8. For a long time, TnpB proteins were described as accessory proteins in IS (insertion sequence) elements, present as one of just one or two proteins encoded in the element but not necessary for transposition. The programmable RNA-guided endonuclease TnpB proteins may provide a CRISPR-like, widespread form of phage defense by RNA-guided DNA degradation.

Pssm-ID: 468544 [Multi-domain]  Cd Length: 384  Bit Score: 326.42  E-value: 1.49e-109
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681   6 KVRIYPTPEQAEYLYAQFGAVRFAYNKALHIKKHAYQRHGVNLSPRNDLKPLLSVAKKSRRYAWLKEFDSMALQQAVINL 85
Cdd:NF040570    2 KYRLYPTKEQKRELAELFGAARFLYNAALAERKEAYEKNGKFLSYKALLKKLLTELKKEKELEWLKELSSQALQQALKRL 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681  86 DVAFSNFFNPKRKARFPTFKRKHGKQSSYH----------CVGVKVLDNAIKIPKLSPIEARLHRELH-------GKLKS 148
Cdd:NF040570   82 AKAFKNFFKKLKKAGFPRFKSKKKKVPSYTpqsvnkrlrkKRNRKKKNGRLKLPKLGGVKLRLSRILPilldgkgGKIKS 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 149 ITITRSATGKYYASILCDDGLEAPVKPTLISTVTGLDMGLEHYAIRSDGA-KIANPRHLINASRNLRRKQKALSRK---- 223
Cdd:NF040570  162 VTISKPKKGKYYVSISVEVEVPEPPPKEVTGKVVGIDLGLKNFATLSDGGeKIENPRFLRKKEKRLRRLQRKLSRKlqrk 241
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 224 QKGSANRKKARIRLAALHERVANARADFQHKLSRTIVDENQA--VIVETLKTANMMKN---HNLARVIGDAGWHSFITSL 298
Cdd:NF040570  242 GKGSSNRKKARKKVARLHRKIANQRKDFLHKLSKRLVKEADAnnVVVEDLEVKGMVKNkkkKKLAKSIHDWAFGQLRRML 321
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1632590681 299 EYKAAEKGAHLVKLDQWFASSKTCHCCGYKMSEMPLHKRIWRCPECGIEHDRDINAALNIRQK 361
Cdd:NF040570  322 EYKAEWYGIKVVKVDPAYTSSQCCSCGGHRKEKLLLSCREWTCPECGYTVHRDINAAINILRR 384
IS607_TnpB | NF038280 | IS607 family element RNA-guided endonuclease TnpB | 1-359 | 3.16e-53

Pssm-ID: 468447 [Multi-domain]  Cd Length: 431  Bit Score: 182.19  E-value: 3.16e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681   1 MLRATKVRIYPTPEQAEYLYAQFGAVRFAYNKAL-HIKKHAYQRHGVNLSPrnDLKPLLSVAKKSRRYA------WLKEF 73
Cdd:NF038280    2 VVQAYRFALDPTPAQARALRSHFGARRKAFNWGLaRVKADLDARAAEPLTE--SVKWSLRSLRKAWNTAkdevapWWAEN 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681  74 DSMALQQAVINLDVAFSNFF---NPKRKAR---FPTFKRKHGKQSSYH----CVGVKVLDNAIKIPKLSPIEA-----RL 138
Cdd:NF038280   80 SKEAYSDGLAGLARALWNWQasrAGTRAGRrvgFPRFKSKRRDADRVRfttgAMRVEPDRRHVTLPVIGTVRThentrRL 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 139 HREL---HGKLKSITITRSAtGKYYASILCDDGLEAPVKPTLISTVTGLDMGLEHYAIRSDG-----AKIANPRHLINAS 210
Cdd:NF038280  160 ARHIeagRARILAATVRRNG-GRLFVSVRVEVQRPQQRAPARPDSRVGVDLGVRRLATVATAtgeviERVPNPRPLEAAL 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 211 RNLRRKQKALSRKQKGSANRKKARIRLAALHERVANARADFQHKLSRTIVDENQAVIVETLKTANMMKN---HNLARVIG 287
Cdd:NF038280  239 RALRRLSRALSRRTPGSRRWRKATAELSRLHRRVADLRRDHLHKLTTRLARTHGTIVVEDLDVAGMLRNpgaRALRRGVS 318
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1632590681 288 DAGWHSFITSLEYKAAEKGAHLVKLDQWFASSKTCHCCGYkMSEMPLHKRIWRCPECGIEHDRDINAALNIR 359
Cdd:NF038280  319 DAAMGEIRRQLSYKTGWYGSRLVVADRWFPSSKTCHGCGH-VKDKILWDRHWQCDACGLVHDRDDNAARNLA 389
OrfB_Zn_ribbon | pfam07282 | Putative transposase DNA-binding domain | 291-361 | 6.99e-24

This putative domain is found at the C-terminus of a large number of transposase proteins. This domain contains four conserved cysteines suggestive of a zinc binding domain. Given the need for transposases to bind DNA as well as the large number of DNA-binding zinc fingers we hypothesize this domain is DNA-binding.

Pssm-ID: 284650 [Multi-domain]  Cd Length: 69  Bit Score: 93.82  E-value: 6.99e-24
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1632590681 291 WHSFITSLEYKAAEKGAHLVKLDQWFaSSKTCHCCGYKMSEmPLHKRIWRCPECGIEHDRDINAALNIRQK 361
Cdd:pfam07282   1 FRKFIEQLEYKAKEYGIKVVEVDPAY-TSKTCSVCGHKNKE-SLSGRTFVCPNCGFVADRDVNAAINILKR 69
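
The pfam07282 description mentions four conserved cysteines. As a rough illustration, the sketch below lists the cysteines in query residues 291-361 (copied from the alignment row above, with the lowercase unaligned residues uppercased) and flags C-x-x-C spacings. The C-x-x-C pattern is an assumption chosen for illustration; the page does not say which spacing or which cysteines are the conserved ones, and the function name is hypothetical.

```python
import re

# Query residues 291-361 as printed in the pfam07282 alignment row.
QUERY_REGION = (
    "WHSFITSLEYKAAEKGAHLVKLDQWFASSKTCHCCGYKMSEMPLHKRIWRCPECGIEHDRDINAALNIRQK"
)
REGION_START = 291  # residue number of the first character above


def cxxc_motifs(seq, offset):
    """Return (first_cys, last_cys) residue numbers for each C-x-x-C match.

    A lookahead is used so that overlapping matches are all reported.
    The C-x-x-C spacing is an illustrative assumption, not a CDD annotation.
    """
    return [
        (m.start() + offset, m.start() + offset + 3)
        for m in re.finditer(r"(?=C..C)", seq)
    ]


# Residue numbers of every cysteine in the region.
cysteines = [i + REGION_START for i, aa in enumerate(QUERY_REGION) if aa == "C"]
motifs = cxxc_motifs(QUERY_REGION, REGION_START)

print("cysteines:", cysteines)
print("C-x-x-C pairs:", motifs)
```

This only inspects the query sequence; deciding which residues are actually conserved would require the family alignment itself.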
OrfB_IS605 | pfam01385 | Probable transposase | 169-278 | 1.56e-23

This family includes IS891, IS1136 and IS1341. DUF1225, pfam06774, has now been merged into this family.

Pssm-ID: 396108 [Multi-domain]  Cd Length: 120  Bit Score: 94.29  E-value: 1.56e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 169 LEAPVKPTLISTVTGLDMGLEHYAIRSDG---AKIANPRHLINASRNLRRKQKALSRKQKGSANRKKARIRLAALHERVA 245
Cdd:pfam01385   6 VEDPPPVAEPNKAAGIDLGINNLATVSTEdgdWFLFNPRRLKSDYKYLAKRIARLQRKLKGSNNRKKASRKLARLHRKRS 85
                          90       100       110
                  ....*....|....*....|....*....|...
gi 1632590681 246 NARADFQHKLSRTIVDENQAVIVETLKTANMMK 278
Cdd:pfam01385  86 RRRKDFLHKLVRRLIEELDEVGVEDLNVGGMKD 118
HTH_OrfB_IS605 | pfam12323 | Helix-turn-helix domain | 1-45 | 3.43e-16

This is the N-terminal helix-turn-helix domain of Transposase_2 (pfam01385).

Pssm-ID: 432479 [Multi-domain]  Cd Length: 47  Bit Score: 71.83  E-value: 3.43e-16
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 1632590681   1 MLRATKVRIYPTPEQAEYLYAQFGAVRFAYNKALHIKKHAYQRHG 45
Cdd:pfam12323   2 VLKAYKYRLYPTPEQEELLARTFGCARFVYNKALAERKEAYKEGG 46
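
To make the alignment blocks on this page easier to compare, the following sketch counts identical columns between a query row and a Cdd row, using the pfam12323 pair above as input. The function name and the choice to skip columns containing a gap character are assumptions for illustration; CD-Search itself reports bit scores and E-values, not percent identity.

```python
def percent_identity(query_row, subject_row):
    """Count identical residues over the gap-free columns of two aligned rows.

    Returns (identical, compared_columns, percent). Columns where either row
    has '-' are skipped; case is ignored because the page lowercases
    unaligned residues.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")
    pairs = [
        (q.upper(), s.upper())
        for q, s in zip(query_row, subject_row)
        if q != "-" and s != "-"
    ]
    if not pairs:
        raise ValueError("no gap-free columns to compare")
    identical = sum(q == s for q, s in pairs)
    return identical, len(pairs), 100.0 * identical / len(pairs)


# Rows copied from the pfam12323 alignment (query residues 1-45).
query = "MLRATKVRIYPTPEQAEYLYAQFGAVRFAYNKALHIKKHAYQRHG"
model = "VLKAYKYRLYPTPEQEELLARTFGCARFVYNKALAERKEAYKEGG"

identical, compared, pct = percent_identity(query, model)
print(f"{identical}/{compared} identical columns ({pct:.1f}%)")
```

The same function can be applied block by block to the longer alignments, as long as both rows of each block are concatenated in the same order.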
PHA02942 | PHA02942 | putative transposase; Provisional | 199-369 | 6.07e-08

Pssm-ID: 165252 [Multi-domain]  Cd Length: 383  Bit Score: 54.26  E-value: 6.07e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 199 KIANPRHLINASRNLrrkQKALSRKQKgsaNRKKARIRLAALHERVANARADFQHKLSRTIVDenqavIVETLKtANMMK 278
Cdd:PHA02942  204 RLHDAHHFKSLAENL---QKKYPRRWK---ENKRILHRARSFHHKAKLIMEDFARKVGKWVVE-----IAEDLG-ANVIK 271
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 279 NHNLARVIGDAG--------------WHSFITSLEYKAAEKGAHLVKLDQWFaSSKTCHCCGYKMSEMPlhKRIWRCPEC 344
Cdd:PHA02942  272 LEDLKNLIKDVNklpaefrdklylmqYHRIQYWIEWQAKKHGMIVEFVNPSY-SSVSCPKCGHKMVEIA--HRYFHCPSC 348
                         170       180
                  ....*....|....*....|....*
gi 1632590681 345 GIEHDRDINAALNIRQKGILELKAA 369
Cdd:PHA02942  349 GYENDRDVIAIMNLNGRGSLTLSTA 373
tspaseT_teng_C | TIGR01766 | transposase, IS605 OrfB family, central region | 246-318 | 3.25e-06

This model represents a region of sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposases described by pfam01385, and other proteins. [Mobile and extrachromosomal element functions, Transposon functions]

Pssm-ID: 273793 [Multi-domain]  Cd Length: 82  Bit Score: 44.63  E-value: 3.25e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1632590681 246 NARADFQHKLSRTIV----DENQAVIVETLKTA-NMM--KNHNLARVIGDAGWHSFITSLEYKAAEKGAHLVKLDQWFAS 318
Cdd:TIGR01766   3 NKVEDFLHKIVKQIVeyakENNGTIVLEDLKNIrEMVdkKSKYLRRKLHQWSFRKLISKIKYKAEEYGIEVIEVNPAYTS 82
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
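
As a small illustration of how the E-value threshold relates to the hit list, the sketch below re-enters the hits reported on this page (name, accession, interval, E-value) and ranks those at or below 0.01. The tuple layout and the use of "at or below" as the cutoff rule are assumptions for illustration; this is not how CD-Search computes or filters its results internally.

```python
# (name, accession, start, end, e_value), transcribed from the hit list above.
HITS = [
    ("InsQ", "COG0675", 1, 373, 5.71e-145),
    ("IS200_TnpB", "NF038281", 2, 363, 9.65e-143),
    ("guided_TnpB", "NF040570", 6, 361, 1.49e-109),
    ("IS607_TnpB", "NF038280", 1, 359, 3.16e-53),
    ("OrfB_Zn_ribbon", "pfam07282", 291, 361, 6.99e-24),
    ("OrfB_IS605", "pfam01385", 169, 278, 1.56e-23),
    ("HTH_OrfB_IS605", "pfam12323", 1, 45, 3.43e-16),
    ("PHA02942", "PHA02942", 199, 369, 6.07e-08),
    ("tspaseT_teng_C", "TIGR01766", 246, 318, 3.25e-06),
]

E_VALUE_THRESHOLD = 0.01  # from "Preset Options" on this page

# Keep hits at or below the threshold (assumed cutoff rule), best first.
passing = [hit for hit in HITS if hit[4] <= E_VALUE_THRESHOLD]
ranked = sorted(passing, key=lambda hit: hit[4])

for name, accession, start, end, e_value in ranked:
    span = end - start + 1  # number of query residues covered by the hit
    print(f"{name:<15} {accession:<10} {start:>3}-{end:<3} ({span:>3} aa)  E={e_value:.2e}")
```

Every listed hit passes the threshold, which is expected since the page only shows hits that met it.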
