Conserved domains on [gi|2318961068|gb|UYJ17975|]

MAG: IS3 family transposase [Veillonellaceae bacterium]

Protein Classification

transposase family protein (domain architecture ID 1750059)

A transposase family protein may bind to the end of a transposon and catalyze its movement to another part of the genome by a cut-and-paste or a replicative transposition mechanism.

Gene Ontology:  GO:0003677|GO:0006313
PubMed:  11774877

List of domain hits

Name Accession Description Interval E-value
transpos_IS3 NF033516 IS3 family transposase; 4-272 7.10e-73

Pssm-ID: 468052 [Multi-domain]  Cd Length: 369  Bit Score: 227.83  E-value: 7.10e-73
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068   4 LCRLSKVSRAAYYGWLNHIKSGRELLREKVAQEVVKIHQEYPD-MGYRRMNDWIKkysESHLMVSDSLVLRVRRILNIKS 82
Cdd:NF033516  106 ACRVLGVSRSTYYYWRKRPPSRRAPDDAELRARIREIFEESRGrYGYRRITALLR---REGIRVNHKRVYRLMRELGLLA 182
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068  83 VIKYKTDGCTRNAKDPKYIFENLLNRDFDAGVSNARWMTDVTEFKytTADGvlhKLYLSAIIDGHDRRIVSYVIGDRNNT 162
Cdd:NF033516  183 RRRRKRRPYTTDSGHVHPVAPNLLNRQFTATRPNQVWVTDITYIR--TAEG---WLYLAVVLDLFSREIVGWSVSTSMSA 257
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068 163 ALAFETMEKALKEN--PGaHPMIHTDRGFQYTSNGFHKIVEKAGLVHSMSRVGCCADNGLMEGFWGMLKRERYYTRKFTS 240
Cdd:NF033516  258 ELVLDALEMAIEWRgkPE-GLILHSDNGSQYTSKAYREWLKEHGITQSMSRPGNCWDNAVAESFFGTLKRECLYRRRFRT 336
                         250       260       270
                  ....*....|....*....|....*....|..
gi 2318961068 241 RKAVVSMINGYIYFYNNKRIQRKLHLLAPMEV 272
Cdd:NF033516  337 LEEARQAIEEYIEFYNHERPHSSLGYLTPAEF 368
 
Tra5 COG2801 Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons]; 4-280 1.76e-70

Pssm-ID: 442053 [Multi-domain]  Cd Length: 309  Bit Score: 219.64  E-value: 1.76e-70
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068   4 LCRLSKVSRAAYYGWLNHIKSGRELLREKVAQEVVKIHQEYPDMGYRRMNDWIKKyseSHLMVSDSLVLRVRRILNIKSV 83
Cdd:COG2801    44 RLLRRRRARSRRRRRLRRPRSYRADEDAELLERIKEIFAESPRYGYRRITAELRR---EGIAVNRKRVRRLMRELGLQAR 120
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068  84 IKYKTDGCTRNAKDPKyIFENLLnrdFDAGVSNARWMTDVTEFKytTADGvlhKLYLSAIIDGHDRRIVSYVIGDRNNTA 163
Cdd:COG2801   121 RRRKKKYTTYSGHGGP-IAPNLL---FTATAPNQVWVTDITYIP--TAEG---WLYLAAVIDLFSREIVGWSVSDSMDAE 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068 164 LAFETMEKAL-KENPGAHPMIHTDRGFQYTSNGFHKIVEKAGLVHSMSRVGCCADNGLMEGFWGMLKRERYYTRKFTSRK 242
Cdd:COG2801   192 LVVDALEMAIeRRGPPKPLILHSDNGSQYTSKAYQELLKKLGITQSMSRPGNPQDNAFIESFFGTLKYELLYRRRFESLE 271
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 2318961068 243 AVVSMINGYIYFYNNKRIQRKLHLLAPMEVFNAAPMAA 280
Cdd:COG2801   272 EAREAIEEYIEFYNHERPHSSLGYLTPAEYEKQLAAAA 309
PHA02517 PHA02517 putative transposase OrfB; Reviewed 10-275 3.91e-36

Pssm-ID: 222853 [Multi-domain]  Cd Length: 277  Bit Score: 129.98  E-value: 3.91e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068  10 VSRAAYYGWLN--HIKSGRELLREKVA---QEVVKIHQEYPDMgYRRMNDWiKKYSESHLMVSDSLVLRVRRILNIKSVI 84
Cdd:PHA02517    3 IAPSTYYRCQQqrHHPDKRRARAQHDDwlkSEILRVYDENHQV-YGVRKVW-RQLNREGIRVARCTVGRLMKELGLAGVL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068  85 ----KYKTDGCTRNAKdpkyifENLLNRDFDAGVSNARWMTDVTEFkyTTADGVLhklYLSAIIDGHDRRIVSYVIGDRN 160
Cdd:PHA02517   81 rgkkVRTTISRKAVAA------PDRVNRQFVATRPNQLWVADFTYV--STWQGWV---YVAFIIDVFARRIVGWRVSSSM 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068 161 NTALAFETMEKALK--ENPGAhPMIHTDRGFQYTSNGFHKIVEKAGLVHSMSRVGCCADNGLMEGFWGMLKRERYYTRKF 238
Cdd:PHA02517  150 DTDFVLDALEQALWarGRPGG-LIHHSDKGSQYVSLAYTQRLKEAGIRASTGSRGDSYDNAPAESINGLYKAEVIHRVSW 228
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 2318961068 239 TSRKAVVSMINGYIYFYNNKRIQRKLHLLAPMEVFNA 275
Cdd:PHA02517  229 KNREEVELATLEWVAWYNNRRLHERLGYTPPAEAEKA 265
rve pfam00665 Integrase core domain; 115-215 9.52e-23

Integrase core domain. Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc-binding domain (pfam02022). This domain is the central catalytic domain. The carboxyl-terminal domain is a non-specific DNA-binding domain (pfam00552). The catalytic domain acts as an endonuclease when two nucleotides are removed from the 3' ends of the blunt-ended viral DNA made by reverse transcription. This domain also catalyzes the DNA strand transfer reaction of the 3' ends of the viral DNA to the 5' ends of the integration site.

Pssm-ID: 459897 [Multi-domain]  Cd Length: 98  Bit Score: 89.68  E-value: 9.52e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068 115 SNARWMTDVTEFKYTTADGvlhKLYLSAIIDGHDRRIVSYVIGDRNNTALAFETMEKALKENPGAHPMIHTDRGFQYTSN 194
Cdd:pfam00665   1 PNQLWQGDFTYIRIPGGGG---KLYLLVIVDDFSREILAWALSSEMDAELVLDALERAIAFRGGVPLIIHSDNGSEYTSK 77
                          90       100
                  ....*....|....*....|.
gi 2318961068 195 GFHKIVEKAGLVHSMSRVGCC 215
Cdd:pfam00665  78 AFREFLKDLGIKPSFSRPGNP 98
transpos_IS481 NF033577 IS481 family transposase; 110-273 6.64e-12

Pssm-ID: 468094 [Multi-domain]  Cd Length: 283  Bit Score: 64.15  E-value: 6.64e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068 110 FDAGVSNARWMTDVteFKYTTADGVLhKLYLSAIIDGHDRRIVSYVIGDRN-NTALAFetMEKALKENPGAHPMIHTDRG 188
Cdd:NF033577  122 YERAHPGELWHIDI--KKLGRIPDVG-RLYLHTAIDDHSRFAYAELYPDETaETAADF--LRRAFAEHGIPIRRVLTDNG 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2318961068 189 FQYTSN--GFHKIVEKAGLVHSMSRVGCCADNGLMEGFWGMLKRERYYTRKFTSRKAVVSMINGYIYFYNNKRIQRKLHL 266
Cdd:NF033577  197 SEFRSRahGFELALAELGIEHRRTRPYHPQTNGKVERFHRTLKDEFAYARPYESLAELQAALDEWLHHYNHHRPHSALGG 276

                  ....*..
gi 2318961068 267 LAPMEVF 273
Cdd:NF033577  277 KTPAERF 283
 
rve_2 pfam13333 Integrase core domain; 222-271 1.72e-06

Pssm-ID: 372570 [Multi-domain]  Cd Length: 52  Bit Score: 44.17  E-value: 1.72e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 2318961068 222 EGFWGMLKRERYYTRKFTSRKAVVSMINGYIYFYNNKRiqrkLHLLAPME 271
Cdd:pfam13333   1 ESFFGSLKTEMVYGEHFKTLEELELAIFDYIEWYNNKR----LKGLSPVQ 46
rve_3 pfam13683 Integrase core domain; 204-259 6.83e-06

Pssm-ID: 433402 [Multi-domain]  Cd Length: 67  Bit Score: 42.97  E-value: 6.83e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 2318961068 204 GLVHSMSRVGCCADNGLMEGFWGMLKRERYYTRKFTSRKAVVSMINGYIYFYNNKR 259
Cdd:pfam13683   2 GIEISYIAPGKPMQNGLVESFNGTLRDECLNEHLFSSLAEARALLAAWREDYNTER 57
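
The pairwise alignments above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output, that counts identical residues between a query row and a model row. It assumes that `-` marks a gap and that lowercase letters mark residues left unaligned to the model; the function name `alignment_identity` is hypothetical. The example rows are copied from the rve_2 (pfam13333) hit.

```python
def alignment_identity(query_row: str, model_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two alignment rows."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        # Assumption: '-' is a gap and lowercase residues are unaligned,
        # so neither kind of column counts as an aligned position.
        if q == "-" or m == "-" or q.islower() or m.islower():
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Rows taken from the rve_2 (pfam13333) alignment, query residues 222-271.
QUERY = "EGFWGMLKRERYYTRKFTSRKAVVSMINGYIYFYNNKRiqrkLHLLAPME"
MODEL = "ESFFGSLKTEMVYGEHFKTLEELELAIFDYIEWYNNKR----LKGLSPVQ"

identical, aligned = alignment_identity(QUERY, MODEL)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For this hit the count is 19 identical residues over 46 aligned columns, roughly 41% identity. The four lowercase query residues (`iqrk`) fall opposite a gap in the model and are excluded.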
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
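
To apply the E-value threshold listed above to hits copied from this page, the per-hit summary lines can be parsed with a regular expression. This is only a sketch based on the line layout seen in this page; the field names (`pssm_id`, `cd_length`, `bit_score`, `e_value`) and the helpers `parse_summary` and `passes_threshold` are hypothetical and are not part of any NCBI API.

```python
import re

# Pattern derived from the "Pssm-ID: ... E-value: ..." lines in this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str):
    """Parse one hit summary line into a dict, or return None if it does not match."""
    match = SUMMARY.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "kind": match["kind"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def passes_threshold(hit: dict, threshold: float = 0.01) -> bool:
    """True if the hit's E-value is at or below the threshold (0.01 on this page)."""
    return hit["e_value"] <= threshold


# Summary line of the weakest hit reported on this page (rve_3, pfam13683).
line = "Pssm-ID: 433402 [Multi-domain]  Cd Length: 67  Bit Score: 42.97  E-value: 6.83e-06"
hit = parse_summary(line)
print(hit, passes_threshold(hit))
```

Even the weakest reported hit (rve_3, E-value 6.83e-06) lies well below the 0.01 threshold, which is consistent with every listed hit having passed the preset filter.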
