Conserved domains on  [gi|16130668|ref|NP_417241|]

CRISPR-associated endonuclease/helicase Cas3 [Escherichia coli str. K-12 substr. MG1655]
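The bracketed identifier in the page header uses NCBI's pipe-delimited form, in which database tags alternate with their values (here a GI number and a RefSeq accession). A minimal sketch of splitting it apart is shown below; the function name `parse_ncbi_identifier` is hypothetical and is not part of any NCBI tool.

```python
def parse_ncbi_identifier(identifier):
    """Split a pipe-delimited NCBI identifier into {tag: value} pairs.

    Hypothetical helper for illustration only.
    """
    # Drop the surrounding square brackets, then split on "|".
    fields = identifier.strip("[]").split("|")
    # Fields alternate tag, value, tag, value, ...; the empty field left
    # by the trailing "|" has no partner and is dropped by zip().
    pairs = zip(fields[0::2], fields[1::2])
    return {tag: value for tag, value in pairs if tag}


ids = parse_ncbi_identifier("[gi|16130668|ref|NP_417241|]")
print(ids["gi"], ids["ref"])
```

The RefSeq accession is usually the more durable key for looking the record up again, so a caller would typically keep `ids["ref"]`.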

Protein Classification

DEAD/DEAH box helicase family protein (domain architecture ID 11484460)

DEAD/DEAH box helicase family protein such as a DEAD/DEAH box-containing ATP-dependent helicase, which catalyzes the unwinding of DNA or RNA

List of domain hits

Name  Accession  Description  Interval  E-value
PRK09694  PRK09694  CRISPR-associated helicase/endonuclease Cas3;  1-883  0e+00

CRISPR-associated helicase/endonuclease Cas3;


Pssm-ID: 182031 [Multi-domain]  Cd Length: 878  Bit Score: 1596.77  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    1 MEPFKYICHYWGKSSKSLTKGNDIHLLIYHCLDVAAVADCWWDQSVVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFD 80
Cdd:PRK09694   1 MESFKYYCRYWGKASKSLTKGNDYHLLPYHCLDVAAVADCWWDQSPVLRSQFSANEMLSKQQVRAWLLFFVALHDIGKFD 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   81 IRFQYKSAESWLKLNPATPSLNGPSTQMCRKFNHGAAGLYWFNQDSLSEQSLGDffSFFDAAPHPYESWFPWVEAVTGHH 160
Cdd:PRK09694  81 IRFQYKAPEIWLKLNPAGPSISGPSTQMCRKYDHGAAGLLWFRQDFRSNQASDD--SFFDAAPHPYEAWFPWMEAVTGHH 158
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  161 GFILHSQDQDKSRWEMPASLASYAAQDKQAREEWISVLEALFLTPAGLSINDIPPDCSSLLAGFCSLADWLGSWTTTNTF 240
Cdd:PRK09694 159 GYILHSQDQDDSRWEMPASLASYAEQDKQAREEWIQALEALFLTPAGLSLNDIPPPCSPLLAGFCSVSDWLGSWTTTFTF 238
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  241 LFNedapSDINALRTYFQDRQQDASRVLELSGLVSNKRCYEGVHALLDNGYQPRQLQVLVDALPVAPGLTVIEAPTGSGK 320
Cdd:PRK09694 239 LFN----SPILALRQYFQQRQQDAARVLELSGLVANKKPYGGVHALLDNGYQPRQLQTLVDALPLQPGLTIIEAPTGSGK 314
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  321 TETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSPNLILAHGNSRFNHLFQSIKSRAITEQGQEEAW 400
Cdd:PRK09694 315 TEAALAYAWRLIDQGLADSIIFALPTQATANAMLSRLEALASKLFPSPNLILAHGNSRFNHLFQSLKSRAATEQGQEEAW 394
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  401 VQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLPVKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEAVLKAQADVGGSVI 480
Cdd:PRK09694 395 VQCCEWLSQSNKRVFLGQIGVCTIDQVLISVLPVKHRFIRGFGLGRSVLIVDEVHAYDAYMYGLLEAVLKAQAQAGGSVI 474
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  481 LLSATLPMKQKQKLLDTYGLHtDPVENNSAYPLINWRGVNGAQRFDLLAHPEQLPPRFSIQPEPICLADMLPDLTMLERM 560
Cdd:PRK09694 475 LLSATLPATLKQKLLDTYGGH-DPVELSSAYPLITWRGVNGAQRFDLSAHPEQLPARFTIQLEPICLADMLPDLTLLQRM 553
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  561 IAAANAGAQVCLICNLVDVAQVCYQRLKELNNTQVDIDLFHARFTLNDRREKENRVISNFGKNGKRNVGRILVATQVVEQ 640
Cdd:PRK09694 554 IAAANAGAQVCLICNLVDDAQKLYQRLKELNNTQVDIDLFHARFTLNDRREKEQRVIENFGKNGKRNQGRILVATQVVEQ 633
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  641 SLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRPAGFEIPVATILLPDGEGYGRHEHIYSNVRVMWRTQQHIEELNGAS 720
Cdd:PRK09694 634 SLDLDFDWLITQLCPVDLLFQRLGRLHRHHRKYRPAGFEIPVATVLLPDGEGYGRSGYIYGNTRVLWRTEQLLEEHNAAS 713
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  721 LFFPDAYRQWLDSIYDDAEMD-EPEWVGNGMDKFESAECEKRFKARKVLQWAEEYSLQDNDETILAVTRDGEMSLPLLPY 799
Cdd:PRK09694 714 LFFPDAYREWIESVYDEAEMDeEPEWVISGMDKFEDKECEKRYKARKMLKWAEETPLSDNDERVLALTRDGEMSLPVLPY 793
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  800 VQTSSGKQLLDGQVYEDLSHEQQYEALALNRVNVPFTWKRSFSEVVDEDGLLWLEGKQNLDGWVWQG-NSIVITYTGDEG 878
Cdd:PRK09694 794 VQTEHGKQLLDGQVLEQLDEEQQYEALALNRVNVPHTWKRSFLEVVDEDGLIWLEGHQDADGWCWQGkNDIVITYTEDEG 873

                 ....*
gi 16130668  879 MTRVI 883
Cdd:PRK09694 874 MTRVI 878
 
cas3_core  TIGR01587  CRISPR-associated helicase Cas3; This model represents the highly conserved core region of an ...  309-716  3.32e-159

CRISPR-associated helicase Cas3; This model represents the highly conserved core region of an alignment of Cas3, a protein found in association with CRISPR repeat elements in a broad range of bacteria and archaea. Cas3 appears to be a helicase, with regions found by pfam00270 (DEAD/DEAH box helicase) and pfam00271 (Helicase conserved C-terminal domain). Some but not all members have an N-terminal HD domain region (pfam01966) that is not included within this model.


Pssm-ID: 273707 [Multi-domain]  Cd Length: 359  Bit Score: 469.63  E-value: 3.32e-159
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   309 LTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEAsashLFSSpNLILAHGNSRFnhlfqsiks 388
Cdd:TIGR01587   1 LLVIEAPTGYGKTEAALLWALHSIKSQKADRVIIALPTRATINAMYRRAKE----LFGS-ELVGLHHSSSF--------- 66
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   389 RAITEQGQEEAWVQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLP-VKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEA 467
Cdd:TIGR01587  67 SRIKEMGDSEEFEHLFPLYIHSNDKLFLDPITVCTIDQVLKSVFGeFGHYEFTLASIANSLLIFDEVHFYDEYTLALILA 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   468 VLKAQADVGGSVILLSATLPmkqkqKLLDTYGLHTDPVENNSAYPLINwrgvngaqrfdllahpeqlPPRFSIQPEPICL 547
Cdd:TIGR01587 147 VLEVLKDNDVPILLMSATLP-----KFLKEYAEKIGYVEFNEPLDLKE-------------------ERRFENHRFILIE 202
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   548 ADMLPDLTMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNTQvDIDLFHARFTLNDRREKENRVISNFGKNgkrN 627
Cdd:TIGR01587 203 SDKVGEISSLERLLEFIKKGGSIAIIVNTVDRAQEFYQQLKEKAPEE-EIILYHSRFTEKDRAKKEAELLREMKKS---N 278
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   628 VGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRpAGFEIPVATILLPDGegygrheHIYSNVRVMW 707
Cdd:TIGR01587 279 EKFVIVATQVIEASLDISADVMITELAPIDSLIQRLGRLHRYGRKIG-ENFEVYIITIAPEGK-------LFPYPYELVE 350

                  ....*....
gi 16130668   708 RTQQHIEEL 716
Cdd:TIGR01587 351 RTIQKLEES 359
Cas3_I  cd09639  CRISPR/Cas system-associated protein Cas3; CRISPR (Clustered Regularly Interspaced Short ...  309-716  1.64e-157

CRISPR/Cas system-associated protein Cas3; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; DEAD/DEAH box helicase DNA helicase cas3'; often, but not always, fused to an HD nuclease domain; signature gene for Type I CRISPR-Cas systems


Pssm-ID: 187770 [Multi-domain]  Cd Length: 353  Bit Score: 464.98  E-value: 1.64e-157
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 309 LTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSpnlilahgnsrfnhlFQSIKS 388
Cdd:cd09639   1 LLVIEAPTGYGKTEAALLWALHSLKSQKADRVIIALPTRATINAMYRRAKEAFGETGLY---------------HSSILS 65
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 389 RAITEQGQEEAWVQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLP-VKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEA 467
Cdd:cd09639  66 SRIKEMGDSEEFEHLFPLYIHSNDTLFLDPITVCTIDQVLKSVFGeFGHYEFTLASIANSLLIFDEVHFYDEYTLALILA 145
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 468 VLKAQADVGGSVILLSATLPmkqkqKLLDTYGLHTDPVENNSAYPLINWRGvngaqrfdllahpeqlpprfsiQPEPICL 547
Cdd:cd09639 146 VLEVLKDNDVPILLMSATLP-----KFLKEYAEKIGYVEENEPLDLKPNER----------------------APFIKIE 198
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 548 ADMLPDLTMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNtQVDIDLFHARFTLNDRREKENRVISNFGKNGKrn 627
Cdd:cd09639 199 SDKVGEISSLERLLEFIKKGGSVAIIVNTVDRAQEFYQQLKEKGP-EEEIMLIHSRFTEKDRAKKEAELLLEFKKSEK-- 275
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 628 vgRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRpagfEIPVATILLPDGEGygrheHIYSNVRVMW 707
Cdd:cd09639 276 --FVIVATQVIEASLDISVDVMITELAPIDSLIQRLGRLHRYGEKNG----EEVYIITDAPDGKG-----QKPYPYDLVE 344

                ....*....
gi 16130668 708 RTQQHIEEL 716
Cdd:cd09639 345 RTIELLEEG 353
Cas3  COG1203  CRISPR-Cas type I system-associated endonuclease/helicase Cas3 [Defense mechanisms]; ...  144-756  1.90e-115

CRISPR-Cas type I system-associated endonuclease/helicase Cas3 [Defense mechanisms]; CRISPR-Cas type I system-associated endonuclease/helicase Cas3 is part of the Pathway/BioSystem: CRISPR-Cas system


Pssm-ID: 440816 [Multi-domain]  Cd Length: 535  Bit Score: 362.86  E-value: 1.90e-115
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 144 HPYESWFPWVEAVTGHHGFILHSQDQDKSRWEMPASLASYAAQDKQAREEW--ISVLEALFLTPAGLSINDIPPDCSSLL 221
Cdd:COG1203   5 AKEALLGALALAALLLLLLALLLAALLLLLLAALLLALLLALLLLAALELAllLLLLLLLLLLLLLLLLDLLLDDLAFLF 84
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 222 AGFCSLADWLGSWTTtntflfnEDAPSDINALRtyfqdrqqdasrvlelsglvsnkrcYEGVHALLDNGYQPR------Q 295
Cdd:COG1203  85 LLLLIDADWLDSANF-------DMARQALDHLL-------------------------AERLERLLPKKSKPRtpinplQ 132
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 296 LQVLVDALPVA---PGLTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEAsashlFSSPNLIL 372
Cdd:COG1203 133 NEALELALEAAeeePGLFILTAPTGGGKTEAALLFALRLAAKHGGRRIIYALPFTSIINQTYDRLRD-----LFGEDVLL 207
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 373 AHGNSRFNHLfqsiksRAITEQGQEEAWvqccqwlSQSNKKVFLGQIGVCTIDQVLISVL-PVKHRFIRGLGIGRSVLIV 451
Cdd:COG1203 208 HHSLADLDLL------EEEEEYESEARW-------LKLLKELWDAPVVVTTIDQLFESLFsNRKGQERRLHNLANSVIIL 274
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 452 DEVHAYDTYMNGLLEAVLKAQADVGGSVILLSATLPMKQKQKLLDTYGLHTDPVENNSAYplinwrgvngAQRFDllahp 531
Cdd:COG1203 275 DEVQAYPPYMLALLLRLLEWLKNLGGSVILMTATLPPLLREELLEAYELIPDEPEELPEY----------FRAFV----- 339
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 532 eqlPPRFSIQPEPICLADmlpdltMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNtQVDIDLFHARFTLNDRRE 611
Cdd:COG1203 340 ---RKRVELKEGPLSDEE------LAELILEALHKGKSVLVIVNTVKDAQELYEALKEKLP-DEEVYLLHSRFCPADRSE 409
                       490       500       510       520       530       540       550       560
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 612 KENRVISNFgkngKRNVGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKyrpagFEIPVATILLPDGE 691
Cdd:COG1203 410 IEKEIKERL----ERGKPCILVSTQVVEAGVDIDFDVVIRDLAPLDSLIQRAGRCNRHGRK-----EEEGNVYVFDPEDE 480
                       570       580       590       600       610       620
                ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 16130668 692 GYGRhehIYSNVRVmWRTQQHIEELNGaslFFPDAYRQWLDSIYDDAEMDEPEWvgngMDKFESA 756
Cdd:COG1203 481 GGGY---VYDKPLL-ERTRELLREHDE---ILPEDKRELIEEYYRELYELLPDE----LDSFKEI 534
HD_6  pfam18019  HD domain; This HD domain is found at the N-terminus of Cas3 enzymes fused to a helicase ...  11-236  7.12e-47

HD domain; This HD domain is found at the N-terminus of Cas3 enzymes fused to a helicase domain. This domain is sometimes found as a separate protein. It acts as a nuclease that cleaves ssDNA.


Pssm-ID: 436216  Cd Length: 212  Bit Score: 166.46  E-value: 7.12e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    11 WGKSSKslTKGNDIHLLIYHCLDVAAVADCWWDQSV--VLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQYKSA 88
Cdd:pfam18019   1 WAKSDR--EGGGGWHPLVYHLLDVAAVAGALWDHWLapGVRDLLARLLGLDEEAARRLLAFLAALHDIGKASPAFQAKVP 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    89 ESWLKLNPATPSLNGPSTqmCRKFNHGAAGLYWFNQDslseqslgdffsFFDAAPHPYESWFPWVEAVTGHHGF-ILHSQ 167
Cdd:pfam18019  79 ELAEKLRDAGLPFPSSLD--ESRARHGLAGAALLREW------------LEDEAGWDRGVARALAAAVGGHHGRpPAEDL 144
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   168 DQDKSRWEMPASLASYA-AQDKQAREEWISVLEALFLTPAGlsinDIPPDCSSLLAGFCSLADWLGSWTT 236
Cdd:pfam18019 145 RLARPALRPAGGSWQEArRELLEAAAAFLGAAAVLLLPPAR----ELSQPAQVLLAGLVILADWIASNED 210
DEXDc  smart00487  DEAD-like helicases superfamily;  310-487  1.81e-13

DEAD-like helicases superfamily;


Pssm-ID: 214692 [Multi-domain]  Cd Length: 201  Bit Score: 70.21  E-value: 1.81e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    310 TVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFssPNLILAHGNSRFNHLFQSIKSR 389
Cdd:smart00487  27 VILAAPTGSGKTLAALLPALEALKRGKGGRVLVLVPTRELAEQWAEELKKLGPSLG--LKVVGLYGGDSKREQLRKLESG 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    390 AIteqgqeeawvqccqwlsqsnkkvflgQIGVCTIdQVLISVLPVKHRFIRGLgigrSVLIVDEVHAYDTYMNGLLEAVL 469
Cdd:smart00487 105 KT--------------------------DILVTTP-GRLLDLLENDKLSLSNV----DLVILDEAHRLLDGGFGDQLEKL 153
                          170
                   ....*....|....*...
gi 16130668    470 KAQADVGGSVILLSATLP 487
Cdd:smart00487 154 LKLLPKNVQLLLLSATPP 171
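
Each hit in the list above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." together with a query interval. The sketch below shows one way such lines could be turned into numbers and how the fraction of the query spanned by a hit could be computed. The function names and the regular expression are illustrative assumptions, not part of any NCBI tool, and the query length of 883 is taken from the last residue number in the alignment above.

```python
import re

# Assumed layout of the per-hit summary line, as it appears on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line):
    """Parse one 'Pssm-ID: ...' summary line into a dict (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not a hit summary line: %r" % line)
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": "[Multi-domain]" in line,
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        # "0e+00" parses to 0.0, i.e. below the reported precision.
        "evalue": float(match.group("evalue")),
    }


def query_coverage(interval, query_length):
    """Fraction of the query spanned by an inclusive 'start-end' interval."""
    start, end = (int(part) for part in interval.split("-"))
    return (end - start + 1) / query_length


QUERY_LENGTH = 883  # assumption: last query residue shown in the alignment

hit = parse_summary(
    "Pssm-ID: 436216  Cd Length: 212  Bit Score: 166.46  E-value: 7.12e-47"
)
print(hit["pssm_id"], hit["bit_score"], hit["evalue"])

# Intervals copied from the hit list above.
for name, interval in [("PRK09694", "1-883"), ("HD_6", "11-236"), ("cas3_core", "309-716")]:
    print(name, round(query_coverage(interval, QUERY_LENGTH), 3))
```

Coverage computed this way counts the whole span of the interval, including any gapped or unaligned stretches inside it, so it is an upper bound on the aligned fraction.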
 
gi 16130668 452 DEVHAYDTYMNGLLEAVLKAQADVGGSVILLSATLPMKQKQKLLDTYGLHTDPVENNSAYplinwrgvngAQRFDllahp 531
Cdd:COG1203 275 DEVQAYPPYMLALLLRLLEWLKNLGGSVILMTATLPPLLREELLEAYELIPDEPEELPEY----------FRAFV----- 339
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 532 eqlPPRFSIQPEPICLADmlpdltMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNtQVDIDLFHARFTLNDRRE 611
Cdd:COG1203 340 ---RKRVELKEGPLSDEE------LAELILEALHKGKSVLVIVNTVKDAQELYEALKEKLP-DEEVYLLHSRFCPADRSE 409
                       490       500       510       520       530       540       550       560
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 612 KENRVISNFgkngKRNVGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKyrpagFEIPVATILLPDGE 691
Cdd:COG1203 410 IEKEIKERL----ERGKPCILVSTQVVEAGVDIDFDVVIRDLAPLDSLIQRAGRCNRHGRK-----EEEGNVYVFDPEDE 480
                       570       580       590       600       610       620
                ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 16130668 692 GYGRhehIYSNVRVmWRTQQHIEELNGaslFFPDAYRQWLDSIYDDAEMDEPEWvgngMDKFESA 756
Cdd:COG1203 481 GGGY---VYDKPLL-ERTRELLREHDE---ILPEDKRELIEEYYRELYELLPDE----LDSFKEI 534
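
Each hit carries a one-line summary (`Pssm-ID`, `Cd Length`, `Bit Score`, `E-value`). As a rough sketch of how that line could be read programmatically, the pattern below is written against the lines as they appear on this page and is not an official format definition. The name `parse_summary` is hypothetical, and the `[Multi-domain]` tag is treated as optional because the pfam18019 summary omits it.

```python
import re

# Pattern assumed from the summary lines on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Parse one hit summary line into a dict of typed fields."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError("not a hit summary line: %r" % line)
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("multi") is not None,
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


line = "Pssm-ID: 440816 [Multi-domain]  Cd Length: 535  Bit Score: 362.86  E-value: 1.90e-115"
print(parse_summary(line))
```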
DEXHc_cas3 cd17930
DEXH/Q-box helicase domain of Cas3; CRISPR-associated (Cas) 3 is a nuclease-helicase ...
307-499 1.05e-63

DEXH/Q-box helicase domain of Cas3; CRISPR-associated (Cas) 3 is a nuclease-helicase responsible for degradation of dsDNA. The two enzymatic units of Cas3, a histidine-aspartate (HD) nuclease and a Superfamily 2 (SF2) helicase, may be expressed from separate genes as Cas3' (SF2 helicase) and Cas3'' (HD nuclease) or may be fused as a single HD-SF2 polypeptide. The nucleolytic activity of most Cas3 enzymes is transition metal ion-dependent. Cas3 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350688 [Multi-domain]  Cd Length: 186  Bit Score: 212.54  E-value: 1.05e-63
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 307 PGLTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSPNLILAHGNSRFNHLFQSI 386
Cdd:cd17930   1 PGLVILEAPTGSGKTEAALLWALKLAARGGKRRIIYALPTRATINQMYERIREILGRLDDEDKVLLLHSKAALELLESDE 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 387 KsraiteqgqEEAWVQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLPVKHRFIRGLGIGRSVLIVDEVHAYD-TYMNGLL 465
Cdd:cd17930  81 E---------PDDDPVEAVDWALLLKRSWLAPIVVTTIDQLLESLLKYKHFERRLHGLANSVVVLDEVQAYDpEYMALLL 151
                       170       180       190
                ....*....|....*....|....*....|....
gi 16130668 466 EAVLKAQADVGGSVILLSATLPMKQKQKLLDTYG 499
Cdd:cd17930 152 KALLELLGELGGPVVLMTATLPALLRDELLEALL 185
HD_6 pfam18019
HD domain; This HD domain is found at the N-terminus of Cas3 enzymes fused to a helicase ...
11-236 7.12e-47

HD domain; This HD domain is found at the N-terminus of Cas3 enzymes fused to a helicase domain. This domain is sometimes found as a separate protein. It acts as a nuclease that cleaves ssDNA.


Pssm-ID: 436216  Cd Length: 212  Bit Score: 166.46  E-value: 7.12e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    11 WGKSSKslTKGNDIHLLIYHCLDVAAVADCWWDQSV--VLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQYKSA 88
Cdd:pfam18019   1 WAKSDR--EGGGGWHPLVYHLLDVAAVAGALWDHWLapGVRDLLARLLGLDEEAARRLLAFLAALHDIGKASPAFQAKVP 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    89 ESWLKLNPATPSLNGPSTqmCRKFNHGAAGLYWFNQDslseqslgdffsFFDAAPHPYESWFPWVEAVTGHHGF-ILHSQ 167
Cdd:pfam18019  79 ELAEKLRDAGLPFPSSLD--ESRARHGLAGAALLREW------------LEDEAGWDRGVARALAAAVGGHHGRpPAEDL 144
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   168 DQDKSRWEMPASLASYA-AQDKQAREEWISVLEALFLTPAGlsinDIPPDCSSLLAGFCSLADWLGSWTT 236
Cdd:pfam18019 145 RLARPALRPAGGSWQEArRELLEAAAAFLGAAAVLLLPPAR----ELSQPAQVLLAGLVILADWIASNED 210
Cas3''_I cd09641
CRISPR/Cas system-associated protein Cas3''; CRISPR (Clustered Regularly Interspaced Short ...
20-233 5.38e-21

CRISPR/Cas system-associated protein Cas3''; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; HD-like nuclease, specifically digesting double-stranded oligonucleotides and preferably cleaving at G:C pairs; signature gene for Type I


Pssm-ID: 193608 [Multi-domain]  Cd Length: 200  Bit Score: 91.95  E-value: 5.38e-21
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  20 KGNDIHLLIYHCLDVAAVAdcwwdqsVVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQYKsaesWLKLNPATP 99
Cdd:cd09641   2 KSGPWQPLLEHLLDVAAWD-------AELAEEFARKLGLELGLSRELLALAGLLHDLGKATPAFQKY----LRGGKEALR 70
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 100 SLNGpstqmcRKFNHGAAGLYWFNQdslseqslgdffsFFDAAPHPYESWFPWVEAVTGHHGFI--LHSQDQDKSRWEMP 177
Cdd:cd09641  71 EGKR------KEVRHSLLGALLLYE-------------LLKELGLDEELALLLAYAIAGHHGGLpdVLLLLDEDDESALK 131
                       170       180       190       200       210       220
                ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 16130668 178 ASLASYA------AQDKQAREEWISVLEALFLTPAGLSINDIPPDC-SSLLAGFCSLADWLGS 233
Cdd:cd09641 132 ERLEELDeeklllELWEEELEELLDELLKELLLLLLPELLSFELYLlLRLLFSLLVDADWLAS 194
cas3_HD TIGR01596
CRISPR-associated endonuclease Cas3-HD; CRISPR/Cas systems are widespread, mobile systems for ...
27-233 4.97e-16

CRISPR-associated endonuclease Cas3-HD; CRISPR/Cas systems are widespread, mobile systems for host defense against invasive elements such as phage. In these systems, Cas3 designates one of the core proteins shared widely by multiple types of CRISPR/Cas system. This model represents an HD-like endonuclease that occurs either separately or as the N-terminal region of Cas3, the helicase-containing CRISPR-associated protein.


Pssm-ID: 273711 [Multi-domain]  Cd Length: 176  Bit Score: 76.86  E-value: 4.97e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    27 LIYHCLDVAAVADcwwdqsvVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQyksaeSWLKLNPATPSlngpst 106
Cdd:TIGR01596   1 LKEHLLDVAAVAE-------ALPALRPRLAEKLGLELRELLKLAGLLHDLGKASPAFQ-----KKLRKAEERGD------ 62
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   107 qmCRKFNHGAAGLYWFNQdslseqslgdffsFFDAAPHPYESWFPWVEAVTGHHGFILhsqDQDKSRWEMPASLASYAAQ 186
Cdd:TIGR01596  63 --RGEVRHSTLSAALLYD-------------LLEELGLEEELALLLALAIAGHHGGLI---DDDDLEELLELLERELEEA 124
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|..
gi 16130668   187 DKQAREEWISVLEALFLT-PAGLSINDIPPDCSS----LLAGFCSLADWLGS 233
Cdd:TIGR01596 125 LGELLEELEELLDEVLKAlPLRLLLDKEEPIELYllarLLFGLLVDADWLAS 176
SF2-N cd00046
N-terminal DEAD/H-box helicase domain of superfamily 2 helicases; The DEAD/H-like superfamily ...
310-485 6.63e-14

N-terminal DEAD/H-box helicase domain of superfamily 2 helicases; The DEAD/H-like superfamily 2 helicases comprise a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This N-terminal domain contains the ATP-binding region.


Pssm-ID: 350668 [Multi-domain]  Cd Length: 146  Bit Score: 69.74  E-value: 6.63e-14
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 310 TVIEAPTGSGKTETALAYAWKLIDQQiADSVIFALPTQATANAMLTRMEASASHlfsSPNLILAHGNSRFNHLFQSIKSR 389
Cdd:cd00046   4 VLITAPTGSGKTLAALLAALLLLLKK-GKKVLVLVPTKALALQTAERLRELFGP---GIRVAVLVGGSSAEEREKNKLGD 79
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 390 AiteqgqeeawvqccqwlsqsnkkvflgQIGVCTIDQVLISVLPVKHRFIRGLgigrSVLIVDEVHAYDT---YMNGLLE 466
Cdd:cd00046  80 A---------------------------DIIIATPDMLLNLLLREDRLFLKDL----KLIIVDEAHALLIdsrGALILDL 128
                       170
                ....*....|....*....
gi 16130668 467 AVLKAQADvGGSVILLSAT 485
Cdd:cd00046 129 AVRKAGLK-NAQVILLSAT 146
DEXDc smart00487
DEAD-like helicases superfamily;
310-487 1.81e-13

DEAD-like helicases superfamily;


Pssm-ID: 214692 [Multi-domain]  Cd Length: 201  Bit Score: 70.21  E-value: 1.81e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    310 TVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFssPNLILAHGNSRFNHLFQSIKSR 389
Cdd:smart00487  27 VILAAPTGSGKTLAALLPALEALKRGKGGRVLVLVPTRELAEQWAEELKKLGPSLG--LKVVGLYGGDSKREQLRKLESG 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    390 AIteqgqeeawvqccqwlsqsnkkvflgQIGVCTIdQVLISVLPVKHRFIRGLgigrSVLIVDEVHAYDTYMNGLLEAVL 469
Cdd:smart00487 105 KT--------------------------DILVTTP-GRLLDLLENDKLSLSNV----DLVILDEAHRLLDGGFGDQLEKL 153
                          170
                   ....*....|....*...
gi 16130668    470 KAQADVGGSVILLSATLP 487
Cdd:smart00487 154 LKLLPKNVQLLLLSATPP 171
Cas3_Cas2_I-F cd09673
CRISPR/Cas system-associated protein Cas3/Cas2; CRISPR (Clustered Regularly Interspaced Short ...
296-681 4.79e-10

CRISPR/Cas system-associated protein Cas3/Cas2; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; Cas3/Cas2 fusion; This protein includes both DEAH and HD motifs for helicase and N-terminal domain corresponding to Cas2 RNase; signature gene for Type I and subtype I-F


Pssm-ID: 187804 [Multi-domain]  Cd Length: 1106  Bit Score: 63.74  E-value: 4.79e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  296 LQVLVDALPVAPGLTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALP----TQATANAMLTRMeasasHLfSSPNLI 371
Cdd:cd09673  416 AQKLRQKSPEQGAFGVNMASTGCGKTLANARAMYALRDDKQGARFAIALGlrslTLQTGHALKTRL-----NL-SDDDLA 489
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  372 LAHGNSRFNHLFQSIKSRA--ITEQGQEEA-----WVQCC---------------QWLS--QSNKKVFLGQIGVCTIDQV 427
Cdd:cd09673  490 VLIGGTAVQTLFDLSKEKIeqVDEDGSESApiflaEGQDCnlpdwdgpldtiellGRLSldDKEKTLLAAPVLVCTIDHL 569
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  428 LISVLPVK--HRFIRGLGIGRSVLIVDEVHAYDtyMNGL--LEAVLKAQADVGGSVILLSATLPMKQKQKLLDTYGLHTD 503
Cdd:cd09673  570 IPATESHRggHHIAPMLRLMSSDLILDEPDDYE--PEDLpaLLRLVQLAGLLGSRVLLSSATLPPALVKTLFRAYEAGRQ 647
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  504 PVENNSAYP------LINW------RGVNGAQRFDLLAH--------PEQL---PPR-----FSIQPEPICLADMLPDL- 554
Cdd:cd09673  648 MYQALYGQPkkplniCCAWvdepqvWQADCNQKSEFIQRhqdflrdrAVQLakkPVRrlaelLSLSSLKPRNESTYLALa 727
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  555 -TMLERMIAAANAGAQ---------------VCLICNLVDVAQVCYQRLKElNNTQVDIDLFHARFTLNDRREKENR--- 615
Cdd:cd09673  728 qSLLEGALRLHQAHAQtdpksekkvsvglirVANIDPLIRLAQFLYALLAE-EKFAIHLCCYHAQDPLLLRSYIERRldq 806
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668  616 ---------------VISNFGKNGKRNVGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHH-RKYRPAGFE 679
Cdd:cd09673  807 lltrhkpeqlfqddeIIDLMQNSPALNHLFIVLATPVEEVGRDHDYDWAIADPSSMRSIIQLAGRVNRHRlEKVQQPNIV 886

                 ..
gi 16130668  680 IP 681
Cdd:cd09673  887 IL 888
HELICc smart00490
helicase superfamily c-terminal domain;
584-670 1.42e-06

helicase superfamily c-terminal domain;


Pssm-ID: 197757 [Multi-domain]  Cd Length: 82  Bit Score: 46.82  E-value: 1.42e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668    584 YQRLKELNntqVDIDLFHARFTLNDRREKENRVisnfgKNGKRnvgRILVATQVVEQSLDV-DFDWLITQHCPAD--LLF 660
Cdd:smart00490   4 AELLKELG---IKVARLHGGLSQEEREEILDKF-----NNGKI---KVLVATDVAERGLDLpGVDLVIIYDLPWSpaSYI 72
                           90
                   ....*....|
gi 16130668    661 QRLGRLHRHH 670
Cdd:smart00490  73 QRIGRAGRAG 82
DEAD pfam00270
DEAD/DEAH box helicase; Members of this family include the DEAD and DEAH box helicases. ...
311-487 6.17e-06

DEAD/DEAH box helicase; Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.


Pssm-ID: 425570 [Multi-domain]  Cd Length: 165  Bit Score: 47.24  E-value: 6.17e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   311 VIEAPTGSGKTETALAYAWKLIDQQIADS-VIFALPTQATANAMLTRMEASASHLfsSPNLILAHGNSRFNHLFQSIKSr 389
Cdd:pfam00270  18 LVQAPTGSGKTLAFLLPALEALDKLDNGPqALVLAPTRELAEQIYEELKKLGKGL--GLKVASLLGGDSRKEQLEKLKG- 94
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   390 aiteqgqeeawvqcCQWLsqsnkkvflgqigVCTIDQvLISVLPVKHRFiRGLGIgrsvLIVDEVHAYDTYMNG-LLEAV 468
Cdd:pfam00270  95 --------------PDIL-------------VGTPGR-LLDLLQERKLL-KNLKL----LVLDEAHRLLDMGFGpDLEEI 141
                         170
                  ....*....|....*....
gi 16130668   469 LKaQADVGGSVILLSATLP 487
Cdd:pfam00270 142 LR-RLPKKRQILLLSATLP 159
ResIII pfam04851
Type III restriction enzyme, res subunit;
311-460 1.03e-05

Type III restriction enzyme, res subunit;


Pssm-ID: 398492 [Multi-domain]  Cd Length: 162  Bit Score: 46.51  E-value: 1.03e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   311 VIEAPTGSGKTETALAYAWKLIDQQIADSVIFalptqatanamltrmeasashlfsspnliLAHGNSRFNHLFQSIKSRA 390
Cdd:pfam04851  27 LIVMATGSGKTLTAAKLIARLFKKGPIKKVLF-----------------------------LVPRKDLLEQALEEFKKFL 77
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 16130668   391 ITEQGQEEAWVQCCQWLSQSNKKVFlgqigVCTIDQVLISVLPVKHRFIRGlgiGRSVLIVDEVH--AYDTY 460
Cdd:pfam04851  78 PNYVEIGEIISGDKKDESVDDNKIV-----VTTIQSLYKALELASLELLPD---FFDVIIIDEAHrsGASSY 141
Cas3_I cd09696
CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to ...
547-718 3.12e-05

CRISPR/Cas system-associated protein Cas3; Distinct Cas3 family with HD domain fused to C-terminus of Helicase domain; CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) and associated Cas proteins comprise a system for heritable host defense by prokaryotic cells against phage and other foreign DNA; DNA helicase Cas3; This protein includes both DEAH and HD motifs; signature gene for Type I


Pssm-ID: 187827 [Multi-domain]  Cd Length: 843  Bit Score: 47.71  E-value: 3.12e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 547 LADMLPDLTMLErmiaaANAGAQVCLICNLVDVAQVCYQRLKelnntQVDIDLFHARFTLNDRRE-KENRVISNF---GK 622
Cdd:cd09696 256 LSTMVKELNLLM-----KDSGGAILVFCRTVKHVRKVFAKLP-----KEKFELLTGTLRGAERDDlVKKEIFNRFlpqML 325
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 623 NGKRNVGR----ILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRPAGfeipvATILLPDGEGYGRHEH 698
Cdd:cd09696 326 SGSRARPQqgtvYLVCTSAGEVGVNISADHLVCDLAPFESMQQRFGRVNRFGELQACQI-----AVVHLDLGKDQDFDVY 400
                       170       180
                ....*....|....*....|
gi 16130668 699 IYSNVRVMWRTQQHIEELNG 718
Cdd:cd09696 401 GKKIDKSTWSTLKKLQQLKG 420
EEXXQc_AQR cd17935
EEXXQ-box helicase domain of AQR; Aquarius (AQR) is a multifunctional RNA helicase that binds ...
291-393 3.99e-04

EEXXQ-box helicase domain of AQR; Aquarius (AQR) is a multifunctional RNA helicase that binds precursor-mRNA introns at a defined position and is part of a pentameric intron-binding complex (IBC). It is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350693 [Multi-domain]  Cd Length: 207  Bit Score: 42.80  E-value: 3.99e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 291 YQPRQLQVLVDALpvAPGLTVIEAPTGSGKTETALayawklidqQIADSVIFALPTQATanamltrmeasashlfsspnL 370
Cdd:cd17935   6 FTPTQIEAIRSGM--QPGLTMVVGPPGTGKTDVAV---------QIISNLYHNFPNQRT--------------------L 54
                        90       100
                ....*....|....*....|...
gi 16130668 371 ILAHGNSRFNHLFQSIKSRAITE 393
Cdd:cd17935  55 IVTHSNQALNQLFEKIMALDIDE 77
Helicase_C pfam00271
Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, ...
554-669 6.10e-04

Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.


Pssm-ID: 459740 [Multi-domain]  Cd Length: 109  Bit Score: 40.27  E-value: 6.10e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668   554 LTMLERMIAAANaGAQVCLICNLVDVAQVCYqrLKELNNtqVDIDLFHARFTLNDRREkenrVISNFgKNGKRNVgriLV 633
Cdd:pfam00271   3 LEALLELLKKER-GGKVLIFSQTKKTLEAEL--LLEKEG--IKVARLHGDLSQEEREE----ILEDF-RKGKIDV---LV 69
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 16130668   634 ATQVVEQSLDV---------DFDWlitqhCPADLLfQRLGRLHRH 669
Cdd:pfam00271  70 ATDVAERGLDLpdvdlvinyDLPW-----NPASYI-QRIGRAGRA 108
SF2_C_RecG cd18811
C-terminal helicase domain of DNA helicase RecG; ATP-dependent DNA helicase RecG plays a ...
556-644 3.14e-03

C-terminal helicase domain of DNA helicase RecG; ATP-dependent DNA helicase RecG plays a critical role in recombination and DNA repair. RecG helps process Holliday junction intermediates to mature products by catalyzing branch migration. It is a DEAD-like helicase belonging to superfamily (SF)2, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Similar to SF1 helicases, SF2 helicases do not form toroidal structures like SF3-6 helicases. Their helicase core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


Pssm-ID: 350198 [Multi-domain]  Cd Length: 159  Bit Score: 39.25  E-value: 3.14e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 16130668 556 MLERMIAAANAGAQVCLICNLVD--------VAQVCYQRLKELNNTQVDIDLFHARFTlndRREKEnRVISNFgkngKRN 627
Cdd:cd18811  15 VYEFVREEIAKGRQAYVIYPLIEesekldlkAAVAMYEYLKERFRPELNVGLLHGRLK---SDEKD-AVMAEF----REG 86
                        90
                ....*....|....*..
gi 16130668 628 VGRILVATQVVEQSLDV 644
Cdd:cd18811  87 EVDILVSTTVIEVGVDV 103
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
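
To illustrate how the E-value threshold relates to the hit list, here is a small sketch that filters and ranks a few of the hits reported above. The tuples are copied from this page (accession, query interval, E-value). The function name `filter_hits` and the inclusive comparison are assumptions made for illustration, not a description of how CD-Search applies its cutoff internally.

```python
# (accession, query_from, query_to, e_value), taken from the hit list above.
HITS = [
    ("COG1203", 144, 756, 1.90e-115),
    ("cd17930", 307, 499, 1.05e-63),
    ("pfam18019", 11, 236, 7.12e-47),
    ("smart00490", 584, 670, 1.42e-06),
    ("cd18811", 556, 644, 3.14e-03),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold, best first."""
    kept = [h for h in hits if h[3] <= threshold]
    return sorted(kept, key=lambda h: h[3])


# With the page's preset threshold every listed hit is retained;
# a stricter cutoff keeps only the strongest matches.
for accession, start, end, e_value in filter_hits(HITS, threshold=1e-10):
    print(accession, start, end, e_value)
```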

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.