GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM4046317

Query DataSets for GSM4046317

Status

Public on Jan 13, 2021

Title

CnR_igg_repl2

Sample type

SRA

Source name

SNU398_CnR_igg

Organism

Homo sapiens

Characteristics

cell line: SNU398
cell type: Hepatocellular Carcinoma cell line
chip anitbody: normal rabbit IgG (Cell Signaling DA1E)
molecule subtype: nuclear extract

Growth protocol

SNU398 cells were grown in RPMI media with 10% FBS

Extracted molecule

genomic DNA

Extraction protocol

Briefly, 2 million cell nuclei were immobilized on Concanavalin A beads after washing. SALL4 (CST D16H12) or normal rabbit IgG (Cell Signaling DA1E) were incubated with the nuclei overnight in the presence of 0.02% digitonin at 4 degrees. The next day, 700ng/mL of proteinA-micrococcal nuclease (pA-Mnase purified in house with vector from Addgene 86973, protocol from Schmid et al.) were incubated with the nuclei at 4 degrees for an hour. After washing, the tubes were laced in heat blocks on ice set to 0 degrees, CaCl2 (1mM) was added and incubated for 30 minutes before 2x Stop buffer containing EDTA was added. DNA was eluted by heat and highspeed spin, then phenol-chloroform extracted.
NEBNext Ultra II DNA library prep kit (NEB E7645) was used to make the libraries according to Liu et al.’s protocol, outlined on protocols.io (dx.doi.org/10.17504/protocols.io.wvgfe3w)
Pair-end (42bp) Illumina sequencing was performed on the barcoded and amplified libraries

Library strategy

OTHER

Library source

genomic

Library selection

other

Instrument model

Illumina MiniSeq

Data processing

library strategy: CUT&RUN
Cut&Run - Cut and Run Analysis Pipeline (CnRAP) scripts can be found on github (https://github.com/mbassalbioinformatics/CnRAP). Raw fastq files were trimmed with Trimmomatic (REF, v0.36 tested) in paired end mode with the flags “ILLUMINACLIP: <adapter_path> Truseq3.PE.fa:2:15:4:4:true LEADING:20 TRAILING:20 SLIDINGWINDOW:4:15 MINLEN:25”. Next, the kseq trimmer developed by the Orkin lab was run on each fastq file, there were no flags to modify. For alignment, BWA (v0.7.17-r1188 tested) was first run in “aln” mode on a masked hg38 genome downloaded from UCSC to create *.sai files. Next, BWA was run in “sampe” mode with the flag “-n 20” on the *.sai files. Afterwards, stampy (v1.0.32 tested) was run in “--sensitive" mode. For mapping statistics, bamtools (v2.5.1 tested) “stat” was used. Post alignment, unmapped reads were removed from bam files using bamtools with the flags “filter -isMapped true.” Next, using samtools (v1.5 tested), bam files were sorted (“sort -l 9 -O bam”), had read pair mates fixed (“fixmate”) and indexed (“index”). Bam coverage maps were generated using bamCoverage from the deeptools suite (v2.5.7 tested). The same procedure was run to align fastq files to a masked genome Saccharomyces Cerevisiae v3 (sacCer3), also downloaded from UCSC. In preparation for peak calling, a normalization factor was determined for each hg38 aligned replicate based on the corresponding number of proper-pairs aligned to the sacCer3 genome, as recommended in the Henikoff pipeline. This was calculated as follows normalization_factor=10,000,000\(#proper_pairs/2). The number of ‘proper-pairs’ was extracted from the bamtools mapping statistics calculated previously. Next, from the hg38 aligned bams, ‘proper-paired’ reads were extracted using samtools with the flags “view -b -f 2 -F 524” with the output piped into bedtools with the flags “genomecov -bg -scale <normalization_factor> -ibam stdin.” This produced bed files of ‘proper-paired’ reads that have been normalized to the number of reads aligned back to the sacCer3 genome. Bedgraphs of these normalized bed files were generated as intermediary files to facilitate generation of bigwig coverage maps using the bedGraphToBigWig from UCSC (v4). For peak calling, the recently developed SEACR (v1.1 tested) was utilized and run in both “stringent” and “relaxed” mode to produce peak files with the flag “non” as the bed files were already normalized to the number of yeast spike-in reads. Subsequently peak file columns were re-arranged to facilitate motif discovery using both HOMER (v4.10 tested, flags “-size given,50,100,200 -mask -p 20 -S 50”) and MEME (v5.0.5 tested, “-dreme-m 50 -meme-nmotifs 50”). Peaks were annotated using the R package ChIPSeeker (v1.20.0 tested). Overlapping peak subsets were generated using mergePeaks.py from the HOMER suite with the flag “-d 1000”.
NicE-seq - Raw fastq reads were trimmed and mapped in the same way as described above for CUT&RUN data. Peaks were called with Model-based Analysis of ChIP-seq (MACS) (v2.1.1.20160309) for scrambled control sample and SALL4 KD sample separately. Unique peaks to each sample was identified with HOMER mergePeaks function. bamCoverage (described above for CUT&RUN) was used to generate BigWig files for visualization.
RNA-Seq - The raw fastq reads were trimmed using TrimGalore (v.0.4.5) and mapped to hg38 genome downloaded from UCSC using STAR. The read counts table was generated with the featureCounts. Differential gene expression analysis was performed to compare scrambled control samples with SALL4 KD samples using DESeq2 and the fold change was plotted as a volcano plot using ggplot2.
Genome_build: (human) hg38, (yeast) saccer3
Supplementary_files_format_and_content: peak files; gene x features expression matrix tab delimited text

Submission date

Aug 26, 2019

Last update date

Jan 13, 2021

Contact name

Mahmoud Adel Bassal

E-mail(s)

[email protected]

Organization name

Beth Israel Deaconess Medical Center

Department

Hematology and Oncology

Lab

Tenen Lab

Street address

3 Blackfan Circle

City

Boston

State/province

Massachusetts

ZIP/Postal code

02131-4834

Country

USA

Platform ID

GPL22790

Series (1)

GSE136332

Zinc finger protein SALL4 functions through an AT-rich motif to regulate gene expression

Relations

BioSample

SAMN12637039

SRA

SRX6759426

Supplementary file	Size	Download	File type/resource
GSM4046317_CnR_igg_repl2.bw	47.4 Mb	(ftp)(http)	BW
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file
Processed data are available on Series record