|
|
|
Status |
Public on Jan 13, 2021 |
Title |
CnR_SALL4_nuc_repl3 |
Sample type |
SRA |
|
|
Source name |
SNU398_CnR_SALL4_nuc
|
Organism |
Homo sapiens |
Characteristics |
cell line: SNU398
cell type: Hepatocellular Carcinoma cell line
chip antibody: SALL4 (CST D16H12)
molecule subtype: nuclear extract
|
Growth protocol |
SNU398 cells were grown in RPMI media with 10% FBS
|
Extracted molecule |
genomic DNA |
Extraction protocol |
Briefly, 2 million cell nuclei were immobilized on Concanavalin A beads after washing. SALL4 antibody (CST D16H12) or normal rabbit IgG (Cell Signaling DA1E) was incubated with the nuclei overnight at 4 °C in the presence of 0.02% digitonin. The next day, 700 ng/mL of protein A-micrococcal nuclease (pA-MNase, purified in house with the vector from Addgene 86973, following the protocol from Schmid et al.) was incubated with the nuclei at 4 °C for an hour. After washing, the tubes were placed in heat blocks on ice set to 0 °C, and CaCl2 (1 mM) was added and incubated for 30 minutes before 2x Stop buffer containing EDTA was added. DNA was eluted by heat and high-speed spin, then phenol-chloroform extracted. The NEBNext Ultra II DNA library prep kit (NEB E7645) was used to make the libraries according to Liu et al.’s protocol, outlined on protocols.io (dx.doi.org/10.17504/protocols.io.wvgfe3w). Paired-end (42 bp) Illumina sequencing was performed on the barcoded and amplified libraries.
|
|
|
Library strategy |
OTHER |
Library source |
genomic |
Library selection |
other |
Instrument model |
Illumina MiniSeq |
|
|
Data processing |
library strategy: CUT&RUN

CUT&RUN - Cut and Run Analysis Pipeline (CnRAP) scripts can be found on GitHub (https://github.com/mbassalbioinformatics/CnRAP). Raw fastq files were trimmed with Trimmomatic (REF, v0.36 tested) in paired-end mode with the flags “ILLUMINACLIP: <adapter_path> Truseq3.PE.fa:2:15:4:4:true LEADING:20 TRAILING:20 SLIDINGWINDOW:4:15 MINLEN:25”. Next, the kseq trimmer developed by the Orkin lab was run on each fastq file; it has no flags to modify.

For alignment, BWA (v0.7.17-r1188 tested) was first run in “aln” mode against a masked hg38 genome downloaded from UCSC to create *.sai files. Next, BWA was run in “sampe” mode with the flag “-n 20” on the *.sai files. Afterwards, stampy (v1.0.32 tested) was run in “--sensitive” mode. For mapping statistics, bamtools (v2.5.1 tested) “stat” was used. Post alignment, unmapped reads were removed from the bam files using bamtools with the flags “filter -isMapped true”. Next, using samtools (v1.5 tested), bam files were sorted (“sort -l 9 -O bam”), had read pair mates fixed (“fixmate”), and were indexed (“index”). Bam coverage maps were generated using bamCoverage from the deepTools suite (v2.5.7 tested). The same procedure was used to align the fastq files to a masked Saccharomyces cerevisiae v3 (sacCer3) genome, also downloaded from UCSC.

In preparation for peak calling, a normalization factor was determined for each hg38-aligned replicate based on the corresponding number of proper pairs aligned to the sacCer3 genome, as recommended in the Henikoff pipeline. This was calculated as follows: normalization_factor = 10,000,000 / (#proper_pairs / 2). The number of ‘proper-pairs’ was extracted from the bamtools mapping statistics calculated previously.

Next, from the hg38-aligned bams, ‘proper-paired’ reads were extracted using samtools with the flags “view -b -f 2 -F 524”, with the output piped into bedtools with the flags “genomecov -bg -scale <normalization_factor> -ibam stdin”. This produced bed files of ‘proper-paired’ reads normalized to the number of reads aligned to the sacCer3 genome. Bedgraphs of these normalized bed files were generated as intermediary files to facilitate generation of bigwig coverage maps using bedGraphToBigWig from UCSC (v4).

For peak calling, the recently developed SEACR (v1.1 tested) was run in both “stringent” and “relaxed” modes with the flag “non”, as the bed files were already normalized to the number of yeast spike-in reads. Subsequently, peak file columns were rearranged to facilitate motif discovery using both HOMER (v4.10 tested, flags “-size given,50,100,200 -mask -p 20 -S 50”) and MEME (v5.0.5 tested, “-dreme-m 50 -meme-nmotifs 50”). Peaks were annotated using the R package ChIPseeker (v1.20.0 tested). Overlapping peak subsets were generated using mergePeaks.py from the HOMER suite with the flag “-d 1000”.

NicE-seq - Raw fastq reads were trimmed and mapped in the same way as described above for CUT&RUN data. Peaks were called with Model-based Analysis of ChIP-seq (MACS) (v2.1.1.20160309) for the scrambled control sample and the SALL4 KD sample separately. Peaks unique to each sample were identified with the HOMER mergePeaks function. bamCoverage (described above for CUT&RUN) was used to generate BigWig files for visualization.

RNA-Seq - The raw fastq reads were trimmed using TrimGalore (v.0.4.5) and mapped to the hg38 genome downloaded from UCSC using STAR. The read counts table was generated with featureCounts. Differential gene expression analysis comparing scrambled control samples with SALL4 KD samples was performed using DESeq2, and the fold change was plotted as a volcano plot using ggplot2.
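The spike-in normalization described above can be sketched in Python. This is a minimal illustration only, not the CnRAP implementation: it assumes the formula is read as 10,000,000 divided by half the proper-pair read count, and the function name and the example count are hypothetical.

```python
def spike_in_normalization_factor(proper_pair_reads: int,
                                  constant: int = 10_000_000) -> float:
    """Scale factor from the number of properly paired reads aligned to sacCer3.

    Assumption: the mapping statistics count individual reads, so dividing
    by 2 converts the count into paired-end fragments.
    """
    if proper_pair_reads <= 0:
        raise ValueError("need at least one properly paired spike-in read")
    fragments = proper_pair_reads / 2
    return constant / fragments


if __name__ == "__main__":
    # Hypothetical count, not taken from this sample's mapping statistics.
    example_reads = 40_000
    print(spike_in_normalization_factor(example_reads))  # 500.0
```

The resulting value is what would be substituted for <normalization_factor> in the bedtools “genomecov -scale” call.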
Genome_build: (human) hg38, (yeast) sacCer3
Supplementary_files_format_and_content: peak files; gene x features expression matrix tab delimited text
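The samtools filter “view -b -f 2 -F 524” mentioned in the data processing description can be unpacked with a short Python sketch. The bit meanings follow the SAM format specification; the helper names are hypothetical and the code only mimics the filter logic, it does not call samtools.

```python
# SAM FLAG bits as defined in the SAM format specification.
SAM_FLAG_BITS = {
    1: "read paired",
    2: "read mapped in proper pair",
    4: "read unmapped",
    8: "mate unmapped",
    16: "read reverse strand",
    32: "mate reverse strand",
    64: "first in pair",
    128: "second in pair",
    256: "not primary alignment",
    512: "read fails platform/vendor quality checks",
    1024: "PCR or optical duplicate",
    2048: "supplementary alignment",
}


def decode_flag(value: int) -> list[str]:
    """Return the names of all bits set in a SAM FLAG value."""
    return [name for bit, name in sorted(SAM_FLAG_BITS.items()) if value & bit]


def passes_filter(flag: int, require: int = 2, exclude: int = 524) -> bool:
    """Mimic `samtools view -f require -F exclude` for a single FLAG value.

    -f keeps reads with ALL required bits set; -F drops reads with ANY
    excluded bit set.
    """
    return (flag & require) == require and (flag & exclude) == 0


if __name__ == "__main__":
    print(decode_flag(524))   # bits 4, 8 and 512
    print(passes_filter(99))  # properly paired, mapped read -> True
    print(passes_filter(73))  # mate unmapped, not a proper pair -> False
```

Read this way, the filter keeps reads flagged as properly paired and discards any read that is unmapped, has an unmapped mate, or failed quality checks.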
|
|
|
Submission date |
Aug 26, 2019 |
Last update date |
Jan 13, 2021 |
Contact name |
Mahmoud Adel Bassal |
E-mail(s) |
[email protected]
|
Organization name |
Beth Israel Deaconess Medical Center
|
Department |
Hematology and Oncology
|
Lab |
Tenen Lab
|
Street address |
3 Blackfan Circle
|
City |
Boston |
State/province |
Massachusetts |
ZIP/Postal code |
02131-4834 |
Country |
USA |
|
|
Platform ID |
GPL22790 |
Series (1) |
GSE136332 |
Zinc finger protein SALL4 functions through an AT-rich motif to regulate gene expression |
|
Relations |
BioSample |
SAMN12637035 |
SRA |
SRX6759430 |
Supplementary file | Size | Download | File type/resource |
GSM4046321_CnR_SALL4_nuc_repl3.bw | 56.7 Mb | (ftp)(http) | BW |
GSM4046321_CnR_SALL4_repl3_relaxed_peaks.bed.gz | 4.7 Mb | (ftp)(http) | BED |
SRA Run Selector |
Raw data are available in SRA |
Processed data provided as supplementary file |
Processed data are available on Series record |
|
|
|
|
|