GEO Accession viewer

NCBI > GEO > Accession Display

Not logged in | Login

GEO help: Mouse over screen elements for information.

Sample GSM4594348

Query DataSets for GSM4594348

Status

Public on Dec 18, 2020

Title

Sample9_Total_HEK_Rep1

Sample type

SRA

Source name

Total_HEK

Organism

Homo sapiens

Characteristics

cell line/strain: HEK293-T
tissue source: Embrionic Kidney
genotype: wild-type
sample type: Footprints from cushion-isolated ribosomes
growth condition: DMEM with GlutaMAX, pyruvate, 10% FCS, pen/strep, 37°C, 5% CO2, humidified

Treatment protocol

General information on treatments is indicated in the corresponding sample characteristics field. For detailed information, please refer to the Material and Methods of the original publication.

Growth protocol

Human cells were cultivated in the indicated temperature and growth medium (as indicated in the corresponding sample "characteristics: growth conditions" field). Cells were passaged regularly through trypsinization. For all experiments cells were seeded 18-24 hours before lysis in 15 cm2 dishes (3.5 million U2OS and 6 million HEK293-T cells) to reach 70-90% confluency at the time of harvesting. E. coli cells were cultivated in the indicated temperature and growth medium (as indicated in the corresponding sample "characteristics: growth conditions") with shaking at 120 rpm. For all experiments 200 ml of EZ-RDM were inoculated with an overnight culture to an initial OD600 of 0.05 and grown untill a cell density of OD600 = 0.4 - 0.6 was reached for harvest.

Extracted molecule

total RNA

Extraction protocol

RNA from frozen ribosome fractions was extracted using the hot acid phenol method as described in Galmozzi and Merker et al. 2019
Libraries from isolated ribosome footprints were prepared following the previously published protocol (Galmozzi and Merker et al. 2019)

Library strategy

OTHER

Library source

transcriptomic

Library selection

other

Instrument model

NextSeq 550

Data processing

library strategy: Ribosome Profiling
Raw NextSeq550 sequencing files were converted with bcl2fastq2 conversion software (v2.20) to obtain single fastq files per indexed sample using following command: bcl2fastq -p16 -r4 -w4 --no-lane-splitting -R <path_to_input_folder> -o <path_to_output_folder>. Only samples 5-8 were sequenced by the DKFZ Sequencing Facility on a HighSeq2000 machine and converted to fastq files.
3' adaptor sequences were trimmed with Cutadapt (v1.13) with the following command: cutadapt --cores=4 -q20 -m23 --discard-untrimmed -O6 -a ATCGTAGATCGGAAGAGCACACGTCTGAACTCCAGTCAC -o <path_to_output>.fastq.gz <path_to_input>.fastq.gz For samples sequenced on HiSeq2000 following command was used: cutadapt --cores=4 -q20 -m23 --discard-untrimmed -O6 -a CTGTAGGCACCATCAATTCGTATGCCGTCTTCTGCTTG -o <path_to_output>.fastq.gz <path_to_input>.fastq.gz
Unique molecular identifiers (UMIs) were extracted from each read for the downstream analysis using a custom Julia script (available as supplement information of this study) with the following command: julia <path_to_script>/Script1.jl <path_to_input>.fastq.gz <path_to_output>.fastq.gz --umi3 5 --umi5 2. Only samples 5-8 did not contain UMI sequences in the adaptors, therefore this step was not included for U2OS samples.
For removal of human rRNA contaminants, sequenced libraries were aligned to rRNA sequences including: RNA45SN1 - RNA45SN5 (human 28S, 18S, 5.8S), RNA5S1 (human 5S), MTRNR2L10 (Mito 16S), MT-RNR1 (Mito 12S), MT-RNR2U1 (Mito 16S), 22 mito tRNAs, 631 nuclear human tRNAs. The E. coli rRNA file included: 5S rRNA (rrfA - rrfH), 16S rRNA (rrsA, rrsB, rrsC, rrsD, rrsE, rrsG, and rrsH), 23S rRNA (rrlA, rrlB, rrlC, rrlD, rrlE, rrlG, and rrlH). All rRNA contaminations were removed by Bowtie2 (v.2.3.5.1) with the following command: bowtie2 -p4 -t -x <path_to_folder_with_Bowtie-indexed_rRNA_contamination> -q <path_to_input_file>.fastq.gz' --un <path_to_output_file>.fastq -S /dev/null
Reads that did not align to rRNA were aligned to the human genome (GRCh38p10) or E.coli genome (GCA_000022665.2, E. coli BL21(DE3), including additional chromosome files containing custom plasmid sequences, see the corresponding sample description field) using STAR (v2.7.1a) with the following command: STAR --runThreadN 24 --genomeDir <path_to_indexed_genome> --readFilesIn <path_to_input>.fastq --outFilterMultimapNmax 1 --outFilterType BySJout --alignIntronMin 5 --outFileNamePrefix <path_to_output> --outReadsUnmapped Fastx --outSAMtype BAM SortedByCoordinate --outSAMattributes All XS --quantMode GeneCounts --twopassMode Basic
All reads aligning to the coding sequence (CDS) were assigned (a-, p-, e-site) via a custom Julia script (available as supplement information of this study). p-site assigned reads were used for all downstream analysis.This script accounts for soft-clipping via the '-c 1' option: the integer indicates the bases at the 5' end that often arise from untemplated addition by the reverse transcriptase during library preparation by using the following command: julia <path_to_script>/Script2.jl -c 1 -g <path_to_genome_annotation>.gff' -u -o <path_to_output> <path_to_input>.bam
Genome_build: Human genome (GRCh38p10), E.coli genome (GCF_000009565.1, E. coli BL21(DE3))
Supplementary_files_format_and_content: Processed data are stored in HDF5 files. Each HDF5 file contains one data set per gene. Each data set consists of a 2-row matrix, with the first row containing the 1-based position within the CDS, and the second row the number of detected P-site reads at this position. Additional information is stored in the data set attributes, including: gene and protein names, transcript isoform used for position assignment, length of the coding sequence, chromosome and strand location of the gene. In this study the p-side assignments stored in such HDF5 files were further analyzed using RiboSeqTools (available at: https://github.com/ilia-kats/RiboSeqTools).

Submission date

Jun 07, 2020

Last update date

Dec 18, 2020

Contact name

Kai Fenzl

E-mail(s)

[email protected]

Organization name

ZMBH

Lab

Prof. Bernd Bukau

Street address

INF 282

City

Heidelberg

ZIP/Postal code

69115

Country

Germany

Platform ID

GPL21697

Series (1)

GSE151959

Nascent chain interactions of proximal ribosomes drive homo-oligomer assembly

Relations

BioSample

SAMN15160148

SRA

SRX8489848

Supplementary file	Size	Download	File type/resource
GSM4594348_Total_HEK_Rep1.h5	32.3 Mb	(ftp)(http)	H5
SRA Run Selector
Raw data are available in SRA
Processed data provided as supplementary file