|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Apr 23, 2017 |
Title |
Dmel_RNA_R1 |
Sample type |
SRA |
|
|
Source name |
head
|
Organism |
Drosophila melanogaster |
Characteristics |
strain: y; cn, bw, sp tissue: head developmental stage: adult
|
Extracted molecule |
total RNA |
Extraction protocol |
Total RNA was obtained from each Drosophila yakuba strain. Approximately 50 fly heads were used to obtain total RNA (5 µg) using TRIzol reagent (Life) as per manufacturer’s instruction. RNA integrity was evaluated on the BioAnalyzer (Aligent). Oligo(dT) selection was performed by using Dynal magnetic beads (Invitrogen) according to the manufacturer's protocol. The mRNA was fragmented by addition of 5x fragmentation buffer (200 mM Tris acetate, pH 8.2, 500 mM potassium acetate and 150 mM magnesium acetate), heated at 94 °C for 90 seconds, and then transferred to ice and run over a Sephadex-G50 column (USA Scientific) to remove the fragmentation ions.The fragmented mRNA was incubated in 38 µl CE solution (50% (v/v) ethanol, 1.1 M triethylammoniumacetate (pH 8.6)) with 1.6 M acrylonitrile at 70 °C for 1 hr (ICE treatment). A parallel mock treatment was preformed without 1.6 M acrylonitrile. The RNA was purified with ethanol precipitation and reverse transcribed with random hexamers, added to prime first-strand reverse transcription according to the manufacturer's protocol (Invitrogen cDNA synthesis kit). After the first strand was synthesized, a custom second-strand synthesis buffer (Illumina) was added, and dNTPs, RNase H and Escherichia coli polymerase I were added to nick translate the second-strand synthesis for 2.5 h at 16 °C. The reaction was then cleaned up on a QiaQuick PCR column (Qiagen) and eluted in 30 µl EB buffer (Qiagen).HiSeq reads were sequenced as single-ended 50-mers and NextSeq reads were sequenced as 86-mers. The reads were mapped onto the Drosophila melanogaster genome (BDGP R5/dm3) or Drosophila yakuba genome (WUGSC 7.1/droYak2) using TopHat version 2.0.6 (Kim 2013) / Bowtie version 2.0.2 (Langmead 2012) using the following options: --read-realign-edit-dist 0 --read-edit-dist 4 --read-mismatches 4 -G <gene annotation file> -x 1 -g 1. Unmapped reads were allowed to remap using bsmap version 2.74 (Xi 2009) using the following options: -v 3 -M GA. The RNA was purified with ethanol precipitation and reverse transcribed with random hexamers, added to prime first-strand reverse transcription according to the manufacturer's protocol (Invitrogen cDNA synthesis kit). After the first strand was synthesized, a custom second-strand synthesis buffer (Illumina) was added, and dNTPs, RNase H and Escherichia coli polymerase I were added to nick translate the second-strand synthesis for 2.5 h at 16 °C. The reaction was then cleaned up on a QiaQuick PCR column (Qiagen) and eluted in 30 µl EB buffer (Qiagen)
|
|
|
Library strategy |
RNA-Seq |
Library source |
transcriptomic |
Library selection |
cDNA |
Instrument model |
Illumina NextSeq 500 |
|
|
Description |
mock treatment
|
Data processing |
An initial set of sites were obtained using samtools (Li 2009) version 0.1.19 requiring variants to be seen in at least two RNA-seq replicates and a minimum editing rate of 5%. We also required the following for at least one replicate: a p-value less than 0.01 for the variant frequency between the ICE and RNA samples, less coverage in the ICE-seq than the RNA-seq, and a lower variant frequency in the ICE-seq than the RNA-seq. The Storer-Kim method was used to determine the p-value because it is considered one of the better statistical tests, in terms of power and Type I error, particularly for small and/or unequal sample sizes. We limited ourselves to A-to-G, T-to-C, G-to-A, and C-to-T variants because of the error bias in the NextSeq. The A-to-G and T-to-C variants represented our potential set of RNA editing sites (T-to-C variants were included because we used an unstranded library protocol). The G-to-A and C-to-T variants were used as a negative control because these types of variants had a comparable error profile and drosophila do not have C-to-U RNA editing. We computed the IDR for each site using the change in the number of G’s from RNA-seq to ICE-seq as the signal value. The sites were sorted by IDR and a 0.85 fraction of A-to-G/T-to-C sites were used to determine a cutoff threshold. Of the remaining sites, we removed sites outside gene boundaries, sites that were not A-to-G in the direction of transcription, and sites in repetitive regions. We sequenced two biological replicates of RNA-seq libraries derived from Drosophila melanogaster heads. The libraries were sequenced on a HiSeq 2500 and NextSeq 500. The resulting reads were normalized for read number and read length and mapped onto the dm3 assembly using tophat2 (Kim 2013) version 2.0.6. Variants were called using samtools mpileup with any at least 1 read supporting a variant. We reasoned that true biological variants and mapping errors were shared between the two sequencers while sequencing errors should be in the outersects. Variants that were sequencer specific were determined to be sequencing errors. Gene ontology analysis was done using WebGestalt (Wang 2013). A significance level of 0.01 adjusted with the Benjamini-Hochberg procedure was used. A minimum of 5 genes per category was required. Genome_build: dm3/droYak2 Supplementary_files_format_and_content: bed files containing RNA editing sites or other variants
|
|
|
Submission date |
Aug 27, 2014 |
Last update date |
May 15, 2019 |
Contact name |
Eddie Park |
E-mail(s) |
[email protected]
|
Phone |
9498248393
|
Organization name |
UCI
|
Department |
DevCell
|
Lab |
Mortazavi
|
Street address |
2300 Biological Sciences 3
|
City |
Irvine |
State/province |
CA |
ZIP/Postal code |
92697 |
Country |
USA |
|
|
Platform ID |
GPL19132 |
Series (1) |
GSE60851 |
Analysis of canonical A-to-I RNA editing in Drosophila using ICE-seq |
|
Relations |
Reanalyzed by |
GSM3286947 |
BioSample |
SAMN03010384 |
SRA |
SRX688902 |
Supplementary data files not provided |
SRA Run Selector |
Raw data are available in SRA |
Processed data are available on Series record |
|
|
|
|
|