|
|
GEO help: Mouse over screen elements for information. |
|
Status |
Public on Jul 01, 2014 |
Title |
1304203 NTL [expression] |
Sample type |
SRA |
|
|
Source name |
liver, NTL
|
Organism |
Homo sapiens |
Characteristics |
disease status: hepatocellular carcinoma (HCC) tissue: non-tumor liver gender: M age (y): 62 patient: 1304203
|
Extracted molecule |
total RNA |
Extraction protocol |
Total RNA was extracted using the Invitrogen TRIzol® Reagent and then treated with RNase-free DNase I (Ambion) for 30 min. The integrity of total RNA was checked using an Agilent 2100 Bioanalyser. cDNA libraries were prepared according to the manufacturer's instructions (Illumina). The poly(A)-containing mRNA molecules were purified using Oligo(dT) Beads (Illumina) from 20 ug of total RNA from each sample. Tris–HCl (10 mM) was used to elute the mRNA from the magnetic beads. To avoid priming bias when synthesizing cDNA, mRNA was fragmented before the cDNA synthesis. Fragmentation was performed using divalent cations at an elevated temperature. The cleaved mRNA fragments were converted into double-stranded cDNA using SuperScript II, RNaseH and DNA Pol I, primed by random primers. The resulting cDNA was purified using the QIAquick PCR Purification Kit (Qiagen). Then, cDNA was subjected to end-repair and phosphorylation using T4 DNA polymerase, Klenow DNA polymerase and T4 PNK. Subsequent purifications were performed using the QIAquick PCR Purification Kit (Qiagen). These repaired cDNA fragments were 3'-adenylated using Klenow Exo- (Illumina) and purified using the MinElute PCR Purification Kit (Qiagen), producing cDNA fragments with a single 'A' base overhang at the 3'-end for subsequent ligation to the adapters. Illumina PE adapters were ligated to the ends of these 30-adenylated cDNA fragments and then purified using the MinElute PCR Purification Kit (Qiagen). To select a size range of templates for downstream enrichment, the products of the ligation reaction were purified on a 2% TAE-agarose gel (Certified Low-Range Ultra Agarose, Biorad). cDNA fragments (200±20bp) were excised from the gel and extracted using the QIAquick Gel Extraction Kit (Qiagen). Fifteen rounds of PCR amplification were performed to enrich the adapter-modified cDNA library using primers complementary to the ends of the adapters [PCR Primer PE 1.0 and PCR Primer PE 2.0 (Illumina)]. The 200±20 bp PCR products were purified using QIAquick Gel Extraction Kit (Qiagen), using the MinElute spin columns (Qiagen). Finally, after detection on an Agilent Technologies 2100 Bioanalyser using the Agilent DNA 1000 chip kit and quantification on a StepOne plus qPCR (ABI), the cDNA library products were sequenced by a pair-end 90bp strategy using the Illumina Genome Analyser.
|
|
|
Library strategy |
RNA-Seq |
Library source |
transcriptomic |
Library selection |
cDNA |
Instrument model |
Illumina Genome Analyzer |
|
|
Description |
s14_P Processed data file: s13_C-VS-s14_P.GeneExp.txt.gz
|
Data processing |
Firstly, data cleaning (or data filtering) is performed to obtain clean reads (or "clean data") for further analysis. The procedure includes the following steps: 1, Remove reads with adaptor sequences; 2, Remove reads in which the percentage of unknown bases (N) is greater than 10%; and 3, Remove low-quality reads. If the percentage of the low-quality base (base with quality value ≤ 5) is greater than 50% in a read, we define this read as low quality. Then, clean reads were mapped to genomic sequences or reference gene set using SOAPaligner/SOAP2 with no more than 2 mismatches. The gene expression level is calculated by using the RPKM (Reads Per kb per Million reads) method (Mortazavi, A., B. A. Williams, et al.), and the formula is shown as follows: RPKM= (10^6*C)/(NL⁄10^3 ). Here C is number of reads that uniquely aligned to gene A, N is total number of reads that uniquely aligned to all genes, and L is number of bases of gene A. The RPKM method is able to eliminate the influence of different gene length and sequencing discrepancy on the calculation of gene expression level. Therefore, the RPKM values can be directly used for comparing the difference of gene expression among samples. If there is more than one transcript for a gene, the longest one is used to calculate its expression level and coverage. To identify differentially expressed genes between two samples, a more strict algorithm to be used to calculated the P-value corresponds to differential gene expression test (Audic, S and J. M. Claverie). The multiple tests P-values were then adjusted by FDR (False Discovery Rate). Lastly, we use "FDR ≤ 0.001 and the absolute value of fold-change value ≥ 1" as the threshold to judge the significance of gene expression difference. Integration of promoter methylomic and transcriptomic profiling. To further clarify the genes that are potentially regulated by promoter methylation, we further performed pair-wise comparisons on promoter methylomes to identify specific DMRs between HCCs and peripheral normal tissues. Furthermore, we evaluated differential gene expression between 8 pairs of HCC and PN samples using Illumina high-throughput RNA-seq technology. Genome_build: GRCh37 (UCSC hg19) Supplementary_files_format_and_content: *.GeneExp.txt.gz files include all genes expressing in tumor and NTL sample pair. These files contain 10 columns. The format is: (1) geneID; (2) geneLength; (3) ctrl-Expression; (4) case-Expression; (5) ctrl-RPKM; (6) case-RPKM; (7) log2 Ratio (case/ctrl); (8) Up-Down-Regulation (case/ctrl); (9) P-value; and (10) FDR.
|
|
|
Submission date |
Mar 10, 2014 |
Last update date |
May 15, 2019 |
Contact name |
Desheng Gong |
E-mail(s) |
[email protected]
|
Organization name |
Agricultural Genomes Institute at Shenzhen
|
Street address |
No.7 PengFei road
|
City |
Shenzhen |
ZIP/Postal code |
518120 |
Country |
China |
|
|
Platform ID |
GPL9052 |
Series (2) |
GSE55758 |
Promoter methylome and integrative transcriptome profiling identify key epigenetics-regulated genes in human HCCs [expression] |
GSE55759 |
Promoter methylome and integrative transcriptome profiling identify key epigenetics-regulated genes in human HCCs |
|
Relations |
BioSample |
SAMN02680122 |
SRA |
SRX483910 |
Supplementary data files not provided |
SRA Run Selector |
Raw data are available in SRA |
Processed data are available on Series record |
|
|
|
|
|