NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Sample GSM1342901 Query DataSets for GSM1342901
Status Public on Jul 01, 2014
Title 1304203 NTL [expression]
Sample type SRA
 
Source name liver, NTL
Organism Homo sapiens
Characteristics disease status: hepatocellular carcinoma (HCC)
tissue: non-tumor liver
gender: M
age (y): 62
patient: 1304203
Extracted molecule total RNA
Extraction protocol Total RNA was extracted using the Invitrogen TRIzol® Reagent and then treated with RNase-free DNase I (Ambion) for 30 min. The integrity of total RNA was checked using an Agilent 2100 Bioanalyser.
cDNA libraries were prepared according to the manufacturer's instructions (Illumina). The poly(A)-containing mRNA molecules were purified using Oligo(dT) Beads (Illumina) from 20 ug of total RNA from each sample. Tris–HCl (10 mM) was used to elute the mRNA from the magnetic beads. To avoid priming bias when synthesizing cDNA, mRNA was fragmented before the cDNA synthesis. Fragmentation was performed using divalent cations at an elevated temperature. The cleaved mRNA fragments were converted into double-stranded cDNA using SuperScript II, RNaseH and DNA Pol I, primed by random primers. The resulting cDNA was purified using the QIAquick PCR Purification Kit (Qiagen). Then, cDNA was subjected to end-repair and phosphorylation using T4 DNA polymerase, Klenow DNA polymerase and T4 PNK. Subsequent purifications were performed using the QIAquick PCR Purification Kit (Qiagen). These repaired cDNA fragments were 3'-adenylated using Klenow Exo- (Illumina) and purified using the MinElute PCR Purification Kit (Qiagen), producing cDNA fragments with a single 'A' base overhang at the 3'-end for subsequent ligation to the adapters. Illumina PE adapters were ligated to the ends of these 30-adenylated cDNA fragments and then purified using the MinElute PCR Purification Kit (Qiagen). To select a size range of templates for downstream enrichment, the products of the ligation reaction were purified on a 2% TAE-agarose gel (Certified Low-Range Ultra Agarose, Biorad). cDNA fragments (200±20bp) were excised from the gel and extracted using the QIAquick Gel Extraction Kit (Qiagen). Fifteen rounds of PCR amplification were performed to enrich the adapter-modified cDNA library using primers complementary to the ends of the adapters [PCR Primer PE 1.0 and PCR Primer PE 2.0 (Illumina)]. The 200±20 bp PCR products were purified using QIAquick Gel Extraction Kit (Qiagen), using the MinElute spin columns (Qiagen). Finally, after detection on an Agilent Technologies 2100 Bioanalyser using the Agilent DNA 1000 chip kit and quantification on a StepOne plus qPCR (ABI), the cDNA library products were sequenced by a pair-end 90bp strategy using the Illumina Genome Analyser.
 
Library strategy RNA-Seq
Library source transcriptomic
Library selection cDNA
Instrument model Illumina Genome Analyzer
 
Description s14_P
Processed data file: s13_C-VS-s14_P.GeneExp.txt.gz
Data processing Firstly, data cleaning (or data filtering) is performed to obtain clean reads (or "clean data") for further analysis. The procedure includes the following steps: 1, Remove reads with adaptor sequences; 2, Remove reads in which the percentage of unknown bases (N) is greater than 10%; and 3, Remove low-quality reads. If the percentage of the low-quality base (base with quality value ≤ 5) is greater than 50% in a read, we define this read as low quality. Then, clean reads were mapped to genomic sequences or reference gene set using SOAPaligner/SOAP2 with no more than 2 mismatches.
The gene expression level is calculated by using the RPKM (Reads Per kb per Million reads) method (Mortazavi, A., B. A. Williams, et al.), and the formula is shown as follows: RPKM= (10^6*C)/(NL⁄10^3 ). Here C is number of reads that uniquely aligned to gene A, N is total number of reads that uniquely aligned to all genes, and L is number of bases of gene A. The RPKM method is able to eliminate the influence of different gene length and sequencing discrepancy on the calculation of gene expression level. Therefore, the RPKM values can be directly used for comparing the difference of gene expression among samples. If there is more than one transcript for a gene, the longest one is used to calculate its expression level and coverage. To identify differentially expressed genes between two samples, a more strict algorithm to be used to calculated the P-value corresponds to differential gene expression test (Audic, S and J. M. Claverie). The multiple tests P-values were then adjusted by FDR (False Discovery Rate). Lastly, we use "FDR ≤ 0.001 and the absolute value of fold-change value ≥ 1" as the threshold to judge the significance of gene expression difference.
Integration of promoter methylomic and transcriptomic profiling. To further clarify the genes that are potentially regulated by promoter methylation, we further performed pair-wise comparisons on promoter methylomes to identify specific DMRs between HCCs and peripheral normal tissues. Furthermore, we evaluated differential gene expression between 8 pairs of HCC and PN samples using Illumina high-throughput RNA-seq technology.
Genome_build: GRCh37 (UCSC hg19)
Supplementary_files_format_and_content: *.GeneExp.txt.gz files include all genes expressing in tumor and NTL sample pair. These files contain 10 columns. The format is: (1) geneID; (2) geneLength; (3) ctrl-Expression; (4) case-Expression; (5) ctrl-RPKM; (6) case-RPKM; (7) log2 Ratio (case/ctrl); (8) Up-Down-Regulation (case/ctrl); (9) P-value; and (10) FDR.
 
Submission date Mar 10, 2014
Last update date May 15, 2019
Contact name Desheng Gong
E-mail(s) [email protected]
Organization name Agricultural Genomes Institute at Shenzhen
Street address No.7 PengFei road
City Shenzhen
ZIP/Postal code 518120
Country China
 
Platform ID GPL9052
Series (2)
GSE55758 Promoter methylome and integrative transcriptome profiling identify key epigenetics-regulated genes in human HCCs [expression]
GSE55759 Promoter methylome and integrative transcriptome profiling identify key epigenetics-regulated genes in human HCCs
Relations
BioSample SAMN02680122
SRA SRX483910

Supplementary data files not provided
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap