Series GSE215355
Status Public on Oct 14, 2023
Title Merging short and stranded long reads improves transcript assembly
Organism Homo sapiens
Experiment type Expression profiling by high throughput sequencing
Summary New tools for improved long-read transcript assembly and coalescence with its short-read counterpart are required. Using our short- and long-read measurements from different cell lines with spiked-in standards, we systematically compared key parameters and biases in the read alignment and assembly of transcripts. We report a cDNA synthesis artifact in long-read datasets that impacts the identity and quantitation of assembled transcripts. We developed a computational pipeline to strand long-read cDNA libraries that markedly improves assembly of transcripts from long reads. Incorporating stranded long reads in a new hybrid assembly approach, we demonstrate its efficacy for improved characterization of challenging lncRNA transcripts. Our workflow can be applied to a wide range of transcriptomics datasets for superior demarcation of transcript ends and refined isoform structure, which can enable better differential gene expression analyses and molecular manipulations of transcripts.
 
Overall design Nuclei from HAP1 cells were isolated by detergent lysis and fractionated. Three chromatin and three nucleoplasm RNA samples were then converted to cDNA using the NEBNext Ultra II Directional library kit and sequenced on an Illumina HiSeq 4000. In parallel, chromatin fraction RNA from two replicates was converted to cDNA using the Oxford Nanopore direct cDNA sequencing kit and sequenced on a MinION flow cell.
Web link https://doi.org/10.1371/journal.pcbi.1011576
 
Contributor(s) Kainth AS, Haddad GA, Ruthenburg AJ
Citation(s) 37883581
Submission date Oct 12, 2022
Last update date Nov 10, 2023
Contact name Alexander J Ruthenburg
E-mail(s) [email protected]
Organization name University of Chicago
Department Molecular Genetics and Cell Biology
Street address 920 E. 58th Street
City Chicago
State/province IL
ZIP/Postal code 60637
Country USA
 
Platforms (2)
GPL20301 Illumina HiSeq 4000 (Homo sapiens)
GPL24106 MinION (Homo sapiens)
Samples (8)
GSM6634264 HAP1_short-read_chromatinfraction_rep1
GSM6634265 HAP1_short-read_chromatinfraction_rep2
GSM6634266 HAP1_short-read_chromatinfraction_rep3
Relations
BioProject PRJNA889837

Download family Format
SOFT formatted family file(s) SOFT
MINiML formatted family file(s) MINiML
Series Matrix File(s) TXT

Supplementary file Size Download File type/resource
GSE215355_HAP1_long-read_StringTie_transcriptabundanceTPM.csv.gz 4.7 Mb (ftp)(http) CSV
GSE215355_HAP1_short-read_DESeq2_normalizedgenecounts.csv.gz 2.0 Mb (ftp)(http) CSV
SRA Run Selector
Raw data are available in SRA
Processed data are available on Series record
