NCBI Logo
GEO Logo
   NCBI > GEO > Accession DisplayHelp Not logged in | LoginHelp
GEO help: Mouse over screen elements for information.
          Go
Series GSE215357 Query DataSets for GSE215357
Status Public on Oct 14, 2023
Title Merging short and stranded long reads improves transcript assembly
Organism Mus musculus
Experiment type Expression profiling by high throughput sequencing
Summary New tools for improved long-read transcript assembly and coalescence with its short-read counterpart are required. Using our short- and long-read measurements from different cell lines with spiked-in standards, we systematically compared key parameters and biases in the read alignment and assembly of transcripts. We report a cDNA synthesis artifact in long-read datasets that impacts the identity and quantitation of assembled transcripts. We developed a computational pipeline to strand long-read cDNA libraries that markedly improves assembly of transcripts from long-reads. Incorporating stranded long-reads in a new hybrid assembly approach, we demonstrate its efficacy for improved characterization of challenging lncRNA transcripts. Our workflow can be applied to a wide range of transcriptomics datasets for superior demarcation of transcript ends and refined isoform structure, which can enable better differential gene expression analyses and molecular manipulations of transcripts.
 
Overall design Nuclei from HL1 cells were isolated by detergent lysis, fractionated, then three chromatin and three nucleplasm RNA samples were converted to cDNA using NEBNext Ultra II Directional library kit, and sequenced on Illumina HiSeq 4000. In parallel, chromatin fraction RNA from two replicates were converted to cDNA using Oxford Nanopore direct cDNA sequencing kit and sequenced on MinION flowcell.
Web link http://10.1371/journal.pcbi.1011576
 
Contributor(s) Kainth AS, Haddad GA, Hall JM, Ruthenburg AJ
Citation(s) 37883581
Submission date Oct 12, 2022
Last update date Nov 10, 2023
Contact name Alexander J Ruthenburg
E-mail(s) [email protected]
Organization name University of Chicago
Department Molecular Genetics and Cell Biology
Street address 920 E. 58th Street
City Chicago
State/province IL
ZIP/Postal code 60637
Country USA
 
Platforms (2)
GPL21103 Illumina HiSeq 4000 (Mus musculus)
GPL24973 MinION (Mus musculus)
Samples (8)
GSM6634277 HL1_short-read_chromatinfraction_rep1
GSM6634278 HL1_short-read_chromatinfraction_rep2
GSM6634279 HL1_short-read_chromatinfraction_rep3
Relations
BioProject PRJNA889840

Download family Format
SOFT formatted family file(s) SOFTHelp
MINiML formatted family file(s) MINiMLHelp
Series Matrix File(s) TXTHelp

Supplementary file Size Download File type/resource
GSE215357_HL1_long-read_StringTie_transcriptabundanceTPM.csv.gz 2.9 Mb (ftp)(http) CSV
GSE215357_HL1_short-read_DESeq2_normalizedgenecounts.csv.gz 1.8 Mb (ftp)(http) CSV
SRA Run SelectorHelp
Raw data are available in SRA
Processed data are available on Series record

| NLM | NIH | GEO Help | Disclaimer | Accessibility |
NCBI Home NCBI Search NCBI SiteMap