Skip to content

a collection of R-based tools for the exploration and analysis of nanopore sequence data

License

Notifications You must be signed in to change notification settings

sagrudd/floundeR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

floundeR

An R package to simplify the tidy analysis of Nanopore sequence data.

Build Status Releases

The goal of floundeR is to provide a robust and Tidyverse compliant toolbox that can be used to explore Nanopore sequence data. The analytical R code contained within has been forked from earlier projects such as nanopoRe. The R code has been deduplicated and reimplemented as R6 objects.

The project goals diverge from earlier versions of nanopoRe and the ambition is now to enable abbreviated and simplified data exploration.

The documentation for the floundeR package can be found at https://sagrudd.github.io/floundeR/

Installation

There are a couple of dependencies for the successful usage of the floundeR package.

You can install the released version of floundeR from github with:

install.packages("devtools")
devtools::install_github("sagrudd/floundeR")

# devtools does not appear to pull in some of the bioconductor stuff ...
install.packages("BiocManager")
BiocMananger::install("rhdf5")
# please see documents below - this requires the vbz plugin

floundeR and a BasicQC analysis to assess a flowcell run

BasicQC was the name of the original Nanopore tutorial that introduced an R workflow to question a flowcell's performance on the basis of the sequencing_summary file produced by MinKNOW or Guppy (Albacore in the past). With the transition to (Python based) EPI2ME Labs and the deprecation of the original tutorial this workflow is being maintained to support the requirements of R-aficionados.

Instead of using the packaged and toy dataset, let's use a slightly more robust dataset to show what the tool can really do.

aws.s3::save_object(
   "/gm24385_2020.11/flowcells/20201026_1645_6B_PAG07165_d42912aa/sequencing_summary_PAG07165_2dfda515.txt", 
   bucket = "s3://ont-open-data/", 
   region="eu-west-1")
library(floundeR)
sequencing_summary <- "sequencing_summary_PAG07165_2dfda515.txt"
seqsum <- SequencingSummary$new(sequencing_summary)
seqsum$flowcell$density_data$plot

About

a collection of R-based tools for the exploration and analysis of nanopore sequence data

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages