SIGNIFICANT USER-VISIBLE CHANGES

BUG FIXES

BUG FIXES

SIGNIFICANT USER-VISIBLE CHANGES

  • Documentation website is now available at http://leekgroup.github.io/recount/. It gets updated with every commit on the master branch (bioc-devel) using GitHub Actions and pkgdown.
  • Added a NEWS.md file to track changes to the package.

NEW FEATURES

BUG FIXES

SIGNIFICANT USER-VISIBLE CHANGES

  • Renamed .load_install() as .load_check() as this function now only checks that the package(s) was installed and returns an error if missing. The error shows the user how to install the package(s) they are missing instead of installing them automatically. This complies with Marcel Ramos’ request at https://github.com/leekgroup/recount/issues/14.

NEW FEATURES

SIGNIFICANT USER-VISIBLE CHANGES

  • Cleaned up the documentation of add_metadata() and changed the default source from recount_brain_v1 to recount_brain_v2.

NEW FEATURES

  • citation('recount')[5] now lists the recount-brain bioRxiv pre-print citation information.

NEW FEATURES

BUG FIXES

  • I made the example code for geo_characteristics() more robust since currently rentrez can occasionally fails.

NEW FEATURES

BUG FIXES

BUG FIXES

SIGNIFICANT USER-VISIBLE CHANGES

BUG FIXES

  • Fix a NOTE about RefManageR.

SIGNIFICANT USER-VISIBLE CHANGES

  • Use BiocManager

BUG FIXES

  • Fix a unit test.

SIGNIFICANT USER-VISIBLE CHANGES

  • rse_tx URLs now point to v2 to reflect recent changes by Fu et al.

BUG FIXES

  • Change some examples to dontrun and improve the code that cleans up after the tests. This should reduce the size of files left in tmp although they didn’t seem too big to begin with.

SIGNIFICANT USER-VISIBLE CHANGES

NEW FEATURES

BUG FIXES

BUG FIXES

  • Fix a unit test for download_study(), add another test for the versions, and fix a NOTE in R CMD check.

NEW FEATURES

  • download_study() can now download the transcript counts (rse_tx.RData) files. The transcript estimation is described in Fu et al, 2018.

SIGNIFICANT USER-VISIBLE CHANGES

  • download_study() now has a version parameter (defaults to 2). This argument controls which version of the files to download based on the change on how exons were defined. Version 1 are reduced exons while version 2 are disjoint exons as described in further detail in the documentation tab of the recount website https://jhubiostatistics.shinyapps.io/recount/.
  • recount_url and the example rse_gene_SRP009615 have been updated to match the changes in version 2.

BUG FIXES

  • Changed reproduce_ranges() since disjoint exons are more useful than reduced exons for downstream analyses.

NEW FEATURES

SIGNIFICANT USER-VISIBLE CHANGES

SIGNIFICANT USER-VISIBLE CHANGES

SIGNIFICANT USER-VISIBLE CHANGES

NEW FEATURES

  • coverage_matrix() now has two new arguments: scale and round. Use scale = FALSE to get raw coverage counts, which you can then scale with scale_counts(). scale is set to TRUE by default, so the counts are scaled to a library size of 40 million reads. round is set to FALSE by default, but can be set to TRUE if you want to get integer counts, just as in the default of scale_counts().

SIGNIFICANT USER-VISIBLE CHANGES

  • Changed the default version argument of add_predictions() to latest. Internally, that’s still 0.0.03.

NEW FEATURES

  • Added the add_predictions() function which appends the predicted phenotypes to a RSE object downloaded with recount. The phenotypes were predicted by Shannon Ellis et al, 2017 (citation coming up soon!).

SIGNIFICANT USER-VISIBLE CHANGES

NEW FEATURES

  • Added the function getRPKM() which can be used with RangedSummarizedExperiment objects from recount and from other sources.

SIGNIFICANT USER-VISIBLE CHANGES

  • recount_url now includes the URLs for the GTEx bigWig files.

SIGNIFICANT USER-VISIBLE CHANGES

  • coverage_matrix() now returns a RangedSummarizedExperiment object. This matches the behavior of recount.bwtool::coverage_matrix_bwtool() and is more consistent with the use of RSE objects in recount.

BUG FIXES

BUG FIXES

  • Fixed a bug in the counts in coverage_matrix(). They were being incorrectly multiplied by 100.

SIGNIFICANT USER-VISIBLE CHANGES

  • Completed the change to Gencode v25 annotation for exon and gene counts.

SIGNIFICANT USER-VISIBLE CHANGES

  • We dropped TxDb.Hsapiens.UCSC.hg38.knownGene completely from recount and will be using Gencode v25 instead.

BUG FIXES

SIGNIFICANT USER-VISIBLE CHANGES

  • Updated the package so you can now access TCGA data. Now there’s over 8 terabytes of data available in the recount project!

SIGNIFICANT USER-VISIBLE CHANGES

SIGNIFICANT USER-VISIBLE CHANGES

  • Snaptron changed from stingray.cs.jhu.edu:8090 to snaptron.cs.jhu.edu so snaptron_query() has been changed accordingly.

SIGNIFICANT USER-VISIBLE CHANGES

  • The function reproduce_ranges() now has the db argument. By default it’s set to TxDb.Hsapiens.UCSC.hg38.knownGene to reproduce the actual information used in recount. But it can also be used with EnsDb.Hsapiens.v79 to use the ENSEMBL annotation. Then with coverage_matrix() you can get the counts for either an updated TxDb.Hsapiens.UCSC.hg38.knownGene or for EnsDb.Hsapiens.v79 at the exon and/or gene levels as shown in the vignette.

SIGNIFICANT USER-VISIBLE CHANGES

  • The vignette now describes how to download all the data, how to check exon-exon junctions by class, and how to use SciServer compute to access all the recount data (over 6 TB) via http://www.sciserver.org/

NEW FEATURES

  • Added the function snaptron_query() which queries Intropolis via Snaptron to find if an exon-exon junction is present in the data.

BUF FIXES

  • Fixed an bug in the vignette. Thanks to Michael Love for noticing it!

NEW FEATURES

  • Created the package skeleton for recount
  • Added the function reproduce_ranges() for re-creating the gene or exon level information used in the recount project.
  • Added the function abstract_search() for identifying SRA projects of interest by searching the abstracts.
  • Added the function browse_study() for opening a browser tab for further exploring a project.
  • Added the function download_study() for downloading the data from the recount project.
  • Added the function scale_counts() for properly scaling the counts before performing a differential expression analysis with the RangedSummarizedExperiment objects hosted in the recount project.
  • Added the function expressed_regions() for defining the expressed regions in a chromosome for a given SRA study.
  • Added the function coverage_matrix() for computing the coverage matrix based on the regions of interest for a given SRA study.
  • Added the function geo_info() for obtaining sample information from GEO.
  • Added the function find_geo() for finding the GEO accession id given a SRA run accession (id). This function will be useful for SRA projects that did not have GEO entries at the time recount’s data was created.
  • Added the function geo_characteristics() for building a data.frame() from geo_info()’s results for the characteristics.
  • Added the function all_metadata() which downloads all the phenotype data for all projects. This function can be useful for identifying projects and/or samples of interests.