Skip to content

Releases: kids-first/kf-cbioportal-etl

🚀 Automated ETL Workflow with Pip Installation Support

16 Dec 21:09
b45a2d9
Compare
Choose a tag to compare

This release introduces a fully refractored ETL pipeline that is now installable via pip, streamlining the setup and execution of the workflow.

  • users can easily trigger the ETL process via a single command
  • supports modular approach, allowing users to run specific steps of pipeline as needed
  • improved code modularity

What's Changed

  • ✏️ Minor Config/Template Updates by @migbro in #70
  • converted etl to standalone tool and updated readme.md by @wongjessica93 in #72

New Contributors

Full Changelog: v1.6.0...v2.0.0

📝 OPC V12 Doc Revision

23 Oct 14:55
Compare
Choose a tag to compare
Pre-release

This is a special case in which this particular branch has code and documentation relevant to the OpenPedCan v12 load, broken by later releases. It is a niche use case in the even that this specific study needs further revision or simply referenced for a repeat load.

Full Changelog: 0.8.1...v0.8.2

🗻CBTN Summit + Chordoma Updates

18 Oct 13:08
b53f371
Compare
Choose a tag to compare

Config and software updates made to help simplify ETL

  • Updated configs to reflect CNV changes
  • Updated download script to allow for new cbio_file_name_id.txt format that now has file_id and s3_path so that ...
    genomic file manifests eliminated and folded into cbio_file_name_id.txt files so that the command is much simpler. See line 21 in README documentation to see the difference
  • Updated maf merge to record sym link errors
  • Some QOL formatting updates

What's Changed

  • ⛰️ PBTA Summit and Chordoma Updates by @migbro in #68

Full Changelog: v1.5.0...v1.6.0

🛠️ Modify CNV Filtering amd fix chrodoma configs

26 Sep 15:18
37568b6
Compare
Choose a tag to compare
  • Provisional study loads of CNV data used a min CNV length cutoff of 50kb. It was recently found to be too simplistic a cut off as some WXS samples had oncogenes filtered out because the region was < 50kb. Since we currently use ControlFreeC inputs for provisional study loads, we have switch to requiring both the WilcoxonRankSumTestPvalue and KolmogorovSmirnovPvalue be less than 0.05 to consider geens in that region worthy of loading.
  • Also update the chordoma study data config files to point to the correct resources for CNV gene naming

What's Changed

Full Changelog: v1.4.0...v1.5.0

🛠️ Study Updates and Bug Fixes

17 Sep 15:03
6523cb3
Compare
Choose a tag to compare

What's Changed

  • 📝 update sample template to reflect new column addition by @migbro in #63
  • 🚀 add pbta pnoc BIXU-3723 by @migbro in #64
  • 🛠️ Update chdm, fix OpenPedCan DGD bug by @migbro in #66
  • ✏️ update paths, rerorg README for clarity by @migbro in #65

Full Changelog: v1.3.0...v1.4.0

🤩 Added Treatment Data to `pbta_all`

21 Jun 19:18
5d94e25
Compare
Choose a tag to compare

What's Changed

Full Changelog: v1.2.0...v1.3.0

🚀 OpenPedCan v15 and tll_sd_aq9kvn5p_2019 (Teachey) Added

21 Mar 15:17
c03c5d2
Compare
Choose a tag to compare
  • OpenPedCan v15 Added, with several bug fixes and adjustments to evolving realities of the project
  • Added new KF study and refactored another in terms of config file

What's Changed

Full Changelog: v1.1.0...v1.2.0

✨ Clinical Data Diff Tool

01 Mar 14:44
e3cd256
Compare
Choose a tag to compare

Added a tool to identify and summarize changes slated to be made to an existing study on the portal based on the cBio formatted data_clinical files. It does the following for each of sample and patient views:

  • Create a list, one per line, per ID, per attribute, of what would change if the data were loaded
  • Output list of IDs that would be removed from the portal, if any
  • Output list of IDs that would be added if any
  • Create summary of the number of changes of each attribute type printed to STDOUT
    Also contains study updates

What's Changed

Full Changelog: v1.0.1...v1.1.0

🔧 Fix Pandas, Numpy Calls

09 Nov 18:24
c2b5bfc
Compare
Choose a tag to compare

A recent upgrade in software versions used for pandas and numpy has caused some functions being used to be deprecated. This PR fixes that, and will be followed with an accompanying docker image and software list.

What's Changed

  • 🔧 Fixes for numpy pandas compatibility by @migbro in #53

Full Changelog: v1.0.0...v1.0.1

🧹Cleanup Legacy

19 Oct 17:30
1c9075a
Compare
Choose a tag to compare
  • Minor change to PBTA config to accommodate new file_type entries in manifests
  • Removed many legacy files to cleanup repo
  • Will make this now public as most changes are related to config file updates, less so software

What's Changed

Full Changelog: v0.9.1...v1.0.0