Releases: akanz1/klib
Releases · akanz1/klib
v0.2.3
What's Changed
- using poetry for environment and build/publish
- added some tests
- restructured package
Full Changelog: v0.2.2...v0.2.3
v0.2.2
v0.2.1
v0.2.0
v0.1.5
Changelog:
Changes
Update dist_plot()
- Update the implementation of dist_plot() to be compatible with the latest version of seaborn (0.11.1). The old implementation is deprecated and will be removed in future versions.
- Introduce sampling for large datasets (10k rows) what significantly speeds up plotting. Summary statistics continue to be based on the entire dataset, however, the figures use 10000 randomly sampled points.
- Minor cosmetic changes
Several fixes and code quality improvements
v0.1.2
v0.1.1
klib v0.1.0
v0.0.91
Changelog:
Additions
- clean_column_names():
Cleans the column names of the provided Pandas Dataframe and optionally provides hints on duplicate and long column names. This functionality is also added to data_cleaning() by default.
Changes
-
small fixes and refinements
Revert from split = {None, 'pos', 'neg', 'above', 'below'} to split = {None, 'pos', 'neg', 'high', 'low'} for all correlation functions. -
increase test coverage
-
update docstrings:
Several updates to docstrings to improve clarity and conform with numpy style. -
black formatting:
Format the entire codebase with black.
v0.0.86
Changelog:
Changes
-
data_cleaning():
- Changed the default setting to do a shallow instead of a deep analysis of memory_usage.
- Lowers function runtime compared to the previous version by about 70% - 80%!
-
missingval_plot():
- Minor changes to font size and spacing to accommodate very large datasets (40+ cols)
-
update docstrings:
- Several updates to the readme, to the examples as well as to docstrings to improve clarity and formatting.