26 Dec 15:47

akanz1

e76c19c

v0.2.3

What's Changed

using poetry for environment and build/publish
added some tests
restructured package

Full Changelog: v0.2.2...v0.2.3

Assets 2

17 Dec 15:42

akanz1

v0.2.2

12ed660

v0.2.2

Full Changelog: v0.2.1...v0.2.2

Fixes a bug in missingvalue plot (y-axis not displaying very small ratios of missing values). Thanks for pointing this out @Abermal

Contributors

Abermal

Assets 2

28 Nov 11:35

akanz1

v0.2.1

42165c1

v0.2.1

What's Changed

Update dependencies for python 3.10 by @akanz1 in #11

Full Changelog: v0.2.0...v0.2.1

Contributors

akanz1

Assets 2

23 Aug 16:32

akanz1

v0.2.0

75b2a60

v0.2.0

Changelog:

This release comes with several small fixes and improvements to the code quality.

Assets 2

17 Jan 15:10

akanz1

v0.1.5

9eebe50

v0.1.5

Changelog:

Changes

Update dist_plot()

Update the implementation of dist_plot() to be compatible with the latest version of seaborn (0.11.1). The old implementation is deprecated and will be removed in future versions.
Introduce sampling for large datasets (10k rows) what significantly speeds up plotting. Summary statistics continue to be based on the entire dataset, however, the figures use 10000 randomly sampled points.
Minor cosmetic changes

Several fixes and code quality improvements

Assets 2

05 Nov 09:14

akanz1

v0.1.2

ba1c299

v0.1.2

Adjustments & Fixes:

clean_column_names: adding additional cases to column name cleaning
data_cleaning: update the printout format, especially for large datasets with many duplicate rows
update and improve docstrings, code formatting and clarity

Assets 2

07 Aug 14:43

akanz1

v0.1.1

f0b002a

v0.1.1

Adjustments & Fixes:

dist_plot: avoid running into an error when the dataframe includes a binary columns
dist_plot: update the colors and slightly improve runtime
cat_plot: fixed hard coded colors in the heatmap of cat_plot

Assets 2

06 Aug 05:16

akanz1

v0.1.0

b0c8aba

klib v0.1.0

Assets 2

01 Aug 18:05

akanz1

v0.0.91

bf81aaf

v0.0.91

Changelog:

Additions

clean_column_names():
Cleans the column names of the provided Pandas Dataframe and optionally provides hints on duplicate and long column names. This functionality is also added to data_cleaning() by default.

Changes

small fixes and refinements
Revert from split = {None, 'pos', 'neg', 'above', 'below'} to split = {None, 'pos', 'neg', 'high', 'low'} for all correlation functions.
increase test coverage
update docstrings:
Several updates to docstrings to improve clarity and conform with numpy style.
black formatting:
Format the entire codebase with black.

Assets 2

20 Jun 18:00

akanz1

v0.0.86

3c6b474

v0.0.86

Changelog:

Changes

data_cleaning():
- Changed the default setting to do a shallow instead of a deep analysis of memory_usage.
- Lowers function runtime compared to the previous version by about 70% - 80%!
missingval_plot():
- Minor changes to font size and spacing to accommodate very large datasets (40+ cols)
update docstrings:
- Several updates to the readme, to the examples as well as to docstrings to improve clarity and formatting.

Assets 2

Uh oh!

Releases: akanz1/klib

v0.2.3

What's Changed

Uh oh!

v0.2.2

Contributors

Uh oh!

v0.2.1

What's Changed

Contributors

Uh oh!

v0.2.0

Changelog:

Uh oh!

v0.1.5

Changelog:

Changes

Uh oh!

v0.1.2

Uh oh!

v0.1.1

Uh oh!

klib v0.1.0

Uh oh!

v0.0.91

Changelog:

Additions

Changes

Uh oh!

v0.0.86

Changelog:

Changes

Uh oh!