feat: add to_dict method for binning table serialization #371

sudo-hannes · 2025-08-28T06:37:06Z

Add to_dict method for binning table serialization

Summary

Implements a to_dict() method that converts optimal bins, split points, and transformations to dictionary format for easy serialisation and export.

…nning class

sudo-hannes · 2025-08-29T09:43:42Z

@guillermo-navas-palencia

I saw that the test failed because "http://lib.stat.cmu.edu/datasets/boston" is no longer available. I found a similar dataset on Kaggle. When I use this dataset, all but one of the tests pass.

Can I open a separate PR to fix the test issue?

optbinning/tests/datasets/datasets.py

Lines 1 to 20 in 0720b8d

    
           import numpy as np 
        
           import pandas as pd 
        
           class Data: 
        
               def __init__(self, data, target, feature_names): 
        
                   self.data = data 
        
                   self.target = target 
        
                   self.feature_names = feature_names 
        
           def load_boston(): 
        
               data_url = "http://lib.stat.cmu.edu/datasets/boston" 
        
               raw_df = pd.read_csv(data_url, sep=r"\s+", skiprows=22, header=None) 
        
               raw_data = np.hstack([raw_df.values[::2, :], raw_df.values[1::2, :2]]) 
        
               target = raw_df.values[1::2, 2] 
        
               feature_names = ['CRIM', 'ZN', 'INDUS', 'CHAS', 'NOX', 'RM', 'AGE', 'DIS', 
        
                                'RAD', 'TAX', 'PTRATIO', 'B', 'LSTAT'] 
        
               return Data(raw_data, target, feature_names)

guillermo-navas-palencia · 2025-08-29T10:16:50Z

@guillermo-navas-palencia

I saw that the test failed because "http://lib.stat.cmu.edu/datasets/boston" is no longer available. I found a similar dataset on Kaggle. When I use this dataset, all but one of the tests pass.

Can I open a separate PR to fix the test issue?

optbinning/tests/datasets/datasets.py

Lines 1 to 20 in 0720b8d

import numpy as np

import pandas as pd

class Data:

def __init__(self, data, target, feature_names):

self.data = data

self.target = target

self.feature_names = feature_names

def load_boston():

data_url = "http://lib.stat.cmu.edu/datasets/boston"

raw_df = pd.read_csv(data_url, sep=r"\s+", skiprows=22, header=None)

raw_data = np.hstack([raw_df.values[::2, :], raw_df.values[1::2, :2]])

target = raw_df.values[1::2, 2]

feature_names = ['CRIM', 'ZN', 'INDUS', 'CHAS', 'NOX', 'RM', 'AGE', 'DIS',

'RAD', 'TAX', 'PTRATIO', 'B', 'LSTAT']

return Data(raw_data, target, feature_names)

Hi @sudo-hannes. Thanks for your contribution! Please feel free to work on that PR :). In addition, we could consider saving a csv file and loading it directly without relying on external sources.

optbinning/binning/binning.py

feat: add to_dict method for binning table serialization

188d95b

sudo-hannes closed this Aug 29, 2025

Merge branch 'develop' into add-to-dict-method

d38c8a7

sudo-hannes reopened this Aug 29, 2025

sudo-hannes changed the base branch from master to develop August 29, 2025 07:53

sudo-hannes added 2 commits August 29, 2025 09:54

refactor: remove unnecessary assignment of binning_table in OptimalBi…

9b989e8

…nning class

fix: remove unnecessary blank line in OptimalBinning class

a2bab32

guillermo-navas-palencia added the enhancement New feature or request label Aug 29, 2025

guillermo-navas-palencia added this to ToDo Aug 29, 2025

guillermo-navas-palencia added this to the v0.21.0 milestone Aug 29, 2025

Merge branch 'develop' into add-to-dict-method

b44e4cd

guillermo-navas-palencia requested changes Sep 1, 2025

View reviewed changes

optbinning/binning/binning.py Outdated Show resolved Hide resolved

sudo-hannes added 2 commits September 1, 2025 11:33

Merge branch 'develop' into add-to-dict-method

058c039

fix: Correct indentation in docstring for OptimalBinning class

667abdd

guillermo-navas-palencia approved these changes Sep 1, 2025

View reviewed changes

guillermo-navas-palencia merged commit b00530b into guillermo-navas-palencia:develop Sep 1, 2025
12 checks passed

guillermo-navas-palencia mentioned this pull request Oct 26, 2025

Develop #361

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add to_dict method for binning table serialization #371

feat: add to_dict method for binning table serialization #371

Uh oh!

sudo-hannes commented Aug 28, 2025

Uh oh!

sudo-hannes commented Aug 29, 2025 •

edited

Loading

Uh oh!

guillermo-navas-palencia commented Aug 29, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

feat: add to_dict method for binning table serialization #371

feat: add to_dict method for binning table serialization #371

Uh oh!

Conversation

sudo-hannes commented Aug 28, 2025

Add to_dict method for binning table serialization

Summary

Uh oh!

sudo-hannes commented Aug 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

guillermo-navas-palencia commented Aug 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sudo-hannes commented Aug 29, 2025 •

edited

Loading

guillermo-navas-palencia commented Aug 29, 2025 •

edited

Loading