Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GSOC] Project 8: Support various Parameter Distribution in Katib #2374

Open
7 of 12 tasks
shashank-iitbhu opened this issue Jun 30, 2024 · 7 comments · Fixed by #2377, #2403 or #2404 · May be fixed by #2412 or #2446
Open
7 of 12 tasks

[GSOC] Project 8: Support various Parameter Distribution in Katib #2374

shashank-iitbhu opened this issue Jun 30, 2024 · 7 comments · Fixed by #2377, #2403 or #2404 · May be fixed by #2412 or #2446
Assignees

Comments

@shashank-iitbhu
Copy link
Contributor

shashank-iitbhu commented Jun 30, 2024

Motivation

Currently, Katib is limited to supporting only uniform distribution for integer, float, and categorical hyperparameters. By introducing additional distributions, Katib will become more flexible and powerful in conducting hyperparameter optimization tasks.
A Data Scientist requires Katib to support multiple hyperparameter distributions, such as log-uniform, normal, and log-normal, in addition to the existing uniform distribution. For instance, learning rates often benefit from a log-uniform distribution because small values can significantly impact performance. Similarly, normal distributions are useful for parameters that are expected to vary around a central value.

Goal

The project aims to enhance Katib's Experiment APIs by introducing support for various hyperparameter distributions beyond the current uniform distribution.
Currently, the Experiment APIs support parameterType and feasibleSpace. To provide greater flexibility, we expanded the feasibleSpace to include a Distribution field, allowing users to specify the type of distribution for each parameter.

This Distribution field is designed to be mutually exclusive, meaning it can be left empty, enabling users to tune hyperparameters without specifying a distribution if desired.

What I Did in GSoC Project & Ongoing Works

Preparation

Development

Documentation and Ongoing Work

  • Write e2e tests for normal and lognormal distributions in hyperopt: [GSOC] hyperopt suggestion service logic update #2412
  • Extend the Python SDK to include support for the Distribution field.
  • Update Katib documentation to reflect new API changes and usage guidelines.
  • Create examples demonstrating the use of different parameter distributions.

What I learnt during the project

  • One of the key hurdles was integrating multiple new distributions into the Katib Experiment APIs while ensuring backward compatibility. Modifying the feasibleSpace to include a Distribution field required careful consideration of API design and how different suggestion services, like Optuna and Hyperopt, would handle these changes.

  • Effective communication is key. The weekly syncs with @andreyvelich and @tenzen-y were extremely valuable, and having a clear agenda beforehand helped me get my questions answered efficiently and stay on track.

  • I learned the importance of end-to-end (E2E) tests. By implementing these tests, I gained valuable insights into how the different components of Katib interact and how critical it is to verify that new features, like the distribution support.

In the End

I am grateful to Google for the opportunity to work on this project and to the Kubeflow community for their support and guidance. Special thanks to my project mentors, @andreyvelich, @tenzen-y, and @terrytangyuan for your invaluable insights and encouragement. Your mentorship was pivotal in achieving the project's goals.

I hope the enhancements made will significantly benefit users and advance Katib’s capabilities in hyperparameter optimization. Looking forward to continue to contribute to Kubeflow.

@shashank-iitbhu
Copy link
Contributor Author

cc @andreyvelich @tenzen-y

@andreyvelich
Copy link
Member

/area gsoc

@tenzen-y
Copy link
Member

/reopen

Copy link

@tenzen-y: Reopened this issue.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@tenzen-y
Copy link
Member

/reopen

@google-oss-prow google-oss-prow bot reopened this Aug 18, 2024
Copy link

@tenzen-y: Reopened this issue.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@tenzen-y tenzen-y reopened this Aug 20, 2024
@shashank-iitbhu shashank-iitbhu linked a pull request Aug 21, 2024 that will close this issue
1 task
@shashank-iitbhu
Copy link
Contributor Author

/assign

@shashank-iitbhu shashank-iitbhu changed the title [GSOC] Project 8 Tracker: Support various Parameter Distribution in Katib [GSOC] Project 8: Support various Parameter Distribution in Katib Sep 8, 2024
@shashank-iitbhu shashank-iitbhu linked a pull request Oct 20, 2024 that will close this issue
1 task
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment