Conversation

daviddavo
Collaborator

@daviddavo daviddavo commented Sep 18, 2024

Description

Migrating the Wide & Deep model out of TensorFlow.

Note: Previously I have only used models from high level libraries like Keras. I'm doing this to learn PyTorch, so feel free to give me any pointers or even scrap everything if it is not useful.

Related Issues

References

Checklist:

  • I have followed the contribution guidelines and code style for this project.
  • I have added tests covering my contributions.
  • I have updated the documentation accordingly.
  • I have signed the commits, e.g. git commit -s -m "your commit message".
  • This PR is being made to staging branch AND NOT TO main branch.

WIP Tasks

  • Implementing the model as PyTorch's nn.Module
    • Allowing additional embeddings
    • Allowing additional continuous features
  • Allowing cross-features (sorry, I don't understand them yet)
  • Creating a wrapper with .fit() and .recommend_k_items() methods
    • Allowing additional embs in wrapper's .fit()
    • Allowing additional cont. feat. in wrapper's .fit()
    • Caching the "ranking pool"
    • Save model every save_checkpoints_steps iterations in model_dir
    • Evaluate test loss every log_every_n_iter iterations instead of every iteration
  • Creating a torch.data.Dataset to pass to the wrapper class
  • Creating tests
  • Updating Jupyter Notebooks
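For reference, a skeleton like the following is one common way to structure the wide and deep parts as an `nn.Module`. This is only a sketch; all names here (`WideAndDeep`, `num_users`, `num_items`, `embed_dim`, `hidden`) are illustrative, not the PR's actual API:

```python
import torch
import torch.nn as nn

class WideAndDeep(nn.Module):
    """Minimal Wide & Deep sketch: linear ("wide") part over ids plus an
    MLP ("deep") part over dense embeddings, summed into a single logit."""

    def __init__(self, num_users, num_items, embed_dim=8, hidden=(32, 16)):
        super().__init__()
        # Wide part: a linear model over one-hot user/item ids, implemented
        # as per-id scalar embeddings plus a global bias.
        self.wide_user = nn.Embedding(num_users, 1)
        self.wide_item = nn.Embedding(num_items, 1)
        self.bias = nn.Parameter(torch.zeros(1))
        # Deep part: dense embeddings fed through an MLP.
        self.user_emb = nn.Embedding(num_users, embed_dim)
        self.item_emb = nn.Embedding(num_items, embed_dim)
        layers, in_dim = [], 2 * embed_dim
        for h in hidden:
            layers += [nn.Linear(in_dim, h), nn.ReLU()]
            in_dim = h
        layers.append(nn.Linear(in_dim, 1))
        self.mlp = nn.Sequential(*layers)

    def forward(self, user_ids, item_ids):
        wide = self.wide_user(user_ids) + self.wide_item(item_ids) + self.bias
        deep = self.mlp(torch.cat(
            [self.user_emb(user_ids), self.item_emb(item_ids)], dim=-1))
        # Joint output: one logit per user-item pair.
        return (wide + deep).squeeze(-1)
```

Additional embeddings and continuous features from the task list above would extend the `torch.cat` input (and, for the wide part, the linear terms) with extra columns.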

@miguelgfierro miguelgfierro marked this pull request as ready for review September 21, 2024 07:39
@miguelgfierro
Collaborator

miguelgfierro commented Sep 21, 2024

Sorry @daviddavo I pressed the wrong action. I changed the PR to ready for review because we are having some problems with the tests. Hopefully, we can fix it by next week.

@daviddavo daviddavo marked this pull request as draft September 21, 2024 15:09
@daviddavo
Collaborator Author

NP, I still have quite some work to do

@miguelgfierro
Collaborator

FYI @daviddavo the tests should be working now after #2169

@daviddavo
Collaborator Author

The TensorFlow Estimators approach currently used by recommenders uses a binary regressor (the default value of n_classes). To get the recommendations, all user-item pairs are used as input (created using the user_item_pairs method).

In the PyTorch notebook the output is not binary; instead, there is a class for each movie. The aim of that notebook is to predict the next movie to watch, not to make top-k recommendations, which is a different problem.

The question is, what does the original paper do? Does it output a single scalar? Or does it output a vector with a value corresponding to each item? As I understand it, it should be a single value ( $P\left(Y=1|X\right)=\sigma\left(...\right)$ ), but that notebook made me doubt it.

Nevertheless, I think I will modify the current model so the "head" returns a scalar, and to get the top-k recommendations we pass all possible user-item pairs, as recommenders' TensorFlow implementation does.

Edit: NVIDIA's deep learning examples also output a single value. My doubts have been resolved but I'll keep this post as some kind of documentation. I'll finish the model and do the training soon.
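A minimal sketch of that design, assuming a model that returns one logit per user-item pair (so $\sigma(\text{logit})$ is $P(Y=1|X)$). `TinyScorer` and `recommend_k_items` are illustrative stand-ins, not the PR's actual API:

```python
import torch
import torch.nn as nn

class TinyScorer(nn.Module):
    """Stand-in for the real Wide & Deep model: one logit per pair."""
    def __init__(self, num_users, num_items, dim=4):
        super().__init__()
        self.u = nn.Embedding(num_users, dim)
        self.i = nn.Embedding(num_items, dim)

    def forward(self, user_ids, item_ids):
        return (self.u(user_ids) * self.i(item_ids)).sum(-1)

def recommend_k_items(model, user_id, num_items, k=5):
    # Score this user against every candidate item in one batch.
    items = torch.arange(num_items)
    users = torch.full((num_items,), user_id, dtype=torch.long)
    with torch.no_grad():
        scores = torch.sigmoid(model(users, items))  # P(Y=1 | user, item)
    top_scores, top_items = torch.topk(scores, k)
    return top_items.tolist(), top_scores.tolist()
```

In a real wrapper, the candidate set would exclude items the user has already seen (the "ranking pool" from the task list).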

@daviddavo
Collaborator Author

The loss function decreases over time in my Jupyter notebook. The only thing remaining is the "software engineering" part.

@daviddavo
Collaborator Author

Now that I'm testing it with the full 100k dataset, I realized it's very slow. I will profile it next weekend, but I have a hunch that the problem is the DataLoader, which uses a lot of slow .loc calls.
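The hunch is easy to check in isolation: per-sample `.loc` lookups inside a `Dataset.__getitem__` pay pandas indexing overhead on every call, while converting the columns to contiguous arrays once up front makes each lookup a cheap array index. A small self-contained comparison (synthetic data, sizes are arbitrary):

```python
import time
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
df = pd.DataFrame({
    "user_id": rng.integers(0, 1000, size=100_000),
    "item_id": rng.integers(0, 1700, size=100_000),
    "rating": rng.random(100_000),
})

# Slow pattern: one .loc lookup per sample, as __getitem__ might do.
t0 = time.perf_counter()
slow = [df.loc[i, "rating"] for i in range(10_000)]
t_loc = time.perf_counter() - t0

# Fast pattern: extract the column to a NumPy array once, then index it.
ratings = df["rating"].to_numpy()
t0 = time.perf_counter()
fast = [ratings[i] for i in range(10_000)]
t_np = time.perf_counter() - t0
```

The same idea applies to a `torch.utils.data.Dataset`: precompute tensors in `__init__` and have `__getitem__` only index into them.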

@miguelgfierro
Collaborator

@daviddavo how are you doing with this PR? Let me know if you need any help

@miguelgfierro
Collaborator

@daviddavo how are things? I would like to ask whether you will be continuing with this work.

@daviddavo
Collaborator Author

Hi! Sorry for taking so long to reply! I changed jobs and moved to another city, so I did not have time for hobbies; sorry again for leaving you hanging.

I don't think I will have time in the near future, so it is fine if anyone else wants to keep working on this. If it remains open when I have time, I will retake it.

@miguelgfierro
Collaborator

@daviddavo thanks David!
FYI @loomlike

@anargyri
Collaborator

anargyri commented Mar 3, 2025

> Hi! Sorry for taking so long to reply! I changed jobs and moved to another city, so I did not have time for hobbies, sorry again for leaving you hanging.
>
> I don't think I will have time in the near future, so it is fine if anyone else wants to keep working on this. If it remains open when I have time, I will retake it.

Thanks for the contributions @daviddavo ! Feel welcome to give feedback or contribute to the repo any time you have the time!
