Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Imputed value not missing (Nan) #3

Open
mrtrunghieu1 opened this issue Oct 2, 2020 · 6 comments
Open

Imputed value not missing (Nan) #3

mrtrunghieu1 opened this issue Oct 2, 2020 · 6 comments

Comments

@mrtrunghieu1
Copy link

I run code in examples so I think GINN imputed value not missing (Nan value). Can you explain problems to me?

@spindro
Copy link
Owner

spindro commented Nov 17, 2020

Hi,
GINN is part of a family of algorithms whose objective is the imputation of missing values.
Basically, you have a dataset which is incomplete (i.e. some of the observations have missing elements) and use GINN, or other algorithms, to fill these holes. After this step, you can continue to perform any task that you want with the data.

@Hu-nie
Copy link

Hu-nie commented Jan 13, 2021

Are you saying that you are learning through complete data and imputation missing data afterwards?

@spindro
Copy link
Owner

spindro commented Jan 13, 2021

Not exactly. The task here is to learn from the incomplete data the values that most likely should fill the gaps in the dataset. After this imputation step, you have a new dataset with the predicted values instead of NaNs or whatever you use as a placeholder.

@mrtrunghieu1
Copy link
Author

mrtrunghieu1 commented Jan 13, 2021 via email

@Hu-nie
Copy link

Hu-nie commented Jan 14, 2021

In my opinion, you should generate random missing values with complete data and compare and evaluate them with MSE, RMSE, MAE, etc.

@hemarathore
Copy link

@Hu-nie But how to evaluate MSE, ...and other matrices? we don't have the missing values, or the ground truth. You are saying this model is filling your missing data and that's it, but how to access the performance measures?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants