Constraint Optimization with SingleTaskVariationalGP of binary classification task #2424
Replies: 1 comment
-
This is a really interesting question. There have been some discussions of "failure-aware BO", and that feels similar to me, as does the idea of data missing not-at-random. Your approach seems like it could work to me. I haven't worked with If you have a reproducible example, I could take a look -- I think this would be an interesting example for the BoTorch documentation. |
Beta Was this translation helpful? Give feedback.
-
Hi everyone,
I have a general question about Bayesian Optimization with unknown constraints.
I want to implement a BO loop with feasibility constraints. I have the discrete labels (0=feasible, 1=infeasible).
To model the constraint I am using the SingleTaskVariationalGP with a BernoulliLikelihood.
To implement the constraint into the BO loop I am using the LogConstrainedExpectedImprovement.
!!! Warning no reproducable code!!!
This is generally the implementation:
For a real BO loop implementation there is obviously a lot missing. But my question is regarding the implementation of the constraint.
I checked the SingleTaskVariationalGP model and the logits at the position where the training inputs are infeasible are negative.
But is it in general sufficient to implement the constraint like this?
Please correct me if i am wrong but to my knowledge, the output of the model_con is a mutlivariate gaussian distribution representing the logit of my constraint and the LogConstraintExpectedImprovment scales the acquisition with the probability of the logit distribution being between minus infinity and zero. Is that correct?
If my understanding of what is happening is correct the implementation of the binary feasibility constraint shoud be sufficient, correct?
I am happy for any help.
I tried to adjust the implementation from #640 but this issue is about multi-output optimization. There he also used a Variational GP model but implemented a new objective function and scaled it with the "real" probability of feasibility (not the logit distribution) calculated from MC samples and then proceeded with an MC-acquisition function.
But to my understanding if we are already using a VariationalGP model we should be able to use an analytic acquisition function.
Beta Was this translation helpful? Give feedback.
All reactions