ASK/TELL DEVELOP #1307

jlnav · 2024-05-09T17:27:15Z

codecov · 2024-05-16T20:14:01Z

Codecov Report

❌ Patch coverage is 87.75510% with 54 lines in your changes missing coverage. Please review.
✅ Project coverage is 78.75%. Comparing base (36ea32c) to head (a383dc0).
⚠️ Report is 26 commits behind head on develop.

Files with missing lines	Patch %	Lines
libensemble/generators.py	80.00%	16 Missing and 6 partials ⚠️
libensemble/gen_classes/aposmm.py	82.66%	4 Missing and 9 partials ⚠️
libensemble/utils/runners.py	92.22%	4 Missing and 3 partials ⚠️
libensemble/sim_funcs/borehole_kills.py	0.00%	5 Missing ⚠️
libensemble/utils/misc.py	95.65%	2 Missing and 3 partials ⚠️
libensemble/gen_funcs/aposmm_localopt_support.py	50.00%	1 Missing ⚠️
libensemble/libE.py	0.00%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #1307      +/-   ##
===========================================
+ Coverage    78.23%   78.75%   +0.52%     
===========================================
  Files           76       79       +3     
  Lines         7561     7983     +422     
  Branches      1116     1195      +79     
===========================================
+ Hits          5915     6287     +372     
- Misses        1447     1477      +30     
- Partials       199      219      +20

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

shuds13 · 2024-05-21T18:03:50Z

libensemble/tests/unit_tests/test_persistent_aposmm.py

+    }
+
+    my_APOSMM = APOSMM(gen_specs)
+    my_APOSMM.setup()


Does this need to be separate to constructor?

Perhaps you've seen already, but setup() sets attributes that can't be pickled, but need to be done anyway. So I believe a separate generator.setup() needs to exist.

But perhaps for outside-of-libE purposes, to save a line, this could resemble: my_APOSMM = APOSMM(gen_specs, setup=True).

shuds13 · 2024-05-30T18:40:58Z

jlnav · 2024-05-30T20:44:43Z

@jlnav

Currently this has two existing user functions refactored to ask/tell, and it is a breaking change for codes using those gens. I don't think this is what we want. So, for now, we could supply these as duplicates (keeping the original), or remove them. Alternative would be to refactor all appropriate gens to ask/tell, but that would hold this up, so I think it may be better to supply these two as a duplicate for now.

Sounds good. Maybe we could raise DeprecationWarnings?

Also, would it be better for these ask/tell gens (rand sample and gpCAM) to use AskTellGenRunner and not use the wrapper gen_f?

I prefer the AskTellGenRunner myself (I did develop it), but if all the gens move to my runner, then what would you want to do with your wrapper?

shuds13 · 2024-05-30T20:59:20Z

I think there is also some opportunity for inheritence with the gpCAM class. And gp_cam_simple is more complicated than gp_cam_asktell. So, if anything the latter would be the base class, and renamed to avoid confusion. See #1316

shuds13 · 2024-06-01T20:08:29Z

libensemble/libE.py

 # ==================== Local version ===============================


+def _retrieve_generator(gen_specs):


I don't know why this is needed. If gen is on a thread, should not need to be pickled.

I've removed on my branch for now, so don't change here

jlnav · 2024-06-05T17:49:55Z

Checking random sample when running without Optimas:

>>> import numpy as np
>>> normal = np.load("persistent_aposmm_nlopt_history_length=2005_evals=2000_workers=4.npy")
>>> asktell = np.load("persistent_aposmm_nlopt_asktell_history_length=2002_evals=2000_workers=3.npy")
>>> all([i in asktell["x"][:100] for i in normal["x"][:100]])
True

jlnav · 2024-06-05T17:52:17Z

With Optimas, currently looks like random sample doesn't match, even when trying to account for seed

jlnav · 2024-06-06T16:06:44Z

Precision fixed in the Optimas example: optimas-org/optimas@08a835b

shuds13 · 2024-07-17T21:19:28Z

We may need to update gpCAM gens for latest gpCAM release. Make sure any changes made in gen_f is reflected here before pulling in.

Check gpCAM updates

shuds13 · 2024-07-31T16:11:03Z

libensemble/gen_classes/gpCAM.py

+            self.all_y = np.vstack((self.all_y, self.y_new))
+
+            if self.my_gp is None:
+                self.my_gp = GP(self.all_x, self.all_y, noise_variances=self.noise * np.ones(len(self.all_y)))


I need to update for new gpCAM interface

shuds13 · 2024-07-31T16:11:29Z

libensemble/gen_classes/sampling.py

+    """
+
+    def __init__(self, _, persis_info, gen_specs, libE_info=None) -> list:
+        # self.H = H


Remove I guess.

shuds13 · 2024-07-31T16:21:01Z

libensemble/gen_classes/gpCAM.py

+        self.all_y = np.empty((0, 1))
+        np.random.seed(0)
+
+    def __init__(self, H, persis_info, gen_specs, libE_info=None):


I will put above _initialize_gpcAM

and make _initialize_gpcAM _initialize_gpCAM

shuds13 · 2024-07-31T16:26:50Z

libensemble/gen_classes/gpCAM.py

+        self.all_x = np.empty((0, self.n))
+        self.all_y = np.empty((0, 1))
+        np.random.seed(0)
+


We need to decide __init__ interface.
We questioned before whether we keep the same interface - which mirrors the current gen_f, or to rearrange, as H is often not given (basically an H0). So it could be gen_specs first. I'm leaning towards keeping the original ordering as it mirrors our user functions, but this should be discussed.

Fair enough. My opinion/intuition is a user is more likely to prefer either "classical" gens (e.g. Jeff) or ask/tell gens (e.g. other CAMPA folks). With these gens' interfaces and users being so different, I don't think an arguably simpler rearrangement of the input parameters is too confusing.

Similarly to how some people prefer numpy or pandas; they do similar things, but their interfaces being different isn't a point of contention.

I'd also lean towards if someone were to initialize some object, like a gen, themselves, they'd prefer their specifications be provided as early and clearly as possible:

my_gen = Generator(param=1, option="two", )

vs.

my_gen = Generator(None, {}, {"param": 1, "option": "two"}, {}, )

shuds13

If we pull this in, are we treating this as provisional, without documenting gen_classes, so interface (e.g. __init__ option ordering) is mutable. Or are we ready to say a user can use the gen_classes interface for their own gens if desired? This questions applies to the numpy interface (ask_np/tell_np). As for the outer API, we certainly have not finalized that, and it should somehow be noted.

E.g.,
# This feature is in beta and its interface may change in future releases

Note, that currently the two implemented classes (rearranged to ask/tell) are duplicates of gen_f, but in the long term we will not want to have duplicates.

Discussions:

Interfacer:

Does it work with executor (consideration of things added to executor in worker.py - including first-tier comm object)
Does it work when there is no executor
For user function should we insert comm into libE_info["comm"] inside qcomm_main
Works with processes and threads - or picked one (and named right in generators.py)
Works with fork and spawn (for spawn be clear what in calling script needs to be inside main block).
Sometimes get hang at end of runs using the interfacer - esp. if no final_tell, is there something not getting shut-down? Check this works.
Need to deal with final_tell when using gen (e.g. optimas)
Need to sepaarate final_tell as required by standard - tell and extract data function.
Constructor wrapper approach - demonstrate alternatives in notebook

Todo:

SH - update gpCAM to reflect current interface.
Change/rename original gpCAM to match naming in the class.
Need to run 'extra' tests for gpCAM tests to get run. Check passes.
Check correct no. points produced each call of gen (usually number sent back).

Check works:

Running libE ask/tell generator - e.g. gpCAM through Optimas (it will go via ask/tell wrappers).
Use libEnsemble ask/tell generator standalone (outside of libE)
must libE be installed to use generators?
Running libE with our ask_numpy/tell_numpy functions.
Running libE with ask/tell function (standardised format).

jmlarson1 · 2024-08-01T16:08:17Z

libensemble/generators.py

+
+class Generator(ABC):
+    """
+    v 0.7.2.24


Is this version number something we need?

libensemble/generators.py

shuds13 · 2024-08-06T16:17:12Z

libensemble/generators.py

+    """
+
+    def __init__(
+        self, gen_specs: dict, History: npt.NDArray = [], persis_info: dict = {}, libE_info: dict = {}


Here we have a different constructor signature to those functions in gen_classes.

shuds13 · 2024-08-06T16:33:17Z

In the full tests: https://github.com/Libensemble/libensemble/actions/runs/10256911231/job/28376940998#step:22:684

 ---Test 7: test_asktell_surmise.py starting with local on 4 processes 
/home/runner/miniconda3/envs/condaenv/lib/python3.11/site-packages/pydantic/_internal/_config.py:322: UserWarning: Valid config keys have changed in V2:
* 'orm_mode' has been renamed to 'from_attributes'
  warnings.warn(message, UserWarning)
Traceback (most recent call last):
  File "/home/runner/work/libensemble/libensemble/libensemble/tests/regression_tests/test_asktell_surmise.py", line 90, in <module>
    H_out, _a, _b = borehole(list_dicts_to_np(point), {}, sim_specs, {"H_rows": np.array([point["sim_id"]])})
                             ^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/runner/work/libensemble/libensemble/libensemble/utils/misc.py", line 102, in list_dicts_to_np
    first = list_dicts[0]
            ~~~~~~~~~~^^^
KeyError: 0
Error: The operation was canceled.

libensemble/utils/runners.py

libensemble/gen_funcs/persistent_gen_wrapper.py

libensemble/generators.py

libensemble/utils/runners.py

Update finalize to meet standard

jlnav marked this pull request as ready for review May 16, 2024 20:06

shuds13 reviewed May 21, 2024

View reviewed changes

shuds13 reviewed Jun 1, 2024

View reviewed changes

jlnav requested a review from shuds13 July 30, 2024 22:00

shuds13 reviewed Jul 31, 2024

View reviewed changes

shuds13 requested changes Jul 31, 2024

View reviewed changes

shuds13 requested a review from jmlarson1 July 31, 2024 16:47

jmlarson1 approved these changes Aug 1, 2024

View reviewed changes