Basic monte carlo framework for simulating dgps and methods #216

Draft · wants to merge 3 commits into base: main
100 changes: 100 additions & 0 deletions montecarlo/README.md
@@ -0,0 +1,100 @@
<!-- Copyright (c) Microsoft Corporation. All rights reserved. -->
<!-- Licensed under the MIT License. -->

# Monte Carlo Library
The folder mcpy contains code for running Monte Carlo experiments from config files and for saving and plotting the results.
Look at dml_te_config.py and dml_te_functions.py for an example.

We can run a Monte Carlo simulation from the command line with:
```
python3 run_mc_from_config.py --config dml_te_config
```
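
Roughly, the runner loads the named config module and hands its CONFIG dictionary to the MonteCarlo class (described below). The snippet here is only an illustrative sketch: the mcpy import path and the run() entry point are assumptions, so consult run_mc_from_config.py for the actual code.
```python
# Illustrative sketch only; the real runner is run_mc_from_config.py.
import argparse
import importlib

from mcpy.monte_carlo import MonteCarlo  # assumed module path within mcpy

if __name__ == "__main__":
    parser = argparse.ArgumentParser()
    parser.add_argument("--config", required=True,
                        help="name of the config module, e.g. dml_te_config")
    args = parser.parse_args()

    config_module = importlib.import_module(args.config)  # e.g. imports dml_te_config.py
    MonteCarlo(config_module.CONFIG).run()  # run all experiments, then compute metrics and plots
```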

The config dictionary specifies the DGPs, estimation methods, metrics, and plot functions for a Monte Carlo experiment, along with their parameters. The MonteCarlo class runs experiments for every combination of dgp and method, and then computes metrics and plots on the results. DGPs, estimation methods, metrics, and plot functions can all be user defined, so this is a general framework for making Monte Carlo simulations easy to automate.

The config looks like the following:
```python
CONFIG = {
    # whether to run a simulation for a fixed set of parameters ('single_parameter')
    # vs. 'sweep_parameter', which runs simulations for all permutations of the parameters
    "type" : 'single_parameter',
    # references to dgp functions
    "dgps" : {
        'dgp1': dml_te_functions.gen_data
    },
    # dgp options, for each dgp
    "dgp_opts" : {
        'dgp1': {
            'n_samples': 2000,
            'n_features': 1,
            'n_controls': 30,
            'support_size': 5
        },
    },
    # references to method fit functions
    "methods" : {
        'method1': dml_te_functions.linear_dml_fit,
    },
    # method options, for each method
    "method_opts" : {
        'method1': {
        },
    },
    # references to metric functions
    "metrics" : {
        'rmse': metrics.rmse,
        'conf_length': metrics.conf_length,
        'coverage': metrics.coverage,
        'std': metrics.std,
        'coverage_band': metrics.coverage_band
    },
    # references to plot functions
    "plots" : {
        'plot1': plotting.plot_metrics,
        'plot2': plotting.plot_visualization,
        'plot3': plotting.plot_violin
    },
    # different metrics are plotted differently:
    # single summary metrics produce a single value per dgp and method
    "single_summary_metrics" : ['coverage_band'],  # list of the metrics that are single summary metrics
    # plots to run when the type is 'sweep_parameter'
    "sweep_plots" : {
    },
    # monte carlo simulation options
    "mc_opts" : {
        'n_experiments': 5,  # number of monte carlo experiments
        "seed": 123
    },
    # comparison method reference (not currently supported)
    "proposed_method" : {
    },
    # directory to save results to
    "target_dir" : "",
    "reload_results" : False  # whether to load previous results instead of rerunning
}
```
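
The DGP, method, and metric functions referenced in the config are ordinary Python callables defined by the user (dml_te_functions.py contains the real ones for this example). The snippet below is only an illustrative sketch of the kinds of shapes involved; the names and signatures here are hypothetical, and mcpy defines the actual contracts.
```python
import numpy as np

def my_dgp(opts):
    """Hypothetical DGP: generate data plus the ground truth that metrics compare against."""
    n, d = opts['n_samples'], opts['n_features']
    X = np.random.uniform(-1, 1, size=(n, d))
    T = np.random.binomial(1, 0.5, size=n)
    true_effect = 1 + 2 * X[:, 0]
    y = true_effect * T + np.random.normal(size=n)
    return (y, T, X), true_effect

def my_method(data, opts):
    """Hypothetical method: return point estimates and a confidence interval."""
    y, T, X = data
    point = np.full(X.shape[0], y[T == 1].mean() - y[T == 0].mean())
    return point, (point - 0.5, point + 0.5)

def my_rmse(estimate, truth):
    """Hypothetical metric: reduce one experiment's estimate to a single number."""
    point, _ = estimate
    return float(np.sqrt(np.mean((point - truth) ** 2)))
```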

# Some Next Steps

- Use a cloud computing service to run all simulations in parallel on many cores. This would yield better simulation results and let us sweep over many more parameter settings to better evaluate the empirical performance of the estimators (see the parallelization sketch after this list).
- Many HTEs are calculated for a treatment change from T0 to T1. When evaluating performance, the user should perhaps specify those treatment vectors themselves in the dgp and method functions that they write, or perhaps the vectors should also be swept over in the MonteCarloSweep. An open question is how to sweep over different values for them, e.g., in what increments, especially as they become high dimensional.
- Construct more dgps, and look at more real data sets to perform simulations on
- Integrate this with the rest of the code suite so that whenever significant updates to the estimator implementations are made, a suite of MC simulations is run and the results are compared to those from before the change. This would let us understand how the empirical performance of the estimators changes with the implementation.
- Add more metrics and plotting functions, and try to generalize them rather than tying them to specific DGP/method implementations.
- The overall challenge is to keep the framework general while still allowing nitty-gritty DGPs/methods/metrics/plots to be defined. Looser return types such as lists, tuples, or dictionaries may let many different DGP/method implementations use the framework at the same time, with each one specified through the references in the config file. The best way to find which small adjustments are needed is to implement a range of DGPs/methods/metrics/plots and see which return types must become more general or more specific/tailored. For example, when X_test is multidimensional there is a question of how to plot the metrics: against the first column of X_test, or something else? That may force the plot functions to be specific to that DGP/method setup. The framework lets anyone write their own dgps/methods/metrics/plot functions, but more generality would be preferable, perhaps via sets/classes of DGPs that are frequently tested together. Anything to avoid lots of DGP-specific code, otherwise the test codebase will get big.
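
The experiments in a Monte Carlo run are independent, so before moving to a cloud service they can already be spread across local cores. The sketch below is illustrative only, assuming each experiment can be expressed as a function of a seed and the config (run_single_experiment is a hypothetical helper, not part of mcpy):
```python
# Illustrative parallel driver using joblib; not part of mcpy.
from joblib import Parallel, delayed

def run_single_experiment(seed, config):
    # hypothetical: draw data from each dgp, fit each method, compute metrics
    return {}

def run_all_parallel(config, n_jobs=-1):
    start = config["mc_opts"]["seed"]
    seeds = range(start, start + config["mc_opts"]["n_experiments"])
    # one job per experiment, spread across all available cores
    return Parallel(n_jobs=n_jobs)(
        delayed(run_single_experiment)(seed, config) for seed in seeds)
```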

# Example Plots
<p align="center">
<img src="example_plots/plot1_rmse.png" height="100" title="rmse">
<img src="example_plots/plot1_coverage.png" height="100" title="coverage">
<img src="example_plots/plot1_conf_length.png" height="100" title="conf_length">
<img src="example_plots/plot1_std.png" height="100" title="std">
<img src="example_plots/plot2.png" height="100" title="Performance comparison of estimators">
<img src="example_plots/plot3.png" height="100" title="Coverage band">
<br/>
<i>Example plots for DGP in dml_te_config.py</i>
</p>
92 changes: 92 additions & 0 deletions montecarlo/dml_te_config.py
@@ -0,0 +1,92 @@
# Copyright (c) Microsoft Corporation. All rights reserved.
# Licensed under the MIT License.

import os
import numpy as np
import dml_te_functions
from mcpy import metrics
from mcpy import plotting
from mcpy import utils
from sklearn.linear_model import Lasso, LassoCV, LogisticRegression, LogisticRegressionCV, LinearRegression, MultiTaskElasticNet, MultiTaskElasticNetCV
from sklearn.ensemble import RandomForestRegressor, RandomForestClassifier
from sklearn.preprocessing import PolynomialFeatures

CONFIG = {
    "type": 'single_parameter',
    "dgps": {
        "dgp1": dml_te_functions.gen_data
    },
    "dgp_instance_fns": {
        'dgp1': dml_te_functions.instance_params
    },
    "dgp_opts": {
        'dgp1': {
            'n_samples': 2000,
            'n_features': 1,
            'n_controls': 30,
            'support_size': 5
        },
    },
    "methods": {
        "LinearDMLCate": dml_te_functions.linear_dml_fit,
        "SparseLinearDMLCate": dml_te_functions.sparse_linear_dml_poly_fit,
        # "DMLCate": dml_te_functions.dml_poly_fit,
        # "ForestDMLCate": dml_te_functions.forest_dml_fit
    },
    "method_opts": {
        'LinearDMLCate': {
            'model_y': RandomForestRegressor(),
            'model_t': RandomForestRegressor(),
            'inference': 'statsmodels'
        },
        'SparseLinearDMLCate': {
            'model_y': RandomForestRegressor(),
            'model_t': RandomForestRegressor(),
            'featurizer': PolynomialFeatures(degree=3),
            'inference': 'debiasedlasso'
        },
        'DMLCate': {
            'model_y': RandomForestRegressor(),
            'model_t': RandomForestRegressor(),
            'model_final': Lasso(alpha=0.1, fit_intercept=False),
            'featurizer': PolynomialFeatures(degree=10),
            'inference': 'bootstrap'
        },
        'ForestDMLCate': {
            'model_y': RandomForestRegressor(),
            'model_t': RandomForestRegressor(),
            'discrete_treatment': False,
            'n_estimators': 1000,
            'subsample_fr': 0.8,
            'min_samples_leaf': 10,
            'min_impurity_decrease': 0.001,
            'verbose': 0,
            'min_weight_fraction_leaf': 0.01,
            'inference': 'bootstrap'
        }
    },
    "metrics": {
        'rmse': metrics.rmse,
        'conf_length': metrics.conf_length,
        'coverage': metrics.coverage,
        'std': metrics.std,
        'coverage_band': metrics.coverage_band
    },
    "plots": {
        'plot1': plotting.plot_metrics,
        'plot2': plotting.plot_visualization,
        'plot3': plotting.plot_violin
    },
    # different metrics are plotted differently:
    # single summary metrics are a single value per dgp and method
    "single_summary_metrics": ['coverage_band'],
    "sweep_plots": {
    },
    "mc_opts": {
        'n_experiments': 5,  # number of monte carlo experiments
        "seed": 123
    },
    "proposed_method": "CrossOrtho",
    "target_dir": "dml_te_test",
    "reload_results": False
}
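
For context, here is a hypothetical sketch of how a method function such as linear_dml_fit could consume the method_opts above, assuming EconML's LinearDMLCateEstimator API at the time of this PR; the data tuple layout is made up for illustration, and the real implementation lives in dml_te_functions.py.
```python
# Illustrative only; see dml_te_functions.py for the actual implementation.
from econml.dml import LinearDMLCateEstimator

def linear_dml_fit_sketch(data, opts):
    """Fit a linear DML CATE estimator and return point estimates with intervals."""
    y, T, X, W, X_test = data  # hypothetical data layout
    est = LinearDMLCateEstimator(model_y=opts['model_y'], model_t=opts['model_t'])
    est.fit(y, T, X=X, W=W, inference=opts['inference'])
    point = est.effect(X_test)
    lower, upper = est.effect_interval(X_test)
    return point, (lower, upper)
```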