py-why
diff --git a/‎README.md‎
Lines changed: 63 additions & 5 deletions b/‎README.md‎
Lines changed: 63 additions & 5 deletions
diff --git a/‎doc/spec/comparison.rst‎
Lines changed: 18 additions & 1 deletion b/‎doc/spec/comparison.rst‎
Lines changed: 18 additions & 1 deletion
diff --git a/‎doc/spec/estimation/orthoiv.rst‎
Lines changed: 80 additions & 0 deletions b/‎doc/spec/estimation/orthoiv.rst‎
Lines changed: 80 additions & 0 deletions
diff --git a/‎doc/spec/estimation_iv.rst‎
Lines changed: 2 additions & 1 deletion b/‎doc/spec/estimation_iv.rst‎
Lines changed: 2 additions & 1 deletion
diff --git a/‎doc/spec/references.rst‎
Lines changed: 5 additions & 0 deletions b/‎doc/spec/references.rst‎
Lines changed: 5 additions & 0 deletions
@@ -319,23 +319,81 @@ lb, ub = est.effect_interval(X_test, alpha=0.05)
 </details>
 
 <details>
-<summary>Orthogonal Instrumental Variables (click to expand)</summary>
+<summary>Double Machine Learning with Instrumental Variables (click to expand)</summary>
 
-* Intent to Treat Doubly Robust Learner (discrete instrument, discrete treatment)
+* Orthogonal instrumental variable learner
+
+```Python
+from econml.iv.dml import OrthoIV
+
+est = OrthoIV(projection=False, 
+              discrete_treatment=True, 
+              discrete_instrument=True)
+est.fit(Y, T, Z=Z, X=X, W=W)
+treatment_effects = est.effect(X_test)
+lb, ub = est.effect_interval(X_test, alpha=0.05) # OLS confidence intervals
+```
+* Nonparametric double machine learning with instrumental variable
+
+```Python
+from econml.iv.dml import NonParamDMLIV
+
+est = NonParamDMLIV(projection=False, 
+                    discrete_treatment=True, 
+                    discrete_instrument=True)
+est.fit(Y, T, Z=Z, X=X, W=W) # no analytical confidence interval available
+treatment_effects = est.effect(X_test)
+```
+</details>
+
+<details>
+<summary>Doubly Robust Machine Learning with Instrumental Variables (click to expand)</summary>
+
+* Linear final stage
+```Python
+from econml.iv.dr import LinearDRIV
+
+est = LinearDRIV(discrete_instrument=True, discrete_treatment=True)
+est.fit(Y, T, Z=Z, X=X, W=W)
+treatment_effects = est.effect(X_test)
+lb, ub = est.effect_interval(X_test, alpha=0.05) # OLS confidence intervals
+```
+
+* Sparse linear final stage
+
+```Python
+from econml.iv.dr import SparseLinearDRIV
+
+est = SparseLinearDRIV(discrete_instrument=True, discrete_treatment=True)
+est.fit(Y, T, Z=Z, X=X, W=W)
+treatment_effects = est.effect(X_test)
+lb, ub = est.effect_interval(X_test, alpha=0.05) # Debiased lasso confidence intervals
+```
+
+* Nonparametric final stage
+```Python
+from econml.iv.dr import ForestDRIV
+
+est = ForestDRIV(discrete_instrument=True, discrete_treatment=True)
+est.fit(Y, T, Z=Z, X=X, W=W)
+treatment_effects = est.effect(X_test)
+# Confidence intervals via Bootstrap-of-Little-Bags for forests
+lb, ub = est.effect_interval(X_test, alpha=0.05) 
+```
+
+* Linear intent-to-treat (discrete instrument, discrete treatment)
 
 ```Python
 from econml.iv.dr import LinearIntentToTreatDRIV
 from sklearn.ensemble import GradientBoostingRegressor, GradientBoostingClassifier
-from sklearn.linear_model import LinearRegression
 
 est = LinearIntentToTreatDRIV(model_y_xw=GradientBoostingRegressor(),
                               model_t_xwz=GradientBoostingClassifier(),
                               flexible_model_effect=GradientBoostingRegressor())
-est.fit(Y, T, Z=Z, X=X) # OLS inference by default
+est.fit(Y, T, Z=Z, X=X, W=W)
 treatment_effects = est.effect(X_test)
 lb, ub = est.effect_interval(X_test, alpha=0.05) # OLS confidence intervals
 ```
-
 </details>
 
 <details>
 
@@ -35,7 +35,24 @@ Detailed estimator comparison
 +---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
 | :class:`.NonParamDML`                       | 1-d/Binary   |              |                  | Yes         |                 | Yes        |              | Yes                |
 +---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
-
+| :class:`.OrthoIV`                           | Any          | Yes          | Yes              | Yes         | Assumed         | Yes        | Yes          |                    |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.DMLIV`                             | Any          | Yes          |                  | Yes         | Assumed         | Yes        | Yes          | Yes                |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.NonParamDMLIV`                     | 1-d/Binary   | Yes          |                  | Yes         |                 | Yes        |              | Yes                |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.DRIV`                              | 1-d/Binary   | Yes          | Yes              | Yes         |                 |            |              | Yes                |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.LinearDRIV`                        | 1-d/Binary   | Yes          | Yes              | Yes         | Projected       |            |              |                    |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.SparseLinearDRIV`                  | 1-d/Binary   | Yes          | Yes              | Yes         | Projected       |            |              | Yes                |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.ForestDRIV`                        | 1-d/Binary   | Yes          | Yes              | Yes         |                 |            |              | Yes                |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.IntentToTreatDRIV`                 | Binary       | Yes          | Ye               | Yes         |                 |            |              | Yes                |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
+| :class:`.LinearIntentToTreatDRIV`           | Binary       | Yes          | Yes              | Yes         | Projected       |            |              |                    |
++---------------------------------------------+--------------+--------------+------------------+-------------+-----------------+------------+--------------+--------------------+
 
 Treatment Type
     Some estimators can only estimate effects of particular kinds of treatments. 
 
@@ -0,0 +1,80 @@
+.. _orthoivuserguide:
+
+=================================
+Orthogonal instrumental variables
+=================================
+
+
+What is it?
+==================================
+Orthogonal instrumental variables is a suite of methods to estimate heterogeneous treatment effects with arbitrary machine
+learning methods in the presence of unobserved confounders with the aid of a valid instrument. We develop a statistical learning
+approach to the estimation of heterogeneous effects, reducing the problem to the minimization of an appropriate loss function that
+depends on a set of auxiliary models (each corresponding to a separate prediction task). The reduction enables the use of all
+recent Machine Learning models (e.g. random forest, boosting, neural nets). We show that the estimated effect model is robust to
+estimation errors in the auxiliary models, by showing that the loss satisfies a Neyman orthogonality criterion.
+Our approach can be used to estimate projections of the true effect model on simpler hypothesis spaces.
+When these spaces are parametric, then the parameter estimates are asymptotically normal,
+which enables construction of confidence intervals. 
+For a more detailed overview of these methods, see e.g. [Syrgkanis2019]_.
+
+
+What are the relevant estimator classes?
+========================================
+
+This section describes the methodology implemented in the classes :class:`.OrthoIV`,
+:class:`.DMLIV`, :class:`.NonParamDMLIV`, :class:`.LinearDRIV`, :class:`.SparseLinearDRIV`, :class:`.ForestDRIV`,
+:class:`.IntentToTreatDRIV`, :class:`.LinearIntentToTreatDRIV`.
+Click on each of these links for detailed module documentation and input parameters of each class.
+
+When should you use it?
+==================================
+Suppose you have observational (or experimental from an A/B test) historical data, where some treatment(s)/intervention(s)/action(s) 
+:math:`T` were chosen and some outcome(s) :math:`Y` were observed. However, not all of the variables :math:`W` that could have
+potentially gone into the choice of :math:`T`, and simultaneously could have had a direct effect on the outcome :math:`Y`
+(aka controls or confounders) are recorded in the dataset. At the same time if you could observe a variable :math:`Z` which
+will have a direct effect on treatment and an indirect effect on outcome which only goes through the treatment, we could use classes
+mentioned above to learn the heterogeneous treatment effect on high dimensional dataset. In other words, we learn the effect of the
+treatment on the outcome as a function of a set of observable characteristics :math:`X`.
+
+In particular, these methods are especially useful in A/B tests with an intent-to-treat structure, where the experimenter randomizes over
+which user will receive a recommendation to take an action, and we are interested in the effect of the downstream action. 
+
+For instance call:
+
+.. testsetup::
+
+    # LinearIntentToTreatDRIV
+    import numpy as np
+    X = np.random.normal(size=(100, 3))
+    y = np.random.normal(size=(100,))
+    T = np.random.binomial(1, 0.5, size=(100,))
+    Z = np.random.binomial(1, 0.5, size=(100,))
+    W = np.random.normal(size=(100, 10))
+
+.. testcode::
+
+    from econml.iv.dr import LinearIntentToTreatDRIV
+    est = LinearIntentToTreatDRIV()
+    est.fit(y, T, Z=Z, X=X, W=X)
+    est.effect(X)
+
+
+Class Hierarchy Structure
+==================================
+In this library we implement variants of several of the approaches mentioned in the last section. The hierarchy
+structure of the implemented CATE estimators is as follows.
+
+    .. inheritance-diagram:: econml.iv.dml.OrthoIV econml.iv.dml.NonParamDMLIV econml.iv.dml.DMLIV econml.iv.dr.DRIV 
+                            econml.iv.dr.LinearDRIV econml.iv.dr.SparseLinearDRIV econml.iv.dr.ForestDRIV
+                            econml.iv.dr.IntentToTreatDRIV econml.iv.dr.LinearIntentToTreatDRIV
+        :parts: 1
+        :private-bases:
+        :top-classes: econml._ortho_learner._OrthoLearner, econml._cate_estimator.StatsModelsCateEstimatorMixin, econml._cate_estimator.DebiasedLassoCateEstimatorMixin
+
+
+Usage Examples
+==================================
+
+For more extensive examples check out the following notebooks:
+`OrthoIV and DRIV Examples Jupyter Notebook <https://github.com/microsoft/EconML/blob/master/notebooks/OrthoIV%20and%20DRIV%20Examples.ipynb>`_.
@@ -15,4 +15,5 @@ of [Newey2003]_.
     :maxdepth: 2
 
     estimation/deepiv.rst
-    estimation/two_sls.rst
+    estimation/two_sls.rst
+    estimation/orthoiv.rst
@@ -124,3 +124,8 @@ References
     Hernán, Miguel A., and James M. Robins (2010).
     Causal inference.
     URL https://www.hsph.harvard.edu/miguel-hernan/causal-inference-book/
+
+.. [Syrgkanis2019]
+    Syrgkanis, V., Lei, V., Oprescu, M., Hei, M., Battocchi, K., Lewis, G. (2019)
+    Machine Learning Estimation of Heterogeneous Treatment Effects with Instruments
+    URL https://arxiv.org/abs/1905.10176