Skip to content

Conversation

@MalteKurz
Copy link
Member

@MalteKurz MalteKurz commented Jul 29, 2021

This PR adds functionality for cluster robust double machine learning. The main reference is:
Chiang, H. D., Kato K., Ma, Y. and Sasaki, Y. (2021), Multiway Cluster Robust Double/Debiased Machine Learning, Journal of Business & Economic Statistics, https://doi.org/10.1080/07350015.2021.1895815.

  • The DGP from the paper was added as function make_pliv_multiway_cluster_CKMS2021.

  • A new data-backend for cluster data named DoubleMLClusterData was added. It is inherited from the DoubleMLData class and primarily adds functionality to add the cluster variables.

  • Cluster robust cross-fitting with resampling of cluster variables is implemented in the abstract base class DoubleML and used whenever a data-backend of class DoubleMLClusterData is passed as input. The implemented approach is described as Algorithm 1 in Chiang et al. (2021). For details, see the new notebook added to the example gallery (see Cluster robust double machine learning doubleml-docs#40).

  • Methods for estimation of cluster-robust standard errors in double machine learning models have been added to the abstract base class DoubleML. It implements standard error estimation as described in Eq. (3.4)-(3.6) in Chiang et al. (2021). For details, see the new notebook added to the example gallery (see Cluster robust double machine learning doubleml-docs#40).

  • The current implementation is restricted to the one-way and two-way clustering cases. The extension to the general multiway-clustering case would be easily doable.

  • In the unit tests we check against functional implementations for the one-way and two-way clustering case. Additionally standard unit tests for the newly added data-backend and functionalities have been added.

  • A notebook for the example gallery will be added in this PR Cluster robust double machine learning doubleml-docs#40

  • The implementation for the Python-package will be added in this PR Cluster robust double machine learning doubleml-for-py#116

  • A comparison of the Py and R version will be added in this PR Cluster robust double machine learning doubleml-py-vs-r#8

MalteKurz added 30 commits July 23, 2021 14:50
…ring to be aligned with Chiang et al. (2021)
…d unit tests and functional implementations of clustering
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

2 participants