An interesting demonstration of causal inference in the People Analytics space using the famous IBM Employee Attrition dataset.
To be honest, whenever I see an analysis using the popular IBM Employee Attrition dataset, I tend to ignore it and quickly skip to something more interesting and engaging. A classic search for shiny new things in action. 😉
Btw, did you know this dataset has already been with us for about 10 years? Pretty nice milestone anniversary! At least, that’s what my OpenAI deep research found—it seems it was first released by IBM around September 2015 on its blog to showcase the IBM Watson Analytics platform’s capabilities in an HR context. So, please take this finding with a grain of salt. However, after checking the sources (including my own memory), it seems pretty plausible to me. But feel free to correct me/us if I’m/we’re wrong.
Recently, however, I broke this habit when I came across a Jupyter notebook on EconML’s GitHub showcasing how to effectively combine classical ML—providing a list of strongest predictors—with their subsequent causal interpretation using Double ML (DML) and Heterogeneous Treatment Effect (HTE) estimation, nicely packaged into the CausalAnalysis class.
The entire pipeline includes the following steps:
IMO, pretty cool stuff. It seems we indeed live not only in an AI revolution but also in a causal one ✊🙂
⚠️ Just a small (or big?) warning at the end: Despite the smooth and easy analytical workflow enabled by the CausalAnalysis class, it doesn’t replace the need for strong domain knowledge and understanding which variables make sense to include in the model in the first place.
For attribution, please cite this work as
Stehlík (2025, March 18). Ludek's Blog About People Analytics: How to get causal interpretation for the Employee Attrition dataset?. Retrieved from https://blog-about-people-analytics.netlify.app/posts/2025-03-18-econml-and-employee-attriton/
BibTeX citation
@misc{stehlík2025how, author = {Stehlík, Luděk}, title = {Ludek's Blog About People Analytics: How to get causal interpretation for the Employee Attrition dataset?}, url = {https://blog-about-people-analytics.netlify.app/posts/2025-03-18-econml-and-employee-attriton/}, year = {2025} }