Thanks!

To answer your question: I would recommend Cook's distance if you have a linear regression problem with only numerical features, DBSCAN if you have only numerical (scaled) features but not necessarily a linear regression problem, and Isolation Forest if your data also contains categorical features.
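
For the Cook's distance case, a minimal sketch could look like the code below. It assumes simple linear regression with a single numerical feature plus an intercept, and it runs on made-up data. The function name `cooks_distance` and the 4/n cutoff (a common rule of thumb, not a strict rule) are illustrative choices.

```python
from statistics import mean


def cooks_distance(x, y):
    """Cook's distance per observation for y = intercept + slope * x.

    Hypothetical helper for illustration: one feature, ordinary least squares.
    """
    n = len(x)
    p = 2  # number of fitted parameters: intercept and slope
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)

    # Ordinary least squares fit
    slope = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sxx
    intercept = y_bar - slope * x_bar

    residuals = [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]
    mse = sum(e ** 2 for e in residuals) / (n - p)

    # Leverage of each point (diagonal of the hat matrix)
    leverages = [1 / n + (xi - x_bar) ** 2 / sxx for xi in x]

    # D_i = e_i^2 / (p * MSE) * h_i / (1 - h_i)^2
    return [
        (e ** 2 / (p * mse)) * h / (1 - h) ** 2
        for e, h in zip(residuals, leverages)
    ]


# Made-up data: roughly y = 2x, with the last point pushed far off the line
x = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 12.0, 13.9, 16.1, 18.0, 35.0]

distances = cooks_distance(x, y)
threshold = 4 / len(x)  # assumed rule-of-thumb cutoff
flagged = [i for i, d in enumerate(distances) if d > threshold]
print(flagged)
```

On this toy data only the last observation exceeds the cutoff. With several features you would normally rely on a statistics library rather than hand-rolled formulas, and the cutoff is worth tuning to your data.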

Written by Hennie de Harder

📈 Data Scientist & ML Engineer 💡 Simplifying complex topics ✨ Sharing fun side projects 💻 Working at IKEA and BigData Republic 🐈 Love math, cats, & running
