skip to content

Search

Posts RSS feed

2025

  • A practical guide to creating representative development datasets through experimental design principles—randomization, grouping, and stratification—demonstrated with the MovieLens 100K recommendation dataset

  • A practical, thorough guide to training XGBoost using Optuna for hyperparameter optimization: defining the study space, trials and experiments; balancing search dimensionality vs. cost; random warm‑up, pruning, early stopping, resumes, parallelism, and picking a model by query instead of vibes.

  • This post is used for validating if duplicate tags are removed, regardless of the string case

2024