Test your inventory planning skills: cut holding costs, prevent shortages, and master real-world inventory planning.
A practical RL-style tuning notebook that wraps an organizer-like simulator to optimize ordering policies. It supports grid-tuned baseline, per‑Department Optuna tuning, and a feature‑driven policy search, each producing a ready‑to‑submit CSV in the platform’s index order.
This notebook is designed to turn the VN2 simulator into an optimization loop for actionable ordering policies.
What it does
Simulator wrapper: Weekly transition aligned with the organizer’s logic (no backorders, lead time 2, weekly review, holding on EndInv, shortage on missed sales).
Three submission paths:
Key assumptions
Inputs (Week 0)
How to run
1) Setup and simulator instantiation.2) Baseline stats/policy + grid evaluation; emit orders_round1_rl.csv.3) Optuna segment‑wise tuning; emit orders_round1_rl_segments.csv.4) Optuna feature policy search; emit orders_round1_rl_policy.csv.
Tuning tips
Repository
Keep up to date by participating in our global community of data scientists and AI enthusiasts. We discuss the latest developments in data science competitions, new techniques for solving complex challenges, AI and machine learning models, and much more!
Comments