According to arxiv.org, Meituan Group and Huazhong University of Science and Technology co-developed a real-time AI framework that reduced daily order cancellations on its meal delivery platform by over 25% — addressing a systemic bottleneck affecting 165,000 orders per day rejected by crowdsourced drivers.
The Scale of the Problem
These driver-rejected orders — termed “NA-canceled orders” — directly undermine customer retention and platform reputation. The research documents that such cancellations generate over 55% of all negative platform reviews (roughly 30,000 daily) and trigger billions of RMB annually in food waste compensation paid to restaurants. Unlike isolated service failures, this is a structural supply chain gap at the last-mile execution layer: insufficient real-time incentive alignment between platform, driver, and time-sensitive order fulfillment.
Why Traditional Incentives Fall Short
Meituan previously relied on fixed bonus rules — for example, 3 RMB after 10 minutes, 6 RMB after 20 minutes — which lack dynamic coordination across an order’s lifecycle. As the paper states, these static policies fail to optimize globally across stages (e.g., initial assignment, reassignment after timeout, escalation before cancellation), resulting in suboptimal budget use and persistent rejection spikes during peak volatility.
The Multi-Stage Bonus Allocation (MSBA) Framework
The team introduced a four-component system deployed at scale:
- Semi-black-box acceptance probability model: Estimates how bonus amounts affect driver acceptance likelihood, balancing interpretability and predictive accuracy using real-world behavioral data
- Lagrangian dual-based dynamic programming (LDDP) algorithm: Computes empirical Lagrangian multipliers offline for each stage using historical order-driver interaction data
- Online allocation algorithm: Makes real-time bonus decisions in milliseconds (O(1) complexity), leveraging precomputed multipliers
- Periodic control strategy: Adjusts allocations dynamically based on live order flow and remaining budget consumption
Validation and Real-World Impact
Offline experiments on Meituan’s historical dataset showed MSBA reduced canceled orders by ~25% versus single-stage allocation and ~50% versus unified bonus mechanisms. Crucially, online A/B tests on live production traffic confirmed sustained gains: over 25% reduction in cancellations and over 30% reduction in restaurant food waste compensation. The framework now processes millions of orders daily on Meituan’s platform.
This work exemplifies how granular, real-time incentive engineering — grounded in operations research and machine learning — directly strengthens supply chain resilience at the last-mile interface. For global supply chain professionals, it underscores that optimizing human-agent coordination in decentralized, time-critical networks requires not just visibility or automation, but adaptive, budget-aware decision logic embedded in production systems. Similar approaches are emerging elsewhere: Amazon’s Flex program uses dynamic surge pricing for delivery windows; Deliveroo’s 2021 pilot in London applied reinforcement learning to shift-based incentives; and Instacart’s 2023 earnings call noted improved shopper retention following rollout of tiered, context-aware bonuses. All reflect a broader industry pivot from static SLAs to responsive, data-driven labor economics within digital logistics ecosystems.
Source: arxiv.org
Compiled from international media by the SCI.AI editorial team.









