"A deep reinforcement learning framework for solving two-stage stochast" by Dogacan Yilmaz and Esra Büyüktahtakın
 

A deep reinforcement learning framework for solving two-stage stochastic programs

Document Type

Article

Publication Date

12-1-2024

Abstract

In this study, we present a deep reinforcement learning framework for solving scenario-based two-stage stochastic programming problems. Stochastic programs have numerous real-time applications, such as scheduling, disaster management, and route planning, yet they are computationally challenging to solve and require specially designed solution strategies such as hand-crafted heuristics. To the extent of our knowledge, this is the first study that decomposes two-stage stochastic programs with a multi-agent structure in a deep reinforcement learning algorithmic framework to solve them faster. Specifically, we propose a general two-stage deep reinforcement learning framework that can generate high-quality solutions within a fraction of a second, in which two different learning agents sequentially learn to solve each stage of the problem. The first-stage agent is trained with the feedback of the second-stage agent using a new policy gradient formulation since the decisions are interconnected through the stages. We demonstrate our framework through a general multi-dimensional stochastic knapsack problem. The results show that solution time can be reduced up to five orders of magnitude with sufficiently good optimality gaps of around 7%. Also, a decision-making agent can be trained with a few scenarios and can solve problems with many scenarios and achieve a significant reduction in solution times. Considering the vast state and action space of the problem of interest, the results show a promising direction for generating fast solutions for stochastic online optimization problems without expert knowledge.

Identifier

85160721399 (Scopus)

Publication Title

Optimization Letters

External Full Text Location

https://doi.org/10.1007/s11590-023-02009-5

e-ISSN

18624480

ISSN

18624472

First Page

1993

Last Page

2020

Issue

9

Volume

18

Grant

1554018

Fund Ref

Division of Mathematical Sciences

This document is currently not available here.

Plum Print visual indicator of research metrics
PlumX Metrics
  • Citations
    • Citation Indexes: 5
  • Usage
    • Abstract Views: 1
  • Captures
    • Readers: 17
  • Mentions
    • News Mentions: 1
see details

Share

COinS