JOURNAL ARTICLE

Optimization of 2D Irregular Packing: Deep Reinforcement Learning with Dense Reward.

  • Published In: International Journal of Semantic Computing, 2024, v. 18, n. 3. P. 405 1 of 3

  • Database: Applied Science & Technology Source Ultimate 2 of 3

  • Authored By: Crescitelli, Viviana; Oshima, Takashi 3 of 3

Abstract

This paper introduces a method to solve the 2D irregular packing problem using Deep Reinforcement Learning (Deep RL) for logistics. Our method employs a Q agent trained to predict the best placement within a container, maximizing available space. Unlike previous Deep RL algorithms, our method introduces a dense reward function at each packing step, providing immediate feedback and accelerating learning. To our knowledge, this is the first approach to use a dense reward to address the 2D irregular packing problem. Building on our earlier work, we improve the deep neural network by incorporating the Double Deep Q-Network (DDQN) framework to enhance our deep Q-learning approach, reducing overestimation biases and improving decision-making reliability. Simulation results show the method's effectiveness in completing the online 2D irregular packing tasks, achieving promising volume efficiency and packed piece metrics. This research extends our initial findings, highlighting the practical importance of DDQN and dense reward in advancing 2D irregular packing problem-solving. These advancements not only broaden the applications of deep learning but also hold practical importance for real-world logistics challenges. [ABSTRACT FROM AUTHOR]

Additional Information

  • Source:International Journal of Semantic Computing. 2024/09, Vol. 18, Issue 3, p405
  • Document Type:Article
  • Subject Area:Computer Science
  • Publication Date:2024
  • ISSN:1793351X
  • DOI:10.1142/S1793351X24430025
  • Accession Number:180169225
  • Copyright Statement:Copyright of International Journal of Semantic Computing is the property of World Scientific Publishing Company and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)

Looking to go deeper into this topic? Look for more articles on EBSCOhost.