Loading [MathJax]/extensions/MathMenu.js
RL-CSL: A Combinatorial Optimization Method Using Reinforcement Learning and Contrastive Self-Supervised Learning | IEEE Journals & Magazine | IEEE Xplore

RL-CSL: A Combinatorial Optimization Method Using Reinforcement Learning and Contrastive Self-Supervised Learning


Abstract:

Reinforcement learning-based methods have shown great potential in solving combinatorial optimization problems. However, the related research has not been mature in terms...Show More

Abstract:

Reinforcement learning-based methods have shown great potential in solving combinatorial optimization problems. However, the related research has not been mature in terms of both models and training methods. This paper proposes a method based on reinforcement learning and contrastive self-supervised learning. To be specific, the proposed method uses an attention model to learn a policy for generating solutions and combines a contrastive self-supervised learning model to learn the attention encoder in the way of node-by-node. Correspondingly, a two-phase learning method, including node-wise learning and solution-wise learning, is adopted to train the attention model and the contrastive self-supervised model jointly and collaboratively. The performance of the proposed method has been verified by numerical experiments on various combinatorial optimization problems.
Page(s): 1010 - 1024
Date of Publication: 25 January 2022
Electronic ISSN: 2471-285X

Funding Agency:


I. Introduction

The combinatorial optimization problems [1], [2], such as travel salesman problem (TSP) and vehicle routing problem (VRP), are frequently raised in various real-world applications fields and research fields [3]. Developing high-efficient methods to find optimal (or near-optimal) solutions from a finite set of discrete solutions for the combinatorial optimization problems has always been a hot research topic [4]. During the past decades, the methods developed for solving combinatorial optimization problems can be roughly grouped into three categories, i.e., exact methods [5], heuristic methods [6], [7], and learning-based methods [8], [9].

Contact IEEE to Subscribe

References

References is not available for this document.