Ant Colony Optimization with Policy Gradients and Replay

dc.contributor.authorJardee, William
dc.contributor.authorSheppard, John W.
dc.date.accessioned2025-12-01T19:18:50Z
dc.date.issued2025-07
dc.description.abstractAnt Colony Optimization (ACO) has served as a widely-utilized metaheuristic algorithm for decades for solving combinatorial optimization problems. Since its initial construction, ACO has seen a wide variety of modifications and connections to Reinforcement Learning (RL). Substantial parallels can be seen as early as 1995 with Ant-Q's relationship with Q-learning, through 2022 with ADACO's connection with Policy Gradient. In this work, we describe ACO, more specifically the Stochastic Gradient Descent ACO algorithm (ACOSGD), explicitly as an off-policy Policy Gradient (PG) method. We also incorporate experience replay into several ACO algorithm variants, including AS, MaxMin-ACO, ACOSGD, ADACO, and our two policy gradient-based versions: PGACO and PPOACO, drawing the connection to elitist ACO strategies. We show that our implementation of PG in ACO with experience replay and a baselined reward update strategy applied to eight TSP problems of varying sizes performs competitively with both fundamental ACO and SGD-based ACO versions. We also show that the replay buffer seems to unilaterally improve the performance of ACO algorithms through an ablation study.
dc.identifier.citationJardee, W., & Sheppard, J. (2025, July). Ant Colony Optimization with Policy Gradients and Replay. In Proceedings of the Genetic and Evolutionary Computation Conference (pp. 240-248).
dc.identifier.doi10.1145/3712256.3726452
dc.identifier.urihttps://scholarworks.montana.edu/handle/1/19560
dc.language.isoen_US
dc.publisherACM
dc.rightscc-by
dc.rights.urihttps://creativecommons.org/licenses/by/4.0
dc.subjectAnt Colony Optimization
dc.subjectAnt Algorithms
dc.subjectMetaheuristics
dc.subjectReinforcement Learning
dc.subjectReplay Buffer
dc.subjectPolicy Gradient
dc.titleAnt Colony Optimization with Policy Gradients and Replay
dc.typeArticle
mus.citation.extentfirstpage1
mus.citation.extentlastpage9
mus.citation.journaltitleProceedings of the Genetic and Evolutionary Computation Conference
mus.relation.collegeCollege of Engineering
mus.relation.departmentComputer Science
mus.relation.universityMontana State University - Bozeman

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ant-colony-opt-2025.pdf
Size:
925.46 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
825 B
Format:
Item-specific license agreed upon to submission
Description: