Reinforcement Learning for Combinatorial Optimization

1. Encoder의 변화에 따라

Pointer Network