Using reinforcement learning and $4.80 of GPU time to find the best HN post from Hacker News on 2024-10-28 17:17 (#6RT0K) Comments