Reinforcement Learning

Discussions center on the use, effectiveness, and application of reinforcement learning (RL) to the problem in the HN post, including comparisons to supervised learning, DeepMind approaches, and RL techniques like RLHF.

➡️ Stable 1.3x AI & Machine Learning
3,557
Comments
20
Years Active
5
Top Authors
#8121
Topic ID

Activity Over Time

2007
1
2008
1
2009
2
2010
4
2011
7
2012
5
2013
16
2014
22
2015
87
2016
239
2017
317
2018
379
2019
238
2020
243
2021
242
2022
243
2023
365
2024
346
2025
771
2026
29

Keywords

TensorFlow MEC LSTM AC A3C openai.com ReAgent AI TF OpenSpiel reinforcement rl learning agent deepmind reward rewards robot algorithm environment

Sample Comments

sharemywin Jun 24, 2016 View on HN

reinforcement learning goes along way too.

gwern May 25, 2017 View on HN

Advertising: if you're interested in RL, subscribe to https://www.reddit.com/r/reinforcementlearning/ !

mugivarra69 Aug 16, 2023 View on HN

seems like they gunning for deepmind rl based ideas.

b4je7d7wb Nov 15, 2022 View on HN

Maybe it's some reinforcement learning.

mkl95 May 2, 2023 View on HN

It looks like the wrong AI for the problem. RL should be more successful.

amelius Nov 3, 2022 View on HN

Has anyone attempted yet to run a reinforcement learning strategy on it?

6gvONxR4sf7o Oct 28, 2024 View on HN

Why use RL for this instead of plain old supervised learning?

romanows Apr 2, 2021 View on HN

They're clearly taking a reinforcement learning approach. Just a few thousand more returns and you'll be good to go!

throwanem Jun 4, 2024 View on HN

More likely reinforcement learning, I think.

adroniser Aug 10, 2024 View on HN

Isn't RL the algorithm we want basically?