Reinforcement Learning

Discussions center on the use, effectiveness, and application of reinforcement learning (RL) to the problem in the HN post, including comparisons to supervised learning, DeepMind approaches, and RL techniques like RLHF.

➡️ Stable 1.3x AI & Machine Learning

3,557

Comments

Years Active

Top Authors

#8121

Topic ID

Activity Over Time

2007

2008

2009

2010

2011

2012

2013

2014

2015

2016

239

2017

317

2018

379

2019

238

2020

243

2021

242

2022

243

2023

365

2024

346

2025

771

2026

Top Contributors

visarga (68) gwern (58) YeGoblynQueenne (30) Buttons840 (23) currymj (17)

Keywords

TensorFlow MEC LSTM AC A3C openai.com ReAgent AI TF OpenSpiel reinforcement rl learning agent deepmind reward rewards robot algorithm environment

Sample Comments

sharemywin • Jun 24, 2016 • View on HN

reinforcement learning goes along way too.

gwern • May 25, 2017 • View on HN

Advertising: if you're interested in RL, subscribe to https://www.reddit.com/r/reinforcementlearning/ !

mugivarra69 • Aug 16, 2023 • View on HN

seems like they gunning for deepmind rl based ideas.

b4je7d7wb • Nov 15, 2022 • View on HN

Maybe it's some reinforcement learning.

mkl95 • May 2, 2023 • View on HN

It looks like the wrong AI for the problem. RL should be more successful.

amelius • Nov 3, 2022 • View on HN

Has anyone attempted yet to run a reinforcement learning strategy on it?

6gvONxR4sf7o • Oct 28, 2024 • View on HN

Why use RL for this instead of plain old supervised learning?

romanows • Apr 2, 2021 • View on HN

They're clearly taking a reinforcement learning approach. Just a few thousand more returns and you'll be good to go!

throwanem • Jun 4, 2024 • View on HN

More likely reinforcement learning, I think.

adroniser • Aug 10, 2024 • View on HN

Isn't RL the algorithm we want basically?