OA5 tag

Gwern Branwen

Links

“Towards Playing Full MOBA Games With Deep Reinforcement Learning”, Ye et al 2020

“Mastering Complex Control in MOBA Games With Deep Reinforcement Learning”, Ye et al 2019

“Dota 2 With Large Scale Deep Reinforcement Learning”, Berner et al 2019

“OpenAI Five: 2016–2019”, OpenAI 2019

“Solving Rubik’s Cube With a Robot Hand”, OpenAI et al 2019

“Solving Rubik’s Cube With a Robot Hand [blog]”, OpenAI 2019

“An Empirical Model of Large-Batch Training”, McCandlish et al 2018

“How AI Training Scales”, McCandlish et al 2018

“Emergent Complexity via Multi-Agent Competition”, Bansal et al 2017

“Proximal Policy Optimization Algorithms”, Schulman et al 2017

“Net2Net: Accelerating Learning via Knowledge Transfer”, Chen et al 2015

“Dota 2 With Large Scale Deep Reinforcement Learning § Pg11”, Rerun 2024 (page 11 org openai)

“If you want to solve a hard problem in reinforcement learning, you just scale. It's just gonna work just like supervised learning. It's the same, the same story exactly. It was kind of hard to believe that supervised learning can do all those things, but it's not just vision, it's everything and the same thing seems to hold for reinforcement learning provided you have a lot of experience.”

Wikipedia

Dota 2
OpenAI Five

Link Bibliography

https://arxiv.org/abs/2011.12692#tencent: “Towards Playing Full MOBA Games With Deep Reinforcement Learning”, Deheng Ye, Guibin Chen, Wen Zhang, Sheng Chen, Bo Yuan, Bo Liu, Jia Chen, Zhao Liu, Fuhao Qiu, Hongsheng Yu

https://openai.com/research/how-ai-training-scales: “How AI Training Scales”, Sam McCandlish, Jared Kaplan, Dario Amodei
