[N] First OpenAI OA5 DoTA2 match begins livestreaming at The International (TI) tournament

gwern · 2018-08-23T00:53:20+00:00

And after a long tense game, Team Pain beats OA5!

One thing from the after-game discussion: the much-maligned '5 invincible couriers' has been reduced to 1 (vincible?) courier, as of Saturday 18 August. One HN comment says more heroes were available? I haven't seen anything on that yet.

More commentary from Cook: https://twitter.com/mtrc/status/1032413638311780352

courier reduction appears to have reduced aggression, and OA5 went for Roshan (twice) this time, even sacrificing part of their base. That's a big change.
as I pointed out before, humans should be able to adapt in-game in a way OA5 can't, giving them an advantage, with the historical example of the OA 1x1 agent getting crushed once humans had practiced against it; Brockman's Twitter implied as much today about previous matchups, and Cook says:

A crucial thread throughout all of this is adaptation. Dendi seemed to have not seen the bot before, and was blown away by it. But what we heard last year, and this year too, is that teams and players who get many runs at this find ways to crack the AI open. #OAI

Apparently the way these matchups work is that OA5 plays the losers of each TI day? So presumably each match will get harder; losing the first one bodes very poorly for the upcoming ones tomorrow & Friday, since not only do the opponents get to learn within-game and watch the previous games to do some (ahem) off-policy learning, they are also better than the team before.
On the frequent accusation of cheating via not really having 200ms reactions:

The system has a 200ms reaction time cap, but it's important to realise that in that 200ms it reads the entire game state - things offscreen, things a human has to click to read. So human-comparable reaction time, but superhuman information processing #OAI
See also https://www.reddit.com/r/DotA2/comments/94vdpm/openai_hex_was_within_the_200ms_response_time/

neither team drafted? huh? Isn't that much of the point? Sure, it ensures OA5 doesn't get a huge lead in the draft like it did in the Benchmark, but surely that's part of the game...
more odd, erratic, clearly mistaken behavior by OA5 when it gets behind:

OpenAI beginning to do a few strange things as they come under pressure. Gyrocopter uses a disabling spell on a single tiny monster, Death Prophet (tall ghost lady) casts her most important spell without any enemies nearby. #OAI

43 minutes in. The humans have taken more objectives, and are ahead on gold. More importantly, the bots are doing a few weird things - using important spells for odd reasons. But honestly it's a nailbiter still. The bots are bad at the big decisions, but the small ones? Surgical.

We're also seeing a few things used incorrectly, including an item called a "Refresher Shard" which refreshes the cooldown on spells and items. This is an item rarely seen - it appears on the third death of Rosh. It's likely the bots don't have as much experience using it. #OAI
:)

Holy shit, a team of bots played a 45 minute game against a pro-level team on stage at The International.

Dota2 subreddit: https://www.reddit.com/r/DotA2/comments/99idug/the_international_8_openai/ (they're very upset about scheduling)

HN: https://news.ycombinator.com/item?id=17823286

Currently brief comments at https://www.reddit.com/r/MachineLearning/comments/99ix2d/d_openai_five_loses_against_first_professional/

gwern · 2018-08-22T23:49:45+00:00

Previous: Benchmark competition: https://www.reddit.com/r/reinforcementlearning/comments/94uziv/openai_five_benchmark_crushes_audience_team/

mnbvcxzlkjhgfdssa · 2018-08-23T13:28:20+00:00

Where is this?

FatChocobo · 2018-08-23T00:56:27+00:00

I'd still like to know what the OpenAI Five team meant when they said the agent has a 200ms reaction time, since it was glaringly obvious from that game that that's not actually the case.
