Google's AlphaStar AI Is Already Defeating Top Starcraft Pros

In our battle of man vs machine, the future looks very promising for our AI overlords. This morning Google's DeepMind team showcased their AlphaStar AI agent against two Starcraft 2 pros.

While the AI dominated, there was still a glimmer of hope for mankind.

Starcraft 2 is currently considered a grand challenge for AI development among competitive games because of its complexity, the imperfect information given to players, and its mix of long-term planning and real-time actions.

So, of course, the DeepMind team had to conquer it.

In a livestream earlier this morning, replays of a series of games played by Starcraft 2 pros Grzegorz "Mana" Komincz and Dario "TLO" Wünsch against AlphaStar showed the AI decisively routing its human opponents.

All matches were Protoss vs Protoss on the Catalyst map under standard competitive conditions. This was to reduce training time for AlphaStar by allowing it to focus solely on one match-up.

TLO was the first pro to fall 5-0 to AlphaStar although he was at a distinct disadvantage as he normally plays Zerg instead of Protoss. Mana - who usually plays Protoss and was thus more comfortable in the match-up - fared a little better but also lost 5-0.

In the games that I was able to watch, AlphaStar made great use of micromanagement to take full advantage of its Stalker units while splitting the human player's focus across the map. One of the biggest challenges of playing Starcraft 2 is dividing your attention between multiple specific tasks. AlphaStar did not suffer from that problem.

Those matches took place in December, and AlphaStar has had time to learn more since then, so one more match was played between Mana and AlphaStar at the end of the stream. The funny thing is that humans are also capable of learning.

Mana managed to squeeze out a victory against AlphaStar by building an overwhelming force after surviving through an initial wave of economy disruption, using lessons learned from his previous defeats. Humanity is on the scoreboard!

"AlphaStar takes well-known strategies and turns them on their head. The agent demonstrated strategies I hadn’t thought of before, which means there may still be new ways of playing the game that we haven’t fully explored yet," said TLO.

You can download and watch the full replays here.

You may think that because AlphaStar is a machine, it would have a distinct mechanical advantage over human players, but the team at DeepMind made the games as even a battlefield as they could.

Actions per minute (APM) are given a disproportionate focus when people talk about Starcraft 2, and you would think that an AI agent would be able to easily crush human opponents by simply performing more actions than them. That was not the case with AlphaStar, which had a comparatively low average APM of around 280 in the matches against TLO and Mana.

Often AlphaStar's APM was below the pros', even during intense engagements.

The agent was also restricted to perceive and act on only on-screen information, just like a human player would.

An in-depth account of how the AlphaStar agent was trained can be found on the DeepMind blog.

It started by being fed anonymised replays of human games and was quickly able to defeat Blizzard's 'Elite' in-game AI with a 95% success rate. From there the agent was split into multiple copies with their own goals that competed against each other in an ever-growing league to build up knowledge of Starcraft 2's expansive strategic space through population-based reinforcement learning.

Over the course of the league, the agent developed new strategies. First there were the cheesy cannon rushes and Dark Templar builds that all new Starcraft 2 players quickly learn (and learn to loathe). Soon those strategies were supplanted by more economy-focused game plans that built a strong economy while disrupting the opponent's economy as much as possible.
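To make the league idea concrete, here is a toy sketch in Python. It is not DeepMind's implementation: the three "strategies", the `Agent` class, the win/loss weight update and every parameter are hypothetical stand-ins chosen for illustration, and the per-agent goals mentioned above are left out. It only shows the general shape of the loop, in which a copy of the latest agent trains against frozen earlier copies and then joins the growing league.

```python
import random

# Hypothetical stand-ins for build orders that beat each other in a cycle.
STRATEGIES = ["rush", "economy", "defence"]
BEATS = {"rush": "economy", "economy": "defence", "defence": "rush"}


def play(a, b):
    """Return 1 if strategy a beats b, -1 if b beats a, 0 for a draw."""
    if a == b:
        return 0
    return 1 if BEATS[a] == b else -1


class Agent:
    """A toy agent: a weighted preference over the strategies."""

    def __init__(self, weights):
        self.weights = dict(weights)

    def pick(self, rng):
        names = list(self.weights)
        return rng.choices(names, weights=[self.weights[n] for n in names])[0]

    def learn(self, choice, result, lr=0.1):
        # Crude reinforcement (an assumption, not AlphaStar's update rule):
        # raise the weight of a winning choice, lower it after a loss.
        self.weights[choice] = max(0.01, self.weights[choice] + lr * result)

    def snapshot(self):
        # A frozen copy; later training does not change it.
        return Agent(self.weights)


def train_league(generations=20, games_per_generation=200, seed=0):
    rng = random.Random(seed)
    league = [Agent({s: 1.0 for s in STRATEGIES})]
    for _ in range(generations):
        learner = league[-1].snapshot()
        for _ in range(games_per_generation):
            opponent = rng.choice(league)  # any past or present league member
            mine, theirs = learner.pick(rng), opponent.pick(rng)
            learner.learn(mine, play(mine, theirs))
        league.append(learner)  # the league grows by one agent per generation
    return league
```

Calling `train_league()` returns the full league, oldest agent first. Because each new agent must do well against every earlier snapshot, a strategy that only beats the previous generation tends to get punished, which is the intuition behind the shift from early cheese to more robust game plans.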

Some of these tactics, including a deliberate sacrifice of two Oracles to destroy enemy workers, were used in the matches against the pro players.

[DeepMind]


Comments

    From there the agent was split into multiple copies with their own goals that competed against each other in an ever-growing league to build up knowledge of Starcraft 2's expansive strategic space through population-based reinforcement learning.

    This is truly scary stuff. If AI somehow plots a takeover of the world, we will never see it coming and we will have no chance of resisting.

      Even without an AI uprising, this sort of tech applied to autonomous robots on the battlefield is a scary thought.

    Maybe they should try it against some Korean pros.

      I know he is not Korean, but apparently Serral will be facing off against AlphaStar later this year, and he is the first non-Korean to win BlizzCon and GSL vs the World, and he won 4 out of 5 IEMs last year.

        Yeah Serral's pretty crazy and comfortably the best non-Korean currently. He's arguably better than many Koreans too. Once again though he's a Zerg main so I hope he's not forced to play Protoss.

    Actions per minute (APM) are given a disproportionate focus when people talk about Starcraft 2 and you would think that an AI agent would be able to easily crush human opponents by simply doing more actions than them. That was not the case with AlphaStar, which had the comparatively low average APM of around 280 in the matches against TLO and Mana.

    We all know that high APM != doing more actions. It is more like high APM == doing more useless actions. The AI will use the fewest actions possible to get the maximum effect.

    I remember reading somewhere else that people were raising their APM by doing useless actions like constantly selecting and deselecting a group of units without doing anything, just to raise APM. Kinda silly.

      They do that in the beginning to keep mentally active so that when they enter a battle they can still macro while doing insane levels of micro.

        I wish that were the case, but I can tell you that if you watch the stream you will notice a lot of extra useless actions. That is simply the reality of human vs AI.

          I watched an interview with... I want to say it was Boxer, back when Brood War was still a thing. He said he does that random clicking stuff at the start to keep his hands warmed up and mind sharp, so that when he needs the high APM his body is ready.

    Do you want Skynet? Because this is how you get Skynet

    Would be interesting to see exactly how this 'AI' interfaces with the game. I would consider reading memory for data about its opponents as cheating.

    That being said I haven't read up on it, so who knows what magical things lie underneath its hood.

    Last edited 25/01/19 2:01 pm

      "The agent was also restricted to perceive and act on only on-screen information, just like a human player would."
