This is the author s version of a work that was submitted/accepted for publication in the following source:

Similar documents
Evolutionary Game Design

Sudoku puzzles and how to solve them

Symmetrical Pentomino Pairs

ALGEBRA. sequence, term, nth term, consecutive, rule, relationship, generate, predict, continue increase, decrease finite, infinite

Sudoku Puzzles Generating: from Easy to Evil

Analysis of Micromouse Maze Solving Algorithms

Kenken For Teachers. Tom Davis June 27, Abstract

The resulting tile cannot merge with another tile again in the same move. When a 2048 tile is created, the player wins.

Determining Degree Of Difficulty In Rogo, A TSP-based Paper Puzzle

A Review of Sudoku Solving using Patterns

MAKING MATH MORE FUN BRINGS YOU FUN MATH GAME PRINTABLES FOR HOME OR SCHOOL

Genetic Algorithms and Sudoku

Gamesman: A Graphical Game Analysis System

Three daily lessons. Year 5

Monte-Carlo Methods. Timo Nolle

Chapter 25: Exchange in Insurance Markets

A Knowledge-based Approach of Connect-Four

Battleship. Big bands

Making Decisions in Chess

Using Emergent Behavior to Improve AI in Video Games

Mathematical Induction

Colored Hats and Logic Puzzles

How To Play Go Lesson 1: Introduction To Go

Ready, Set, Go! Math Games for Serious Minds

Sue Fine Linn Maskell

Web Data Extraction: 1 o Semestre 2007/2008

Introduction Solvability Rules Computer Solution Implementation. Connect Four. March 9, Connect Four

Hooray for the Hundreds Chart!!

Note: To increase your bet by another amount, select another chip from the bottom right of the game panel.

Reinforcement Learning and the Reward Engineering Principle

The Circumference Function

Generalized Widening

JOURNEY THROUGH CHESS

Session 6 Number Theory

LAMC Beginners Circle: Parity of a Permutation Problems from Handout by Oleg Gleizer Solutions by James Newton

Pedagogical Possibilities for the 2048 Puzzle Game

Offline 1-Minesweeper is NP-complete

Math Quizzes Winter 2009

Minimax Strategies. Minimax Strategies. Zero Sum Games. Why Zero Sum Games? An Example. An Example

Name Partners Date. Energy Diagrams I

Teachers should read through the following activity ideas and make their own risk assessment for them before proceeding with them in the classroom.

CALCULATIONS & STATISTICS

Video Game Description Language (VGDL) and the Challenge of Creating Agents for General Video Game Playing (GVGP)

A NOTE ON OFF-DIAGONAL SMALL ON-LINE RAMSEY NUMBERS FOR PATHS

Lesson Plan Vats Grade 8 Write Algebraic Expressions

Math Games For Skills and Concepts

Evolving Combat Robots

Sowing Games JEFF ERICKSON

E XPLORING QUADRILATERALS

Keywords-Chess gameregistration, Profile management, Rational rose, Activities management.

Freehand Sketching. Sections

A Determination of g, the Acceleration Due to Gravity, from Newton's Laws of Motion

Lab 11. Simulations. The Concept

SMART NOTEBOOK 10. Instructional Technology Enhancing ACHievement

Point and Interval Estimates

Inv 1 5. Draw 2 different shapes, each with an area of 15 square units and perimeter of 16 units.

The Open University s repository of research publications and other research outputs

The Principles of Training

Fighting an Almost Perfect Crime

OPM3. Project Management Institute. OPM3 in Action: Pinellas County IT Turns Around Performance and Customer Confidence

Lessons Learned using CBR for Customer Support

TEACHER S GUIDE TO RUSH HOUR

The Theory of Tetris

Decision Making under Uncertainty

Contents. Sample worksheet from

Interactive Web-based Teaching for Computing in an Engineering Degree*

Fuzzy Cognitive Map for Software Testing Using Artificial Intelligence Techniques

VARIANCE REDUCTION TECHNIQUES FOR IMPLICIT MONTE CARLO SIMULATIONS

Escher Style Tessellations

Adversarial Hierarchical-Task Network Planning for Complex Real-Time Games

Rules for TAK Created December 30, 2014 Update Sept 9, 2015

2048 is NP-Complete. Ahmed Abdelkader Aditya Acharya Philip Dasler. Theorem GAME is NP-Complete.

Roulette Wheel Selection Game Player

The art of linking the elements.

Minesweeper as a Constraint Satisfaction Problem

This puzzle is based on the following anecdote concerning a Hungarian sociologist and his observations of circles of friends among children.

ALGEBRA. Find the nth term, justifying its form by referring to the context in which it was generated

All sessions were observed at each of the Academy Venues

Holland s GA Schema Theorem

Ian Stewart on Minesweeper

Instructions. Tic, Tac, Toe Take turns placing a piece down. The first player to get 3 in a row (up, down or diagonally) wins!

FHI TECHNICAL REPORT. Racing to the Precipice: a Model of Artificial Intelligence Development. Stuart Armstrong Nick Bostrom Carl Shulman

BPM: Chess vs. Checkers

Magic Word. Possible Answers: LOOSER WINNER LOTTOS TICKET. What is the magic word?

ACL Soccer 4 v 4 Small Sided Games (SSG s)

There are a number of superb online resources as well that provide excellent blackjack information as well. We recommend the following web sites:

Using Microsoft Word. Working With Objects

A Sarsa based Autonomous Stock Trading Agent

ExmoR A Testing Tool for Control Algorithms on Mobile Robots

SIMS 255 Foundations of Software Design. Complexity and NP-completeness

Degeneracy in Linear Programming

8. KNOWLEDGE BASED SYSTEMS IN MANUFACTURING SIMULATION

Software review The role of social networks in marketing

On the Laziness of Monte-Carlo Game Tree Search in Non-tight Situations

SuperSpeed Math. Addition, Subtraction, Multiplication, Division And the Gnarlies!

Enumerating possible Sudoku grids

How to Design Your Perfect Kitchen in Just 15 Minutes Using Microsoft Word

4 Year Primary Degree QTS Student Survey Summer 2007

Escalator to Pips Pipping Your Way to The Top

A Client-Server Interactive Tool for Integrated Artificial Intelligence Curriculum

Transcription:

This is the author s version of a work that was submitted/accepted for publication in the following source: Browne, Cameron B. (2014) What can game AI teach us? ICGA Journal, 37(3), pp. 171-175. This file was downloaded from: http://eprints.qut.edu.au/85004/ c Copyright 2014 International Computer Games Association Notice: Changes introduced as a result of publishing processes such as copy-editing and formatting may not be reflected in this document. For a definitive version of this work, please refer to the published source:

1 WHAT CAN GAME AI TEACH US? Cameron Browne 1 QUT, Brisbane, Australia ABSTRACT Artificial intelligence (AI) applications typically involve encoding expert knowledge in machine form to find optimal solutions for a given problem. However, this paper deals with the opposite process of extracting new and human-comprehensible insights from emergent AI behaviour. Some examples of useful game-related insights drawn from observing AI players in action are presented. 1. INTRODUCTION Artificial intelligence (AI) and machine learning approaches have proved invaluable for the computer-based analysis and playing of games. AI approaches typically encode expert knowledge about a given domain, so that machines can exploit this knowledge to find optimal solutions for the problem at hand. However, this paper deals with the opposite process: What can game AI teach us that we do not already know? This question harks back to Turing s landmark paper Computing Machinery and Intelligence (Turing, 1950). In response to the objection that machines can never really do anything new that they have not been programmed to do, Turing points out the many times that machines have surprised him with solutions to problems beyond his understanding of them, which he argues is a form of creative mental act. The question of what AI can teach us becomes important if we wish AI to progress beyond mimicking our own understanding of the world, and produce new insights into problems that might be generalised to other contexts. This paper presents four recent cases of such creative surprises from my own work in AI game analysis. Each case describes unexpected or emergent AI behaviour yielding novel (to me) insight into a recently invented game. Each case involves a different aspect of game analysis, but together they point to a trend of AI players yielding new and useful knowledge in game-related domains. 2. DESIGN RULES The combinatorial board game Ndengrod was invented by the computer program LUDI in 2007, through the evolution of rule sets and their evaluation through AI self-play (Browne, 2011). 2 Ndengrod is played on a hexagonal grid, on which players take turns placing a piece of their colour and aim to make 5-in-a-row, subject to Go-like surround capture; following each move, connected groups of enemy pieces with no freedom (i.e. adjacent empty cells) are captured and removed. Now, the board game Go requires a special rule called the ko rule, which states that the player cannot make a move which would repeat the board state of the previous turn, in order to avoid infinite cycles of play. For example, Figure 1 shows a board position (left) in which White captures a black stone by removing its last freedom with move a (middle). Black cannot then immediately recapture the white piece by playing again at the point marked, as that would return the board state to that of the previous turn and constitute a cycle. Black must play 1 email:c.browne@qut.edu.au 2 Incidentally, such evolutionary approaches were also first proposed in Turing s landmark 1950 paper.

2 ICGA Journal June 2014 elsewhere before making this move on a future turn, if so desired. The ko rule is typically needed to avoid cycles in any game involving Go-like surround capture on the square grid. a Figure 1: The ko rule: immediate recapture at the point marked is forbidden. However, Ndengrod excels as a game even though it evolved without a ko rule. AI analysis through self-play revealed that games were balanced, ended in good time, and followed trends in play consistent with other games deemed to be interesting by human players. In fact, Ndengrod was ranked the #1 game of all the evolved rule sets by both LUDI and human play testers. So why does it succeed without ko where similar games do not? The answer lies in the geometry of the hexagonal grid, which simply negates the need for a ko rule. Consider the situation shown in Figure 2 (left), which is a hexagonal analogy of the position shown in Figure 1. White can capture the black piece by removing its last freedom with move b (right), but Black is now unable to immediately recapture the white piece just played; local move cycles cannot occur and the ko rule is not needed in this case. The reasons for this are explained in (Browne, 2012), but essentially boil down to the fact that cells on the hexagonal grid do not have diagonal neighbours. b Figure 2: No immediate recapture on the hexagonal grid. The fact that a ko rule is not needed on hexagonal grids is known among Go players who have experimented with hexagonal variants, and is somewhat obvious in hindsight. But this fundamental design principle was unknown to me, and would probably have remained so except for the AI s recognition of this rule set. 3. SOLVABILITY BoxOff is a solitaire puzzle game invented by American game designer Steven Meyers (2013). The standard game is played with 16 pieces in each of three colours, randomly placed on a 6 8 square grid. The aim is to clear the board by successively removing pairs of same-coloured pieces that form opposite corners of rectangles that do not contain any other pieces. Figure 3 (left) shows a typical BoxOff challenge ready to play. BoxOff is elegant and interesting to play, but there remained a concern about the solvability of randomly generated challenges: just how likely are such challenges to actually have solutions? This question prompted a recent study of the game based on random sampling of the challenge space (Browne and Maire, 2014) that produced a surprising result, summarised in the graph shown in Figure 3 (right). This graph shows the observed probabilities with which two given AI players are able to solve randomly sampled challenges over a range of board sizes. The first AI player, P r (dotted line), plays a random game by selecting a legal move each turn uniformly at random from those available. The second AI player, P s (solid line), is a perfect solver that finds the first available solution (if any) through depth-first backtracking of legal moves.

3 1.5 0 0 16 32 48 64 Figure 3: A 6 8 BoxOff challenge (left) and P r (dotted) and P s (solid) solvability rates per board size (right). It can be seen that P r solvability (dotted line) starts high for small boards then drops quickly to around 10% for medium and large board sizes. P s solvability, on the other hand, initially follows a similar trend but then unexpectedly inflects to increase again for larger board sizes. The arrows in Figure 3 (right) indicate the discrepancy between P r and P s for the standard 6 8 = 48 cell board. Surprisingly, it turns out that over 99.9% of randomly generated challenges on the standard board will be solvable, but that only around 10% will be solvable through random play. This means that players can be reassured that any challenge they generate randomly will almost certainly have a solution, but that those challenges will not likely be trivial to solve, validating the designer s choice of board size and colour count. A similar analysis of the 1D case (i.e. 1 N board sizes) reveals the 1D game to be trivial. That is, for any given 1D BoxOff challenge, either every move will be a winning move (if a solution exists) or every move will be a losing move (if no solution exists). This property is proven in Browne and Maire (2014), but was not even suspected until AI analysis revealed identical solvability profiles for both P r and P s for the 1D case. 4. EMBEDDED STRATEGIES 2048 is a digital solitaire puzzle game recently designed by Gabriele Cirulli (2014). The game is played on a 4 4 square grid, in which the player can push numbered tiles in each of the four cardinal directions, causing tiles to move as far as possible in each direction thus pushed and merge with same-numbered tiles they collide with, with the aim being to form a tile at least 2048 in value. Despite being an update of similar previous games, 2048 became an instant hit due to its elegant design, ease of use and addictive game play. 1024 256 128 16 1024 256 128 16 128 64 16 4 128 64 16 4 16 4 2 4 2 16 8 2 2 2 Figure 4: A 2048 position before (left) and after (right) an upward move. Figure 4 shows a typical 2048 board position (left) and the result of an upward move (right). Note that the two 4 tiles merge to form an 8 tile, the 2 tile moves up as far as it can go, and a new 2 tile is randomly placed at an empty cell (bottom right) as part of the move; 2048 is a non-deterministic game.

4 ICGA Journal June 2014 As a player, I initially found 2048 quite interesting, but soon tired of pushing numbers around without purpose and gave up on it. It was not until seeing an AI solver by Matt Overlan (2014) in action that the game clicked into place. From observing a single play through, it became obvious that an effective strategy is to adopt a circular movement style that pushes higher valued tiles up and towards a top corner. The basic strategy is as follows: move left if possible, otherwise move up if possible, otherwise move right if possible. This strategy pushes tiles of increasing value up and towards the top left corner, as shown in Figure 4, using the lower rows for rough working. Downward moves should generally be avoided, otherwise the beneficial ordering is likely to be disrupted and require some work to repair. This strategy must be overridden from time to time as the situation dictates in order to actually reach a solution, which keeps the game interesting, but it provides an effective basis for general play that makes most challenges solvable. This learnt strategy is a case of knowledge transfer from the programmer to the player, much like an inadvertent tutorial, as the strategy is scripted into the AI solver rather than an emergent behaviour. However, it is still another case of new insights learnt from observing an AI player, which has rekindled my enjoyment of this game and raised my playing strength to another level. 5. EMERGENT STRATEGIES Quantum Leap is a new combinatorial board game designed by Néstor Romeral Andrés (2014). The game is played on a hexagonal board (5 cells per side), and starts with the board randomly filled with 30 white and 31 black stones. Starting with White, players take turns moving a piece of their colour in a straight line exactly N cells (where N is that piece s number of friendly adjacent neighbours) to capture an enemy piece by replacement. A player loses if they have no legal moves on their turn. Figure 5 shows a typical board position with Black to play (left), and the result of a distance-2 capture by a black piece with two friendly neighbours (right). Figure 5: A Quantum Leap position before (left) and after (right) the indicated move. An AI player was developed to explore the game, 3 using a basic implementation of the Upper Confidence Bounds for Trees (UCT) algorithm. This app plays at a reasonable level even though the UCT player uses strictly random playouts, without any of the standard enhancements or domain knowledge exploitation (Browne et al., 2012). It was therefore satisfying to see the AI player discover an emergent strategy outlined by the game s designer in his description of the rules, namely that players should strive to form their pieces into connected groups where possible. This can be seen in Figure 5 (right), where the AI player (Black) has formed two dominant groups which can be easily joined, while the human player (White) has many small isolated groups spread across the 3 Available at: http://www.cameronius.com/research/quantum Leap 1.0.jar

5 board to give a much weaker position. The advantage of forming groups is that they allow greater freedom of movement, as their component pieces mutually adjoin each other. Note that this strategy is not implicit in the rules, but emerged naturally during AI search simply by following statistically superior lines of play. Further, the game s designer noticed an additional nicety when playing against the AI; not only did the AI tend to form its pieces into groups and split enemy groups (as per the move shown in Figure 5), but the groups thus formed tended to be long strings of pieces rather than solid clumps. 4 Most AI pieces therefore had potential moves of distance 1, 2 or 3, which turn out to be the optimal movement distances for this game. 5 An AI based on strictly random simulations, with no tactical or strategic knowledge about the game, inherently found not only a known strategy but also a refinement of this strategy known only to the game s designer. 6. CONCLUSION The examples outlined above demonstrate cases of AIs teaching something new (to me) about various games. While each case concerns a different aspect of game analysis, together they constitute a pattern of AIs yielding new insights that have enhanced my understanding and enjoyment of these games, which has the potential for true machine intelligence in a sense that Turing might have appreciated. The generalisation of such insights to new contexts could also hold promise for progress towards artificial general intelligence (AGI). However, it is important to note that these insights were not recognised as such by the AIs in question and presented to the user in human-comprehensible form, but were observed subjectively as a side-effect of AI play and could easily have been overlooked. Such insights possibly occur all the time, but go unnoticed in the absence of an experienced observer to appreciate their value. The obvious next step towards true machine intelligence is to not only produce such insights, but also to automatically recognise, exploit and report them to the user in comprehensible form, e.g. as game design guidelines or strategic advice. ACKNOWLEDGEMENTS This work was funded by a Vice-Chancellor s Research Fellowship from the Queensland University of Technology (QUT), Brisbane, Australia, as part of the project Games Without Frontiers. 7. REFERENCES Browne, C. (2011). Evolutionary Game Design. Springer, Berlin. Browne, C. (2012). Go Without Ko on Hexagonal Grids. International Computer Games Association (ICGA) Journal, Vol. 35, No. 1, pp. 37 40. Browne, C. and Maire, F. (2014). Monte Carlo Analysis of a Puzzle Game. IEEE Transactions on Computational Intelligence and AI in Games, Vol. 6 (submitted). Browne, C., Powley, E., Whitehouse, D., Lucas, S., Cowling, P. I., Rohlfshagen, P., Tavener, S., Perez, D., Samothrakis, S., and Colton, S. (2012). A Survey of Monte Carlo Tree Search Methods. IEEE Transactions on Computational Intelligence and AI in Games, Vol. 4:1, pp. 1 43. Cirulli, G. (2014). 2048. http://gabrielecirulli.github.io/2048/. Meyers, S. (2013). BoxOff: A New Solitaire Board Game. GAMES, Vol. 37, No. 6, pp. 12 13. Overlan, M. (2014). 2048 AI Solver. http://ov3y.github.io/2048-ai/. Romeral Andrés, N. (2014). Yavalath & Co. Lulu, Raleigh. Turing, A. (1950). Computing Machinery and Intelligence. Mind, Vol. LIX, pp. 433 460. 4 Personal correspondence. 5 Moves of distance 4 or more (i.e. more than 3 neighbours) become difficult to play within the board area and limit the player s options.