Posted on Jan 23, 2021 • Originally published at rakeroutes.com on Feb 28, 2012

Fun with Rock, Paper, Scissors

James Edward Gray II’s Ruby Quiz #16 was to implement Rock, Paper, Scissors playing classes to compete on a playing field managed by a given Game class. Today we revisit this quiz for a bit of coding fun. We’ll implement some simple players, then move on to some basic metaprogramming techniques to write players that manipulate each other or the game itself.

If you want to follow along:

$ git clone git://github.com/sdball/ruby_quiz.git
$ cd ruby_quiz/16
# play the "always rock" player against the "always scissors" player
$ ruby -I . rock_paper_scissors.rb players/always_rock.rb players/always_scissors.rb
AlwaysRock vs. AlwaysScissors
AlwaysRock: 1000
AlwaysScissors: 0
AlwaysRock Wins

Simple Player

Right. So here’s what a very simple player class looks like.

# Michael Bluth
class AlwaysRock < Player
  def choose
    :rock
  end
end

The rock_paper_scissors.rb file defines a Player class, a Game class, and provides a script to run the contest. Our own custom player classes have to inherit from the given Player class and implement a choose method that returns :rock, :paper, or :scissors. Our players can optionally implement a result method, which the game calls back after each round so that our players can see the result of their play. If a player defines an initialize method, it is called with the class name of the opponent as a string.
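To make that contract concrete, here is a minimal sketch of a player that uses all three hooks. The Player class below is a stand-in written for this example so the snippet runs on its own; the real base class lives in rock_paper_scissors.rb and may differ. The Historian name, its history reader, and the :win outcome value are assumptions for illustration.

```ruby
# Stand-in for the quiz's Player base class (an assumption, so this runs alone).
class Player
  def initialize(opponent_name)
    @opponent_name = opponent_name
  end

  def result(my_move, opponent_move, outcome)
    # default: ignore results
  end
end

# Hypothetical player that exercises all three hooks.
class Historian < Player
  attr_reader :opponent_name, :history

  # Called with the opponent's class name.
  def initialize(opponent_name)
    super
    @history = []
  end

  # Required: return :rock, :paper, or :scissors.
  def choose
    :rock
  end

  # Optional callback: the game reports each round back to the player.
  def result(my_move, opponent_move, outcome)
    @history << [my_move, opponent_move, outcome]
  end
end

player = Historian.new("AlwaysScissors")
move = player.choose
player.result(move, :scissors, :win) # :win is an assumed outcome value
puts player.history.inspect
```

Here we call the hooks by hand; in a real match the Game does the calling.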

So, back to Michael Bluth (or Poor Predictable Bart) who always chooses rock. His choose method returns the symbol :rock. Easy. Let’s write a player to beat him. Also easy.

# GOB
class AlwaysPaper < Player
  def choose
    :paper
  end
end

GOB will beat Michael Bluth 100% of the time.

$ ruby -I . rock_paper_scissors.rb players/always_rock.rb players/always_paper.rb
AlwaysRock vs. AlwaysPaper
AlwaysRock: 0
AlwaysPaper: 1000
AlwaysPaper Wins

Ok, now let’s write a player to beat them both. Consistently.

Reactive Player

class Reactive < Player
  MOVE_THAT_BEATS = {
    rock: :paper,
    paper: :scissors,
    scissors: :rock
  }

  def choose
    @my_move || :paper
  end

  def result(my_move, opponent_move, outcome)
    @my_move = MOVE_THAT_BEATS[opponent_move]
  end
end

This player is kind of interesting, but still straightforward. Reactive has a constant MOVE_THAT_BEATS that maps moves to the move that would beat them. Reactive then uses that knowledge to play the move that beats the last move played by its opponent (or :paper initially). This strategy should prove extremely effective against a player who always makes the same move.

$ ruby -I . rock_paper_scissors.rb players/always_rock.rb players/always_paper.rb players/reactive.rb
AlwaysRock vs. Reactive
AlwaysRock: 0
Reactive: 1000
Reactive Wins
AlwaysPaper vs. Reactive
AlwaysPaper: 0.5
Reactive: 999.5
Reactive Wins

Yep, AlwaysRock loses every game because Reactive plays paper from the start. AlwaysPaper gets in one draw, since both players open with paper, then loses every game afterwards.
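If you’d like to watch that draw-then-lose sequence without the full game script, here is a rough sketch. The Player base class and the play_match referee are stand-ins invented for this example (the real Game class may score and report differently); AlwaysPaper and Reactive are the players from above.

```ruby
# Stand-in base class (assumption; the real one is in rock_paper_scissors.rb).
class Player
  def initialize(opponent_name); end
  def result(my_move, opponent_move, outcome); end
end

class AlwaysPaper < Player
  def choose
    :paper
  end
end

class Reactive < Player
  MOVE_THAT_BEATS = { rock: :paper, paper: :scissors, scissors: :rock }

  def choose
    @my_move || :paper
  end

  def result(my_move, opponent_move, outcome)
    @my_move = MOVE_THAT_BEATS[opponent_move]
  end
end

# Hypothetical mini-referee: 1 point per win, 0.5 each for a draw.
def play_match(first, second, rounds)
  scores = [0.0, 0.0]
  rounds.times do
    a = first.choose
    b = second.choose
    if a == b
      scores[0] += 0.5
      scores[1] += 0.5
      first.result(a, b, :draw)
      second.result(b, a, :draw)
    elsif Reactive::MOVE_THAT_BEATS[b] == a
      # a is the move that beats b, so the first player wins the round
      scores[0] += 1
      first.result(a, b, :win)
      second.result(b, a, :lose)
    else
      scores[1] += 1
      first.result(a, b, :lose)
      second.result(b, a, :win)
    end
  end
  scores
end

scores = play_match(AlwaysPaper.new("Reactive"), Reactive.new("AlwaysPaper"), 1000)
puts scores.inspect # [0.5, 999.5]
```

Round one is paper against paper; from round two on Reactive answers paper with scissors every time.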

Now, let’s write a player that will beat all of the players thus far.

Random Player

Theoretically, the best strategy in Rock, Paper, Scissors is to be completely random. That’s a pretty easy strategy for a computer to play (although surprisingly difficult for a human) so let’s code that up next.

class RandomPlayer < Player
  def choose
    [:rock, :paper, :scissors].shuffle.first
  end
end
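As an aside, Array#sample picks one random element directly and should behave the same as shuffle.first for our purposes. The sketch below is only a sanity check that the picks come out roughly even; the MOVES constant, the seed value, and the round count are arbitrary choices for this example.

```ruby
MOVES = [:rock, :paper, :scissors]

# A seeded generator makes the run repeatable; the seed value is arbitrary.
rng = Random.new(2012)

counts = Hash.new(0)
3000.times { counts[MOVES.sample(random: rng)] += 1 }

# Each move should land somewhere near 1000; exact counts depend on the seed.
MOVES.each { |move| puts "#{move}: #{counts[move]}" }
```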

Well that was easy. Let’s pit Random against some of the players so far.

$ ruby -I . rock_paper_scissors.rb players/always_rock.rb players/random.rb
# out of 10 runs, Random won 8 games
$ ruby -I . rock_paper_scissors.rb players/reactive.rb players/random.rb
# out of 10 runs, Random won 8 games

Fun, but not all that interesting. How about we write a player who watches their opponent and builds a strategy to defeat them? This player is going to be a bit more complex than those we’ve written so far.

Pattern Matching Player

The plan: build up a pattern memory of player moves, opponent moves, and the next move the opponent played under those conditions. After every move:

remember the moves that were just played
record the previous game’s moves together with the move the opponent played next
use that record of game patterns to determine the opponent’s next likely move and play the move that beats it

This pattern set will allow us to make observations such as “the last ten times I played rock and my opponent played paper, my opponent’s next move was scissors” or “out of the last 97 times I played rock and my opponent played scissors, my opponent played scissors next 96% of the time”.
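An observation like that boils down to counting. Here is a small sketch of the arithmetic, using made-up data (25 observations rather than 97) so the numbers are easy to check by hand; the variable names are illustrative only.

```ruby
# Hypothetical observations: what the opponent played right after rounds
# where I played rock and they played scissors.
observed = [:scissors] * 24 + [:paper]

# Count how often each follow-up move was seen.
counts = observed.each_with_object(Hash.new(0)) { |move, tally| tally[move] += 1 }

# Pick the most frequent follow-up and express it as a percentage.
likely_move, hits = counts.max_by { |_move, count| count }
share = (100.0 * hits / observed.size).round

puts "#{likely_move} followed #{share}% of the time" # scissors followed 96% of the time
```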

class PatternMatching < Player
  MOVE_THAT_BEATS = {
    rock: :paper,
    paper: :scissors,
    scissors: :rock
  }

  def initialize(opponent)
    @first_game = true
    @patterns = {
      rock: {},
      paper: {},
      scissors: {}
    }
  end

  def choose
    @my_move || :rock
  end

  def result(mine, theirs, outcome)
    store_moves(mine, theirs)
    plan_next_move
    @first_game = false
  end

  private

  def store_moves(mine, theirs)
    unless @first_game
      store_pattern(mine, theirs)
    end
    @my_last_move = mine
    @their_last_move = theirs
  end

  def store_pattern(mine, theirs)
    if @patterns[@my_last_move][@their_last_move]
      @patterns[@my_last_move][@their_last_move] << theirs
    else
      @patterns[@my_last_move][@their_last_move] = [theirs]
    end
  end

  def plan_next_move
    @my_move = MOVE_THAT_BEATS[their_likely_next_move] || :rock
  end

  def their_likely_next_move
    their_moves = @patterns[@my_last_move][@their_last_move]
    return [:rock, :paper, :scissors].shuffle.first if their_moves.nil?

    count = Hash.new(0)
    their_moves.each do |move|
      count[move] += 1
    end
    count.sort_by {|key, value| value}.last.first
  end
end

This player looks complicated, but the only tricky logic is around the whole “Is this the first game or not?” I’m not entirely happy with the way I’ve worked it out here, but it gets the job done.

store_pattern and their_likely_next_move are the two key methods. The move patterns are stored as nested hashes:

[my move][opponent move] => [array of observed moves]
[:rock][:paper] => [ :scissors, :paper, :scissors, :scissors, :scissors, :paper ]

The their_likely_next_move method looks into the pattern and counts up the data seen so far. If there’s no data yet, it guesses randomly.
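For comparison, here is one possible alternative to the if/else in store_pattern: a Hash with a default block that creates the inner hash and the array on first access. This is a sketch of a different technique, not the code the player above uses, and the sample history is the made-up one from the layout example.

```ruby
MOVE_THAT_BEATS = { rock: :paper, paper: :scissors, scissors: :rock }

# patterns[my_move][their_move] starts as an empty array on first access,
# so no "does this key exist yet?" branch is needed.
patterns = Hash.new do |outer, mine|
  outer[mine] = Hash.new { |inner, theirs| inner[theirs] = [] }
end

# Hypothetical history: after (rock, paper) the opponent mostly played scissors.
[:scissors, :paper, :scissors, :scissors, :scissors, :paper].each do |next_move|
  patterns[:rock][:paper] << next_move
end

# Find the most frequently observed follow-up and the move that beats it.
seen = patterns[:rock][:paper]
likely = seen.uniq.max_by { |move| seen.count(move) }
puts "expect #{likely}, so play #{MOVE_THAT_BEATS[likely]}" # expect scissors, so play rock
```

One trade-off: with default blocks a missing pattern reads back as an empty array instead of nil, so the “no data yet” check would become empty? rather than nil?.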

Let’s see how our pattern matcher fares!

$ ruby -I . rock_paper_scissors.rb players/always_rock.rb players/pattern_matching.rb
AlwaysRock vs. PatternMatching
AlwaysRock: 1.0
PatternMatching: 999.0
PatternMatching Wins

Not bad! Our pattern matcher correctly determines the simple pattern employed by Always Rock.

$ ruby -I . rock_paper_scissors.rb players/reactive.rb players/pattern_matching.rb
Reactive vs. PatternMatching
Reactive: 3.5
PatternMatching: 996.5
PatternMatching Wins

Bam! Our pattern matcher handily figures out what our simple Reactive player is likely to do. Now for a real challenge.

$ ruby -I . rock_paper_scissors.rb players/random.rb players/pattern_matching.rb
RandomPlayer vs. PatternMatching
RandomPlayer: 491.0
PatternMatching: 509.0
PatternMatching Wins
$ ruby -I . rock_paper_scissors.rb players/random.rb players/pattern_matching.rb
RandomPlayer vs. PatternMatching
RandomPlayer: 518.5
PatternMatching: 481.5
RandomPlayer Wins
# it goes back and forth

So PatternMatching fares well against RandomPlayer but can’t consistently win. Let’s make a player who can consistently and completely defeat RandomPlayer. Let’s make a player who cheats.

Cheater

class Cheater < Player
  def initialize(opponent)
    Kernel.const_get(opponent).class_eval('def choose; :scissors; end')
  end

  def choose
    :rock
  end
end

Mwahaha! Cheater utilizes that so far unused game feature to get the opponent’s name. Using that and some Ruby magic Cheater hypnotizes its opponent into always playing scissors.

Does it work? Absolutely.

$ ruby -I . rock_paper_scissors.rb players/always_rock.rb players/cheater.rb
AlwaysRock vs. Cheater
AlwaysRock: 0
Cheater: 1000
Cheater Wins
$ ruby -I . rock_paper_scissors.rb players/reactive.rb players/cheater.rb
Reactive vs. Cheater
Reactive: 0
Cheater: 1000
Cheater Wins
$ ruby -I . rock_paper_scissors.rb players/pattern_matching.rb players/cheater.rb
PatternMatching vs. Cheater
PatternMatching: 0
Cheater: 1000
Cheater Wins
$ ruby -I . rock_paper_scissors.rb players/random.rb players/cheater.rb
RandomPlayer vs. Cheater
RandomPlayer: 0
Cheater: 1000
Cheater Wins

So what’s going on here? To answer that, let’s dive into irb.

$ irb -I .
1.9.3p0 :001 > require 'rock_paper_scissors'
=> true
1.9.3p0 :002 > require 'players/always_rock'
=> true

Ok, we’re in irb and we’ve got our game and a player loaded. Let’s see what Cheater is up to.

Kernel.const_get(opponent).class_eval('def choose; :scissors; end')

1.9.3p0 :003 > Kernel.const_get('AlwaysRock')
=> AlwaysRock
1.9.3p0 :004 > Kernel.const_get('AlwaysRock').class
=> Class

const_get checks a module for a constant with the given name. A top-level class is a constant defined on Object, and when const_get is called on a module such as Kernel it also searches Object, so we can ask Kernel to find our opponent’s class for us. Which is what we’re doing here.

Basically Kernel.const_get('AlwaysRock') is an easy way to turn the string “AlwaysRock” into a constant.
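Here is the same lookup as a standalone sketch you can run outside the game. The AlwaysRock class below is a bare stand-in (no Player parent) so the snippet runs on its own, and it uses Object.const_get, which finds the same top-level constant as the Kernel.const_get call above.

```ruby
# Bare stand-in class so there is something to look up.
class AlwaysRock
  def choose
    :rock
  end
end

name = "AlwaysRock"            # the game hands players a String like this
klass = Object.const_get(name) # top-level classes are constants on Object

puts klass.inspect             # AlwaysRock
puts klass.equal?(AlwaysRock)  # true
puts klass.new.choose.inspect  # :rock
```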

Now class_eval. This method says to a class, “evaluate this string as if it were written as part of your code.”

1.9.3p0 :005 > String.class_eval('def magic; "magic!"; end')
=> nil
1.9.3p0 :006 > "hi".magic
=> "magic!"
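class_eval also accepts a block instead of a string. The sketch below tries both forms on a throwaway class (Greeter is a hypothetical name for this example) rather than patching String, and shows that an object created before the rewrite picks up the new method too.

```ruby
# Hypothetical throwaway class, so we don't have to patch String.
class Greeter
  def greet
    "hello"
  end
end

greeter = Greeter.new
puts greeter.greet # hello

# String form, as Cheater uses it.
Greeter.class_eval('def greet; "hi"; end')
puts greeter.greet # hi

# Block form: same effect, written as ordinary Ruby code.
Greeter.class_eval do
  def greet
    "hey"
  end
end
puts greeter.greet # hey
```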

Put those pieces together, and our Cheater:

Takes the string of its opponent’s name and turns it into a constant to get at its opponent’s class.
Rewrites its opponent with a new choose method that it controls.
Beats its opponent’s modified choose method.

1.9.3p0 :007 > AlwaysRock.new('Cheater').choose
=> :rock
1.9.3p0 :008 > Kernel.const_get('AlwaysRock').class_eval('def choose; :scissors; end')
=> nil
1.9.3p0 :009 > AlwaysRock.new('Cheater').choose
=> :scissors

Insidious! Even worse, if you’re running a contest with more than two players the cheater permanently modifies its opponents into always playing their redefined choose method.
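To see why the damage outlives the match, here is a small sketch. The Player base class is a stand-in so this runs on its own, and the two “matches” are simulated by constructing players by hand rather than through the real Game.

```ruby
# Stand-in base class (assumption; the real one is in rock_paper_scissors.rb).
class Player
  def initialize(opponent_name); end
end

class AlwaysRock < Player
  def choose
    :rock
  end
end

class Cheater < Player
  def initialize(opponent)
    Kernel.const_get(opponent).class_eval('def choose; :scissors; end')
  end

  def choose
    :rock
  end
end

before = AlwaysRock.new("AlwaysPaper").choose

# Match 1: Cheater is constructed with its opponent's name and rewrites it.
Cheater.new("AlwaysRock")

# Match 2: Cheater is not even playing, yet AlwaysRock is still rewritten.
after = AlwaysRock.new("AlwaysPaper").choose

puts "before: #{before}, after: #{after}" # before: rock, after: scissors
```

The rewrite lives on the class, not on one instance, so every AlwaysRock created afterwards inherits it.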

Let’s level the playing field and force the cheater to play fair.

Level Playing Field

LevelPlayingField avoids being manipulated by the cheater by having a strong will to resist Cheater’s hypnotism. Cheater works by modifying the class of its opponents, so LevelPlayingField doesn’t have a choose method defined by its class. LevelPlayingField defines its own choose method on initialization.

class LevelPlayingField < Player
  def initialize(opponent)
    # prevent cheater from winning by waiting to pick a strategy
    self.class.class_eval do
      def choose
        [:rock, :paper, :scissors].shuffle.first
      end
    end
  end
end

Yes, that’s right. LevelPlayingField uses the same trick that Cheater does, but on its own code to try and protect its own strategy logic.

1.9.3p0 :010 > require 'players/level_playing_field'
=> true
1.9.3p0 :011 > LevelPlayingField.new('Cheater').choose
=> :paper
1.9.3p0 :012 > Kernel.const_get('LevelPlayingField').class_eval('def choose; :scissors; end')
=> nil
1.9.3p0 :011 > LevelPlayingField.new('Cheater').choose
=> :rock
# LevelPlayingField is random so you actually might see :scissors here

$ ruby -I . rock_paper_scissors.rb players/cheater.rb players/level_playing_field.rb
Cheater vs. LevelPlayingField
Cheater: 496.5
LevelPlayingField: 503.5
LevelPlayingField Wins

Now this works, but there’s one flaw. Our LevelPlayingField player only succeeds when initialized after the Cheater.

$ ruby -I . rock_paper_scissors.rb players/level_playing_field.rb players/cheater.rb
LevelPlayingField vs. Cheater
LevelPlayingField: 0
Cheater: 1000
Cheater Wins

If Cheater comes into play after LevelPlayingField has completed his trick to try and isolate his choose method, then the Cheater will still just override LevelPlayingField’s choose method. It’s a remarkably tough strategy to defeat. Any trick we use to cleverly hide our choose method from the Cheater can ultimately be sidestepped by the Cheater supplying a new choose method.
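The order dependence is easy to reproduce in isolation. In this sketch the Player base class is a stand-in, LevelPlayingField is simplified to always play :paper so the outcome is deterministic, and cheat! is a hypothetical helper that does what Cheater’s initialize does.

```ruby
# Stand-in base class (assumption; the real one is in rock_paper_scissors.rb).
class Player
  def initialize(opponent_name); end
end

# Simplified LevelPlayingField: always :paper instead of random, so the
# results below are deterministic.
class LevelPlayingField < Player
  def initialize(opponent)
    self.class.class_eval do
      def choose
        :paper
      end
    end
  end
end

# Hypothetical helper: what Cheater's initialize does to its opponent.
def cheat!(opponent_name)
  Kernel.const_get(opponent_name).class_eval('def choose; :scissors; end')
end

# Cheater initialized first, LevelPlayingField second: the later definition wins.
cheat!("LevelPlayingField")
cheater_first = LevelPlayingField.new("Cheater").choose
puts cheater_first.inspect  # :paper

# LevelPlayingField initialized first, Cheater second: Cheater's definition wins.
victim = LevelPlayingField.new("Cheater")
cheat!("LevelPlayingField")
cheater_second = victim.choose
puts cheater_second.inspect # :scissors
```

Both players write to the same slot, LevelPlayingField#choose, so whoever writes last decides what gets played.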

Even if we write a player who removes the ability for the Cheater to cheat, if Cheater is loaded first then it wins.

class LevelPlayingField < Player
  def initialize(opponent)
    # rewrite class_eval to prevent cheating
    Module.class_eval('def class_eval(obj) end')
  end

  def choose
    [:rock, :paper, :scissors].shuffle.first
  end
end

So let’s bring in a player who always wins. A player who out-cheats the cheater. A player who modifies the game itself. The Batman!

Batman

class Batman < Player
  def initialize(opponent)
    Game.class_eval do
      def play( match )
        match.times do
          next win @player1, :rock, :scissors if @player1.instance_of? Batman
          next win @player2, :rock, :scissors if @player2.instance_of? Batman
        end
      end
    end
  end
end

Batman’s playing strategy is to enter the Game class itself and ensure that he’s the winner no matter what’s been played. Not only that, but Batman completely alters the game for all other player matches. They all get recorded as draws, since only Batman can win.
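Here is a stripped-down sketch of the same trick against a stand-in Game. This Game is heavily simplified and is an assumption for illustration (the real one also asks players for moves and reports results); Joker is a hypothetical opponent, and Batman here takes no constructor argument and has no Player parent.

```ruby
# Simplified stand-in for the quiz's Game class.
class Game
  attr_reader :score1, :score2

  def initialize(player1, player2)
    @player1 = player1
    @player2 = player2
    @score1 = 0
    @score2 = 0
  end

  # Placeholder for the honest rules; Batman replaces this method.
  def play(rounds)
    rounds.times { draw }
  end

  private

  def draw
    @score1 += 0.5
    @score2 += 0.5
  end

  def win(winner)
    if winner.equal?(@player1)
      @score1 += 1
    else
      @score2 += 1
    end
  end
end

class Joker; end # hypothetical opponent

class Batman
  def initialize
    # Reopen Game and replace play wholesale: only Batman can ever score.
    Game.class_eval do
      def play(rounds)
        rounds.times do
          next win(@player1) if @player1.instance_of?(Batman)
          next win(@player2) if @player2.instance_of?(Batman)
        end
      end
    end
  end
end

rigged = Game.new(Joker.new, Batman.new)
rigged.play(1000)
puts "Joker: #{rigged.score1}, Batman: #{rigged.score2}" # Joker: 0, Batman: 1000

bystanders = Game.new(Joker.new, Joker.new)
bystanders.play(1000)
puts "Joker: #{bystanders.score1}, Joker: #{bystanders.score2}" # Joker: 0, Joker: 0
```

Once any Batman has been constructed, matches without him score nothing for either side, which is why they end level.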

$ ruby -I . rock_paper_scissors.rb players/cheater.rb players/batman.rb
Cheater vs. Batman
Cheater: 0
Batman: 1000
Batman Wins
$ ruby -I . rock_paper_scissors.rb players/batman.rb players/cheater.rb
Batman vs. Cheater
Batman: 1000
Cheater: 0
Batman Wins
$ ruby -I . rock_paper_scissors.rb players/batman.rb players/reactive.rb
Batman vs. Reactive
Batman: 1000
Reactive: 0
Batman Wins

Gosh, it makes you almost feel bad for Batman’s opponents.