MIYAO BLOG：宮尾ブログ：SSブログ

	ブログをはじめるログイン

2009年03月14日｜ 2009年11月29日｜2010年08月09日ブログトップ

ゲーム理論の簡単な実験例：A Simple Experiment in Game Theory　[Report]

ゲーム理論に関する授業での実験とその結果

テーマ：「囚人のジレンマ」と繰り返しゲーム
教師：宮尾尊弘（国際教養大学・グローバル・ビジネス・客員教授）
コース：中級ミクロ経済学
日時：2009年11月26日午後5：00～6：15（75分授業）
参加学生：13名（グロバール・ビジネス専攻の3、4年生）

基本的なアイデア:
学生たちを2人1組にして、じゃんけん（ただし「グー」と「パー」のみ）を繰り返し行い、できるだけ多くの点数を取るように戦わせる。教師は学生の1人と組んでじゃんけんをするが、その際に教師は「修正版しっぺ返し戦略」(以下の＊印を参照)を取り、2人の合計点が最大になることを狙う。実験後、結果を発表し、それを踏まえて「囚人のジレンマ」と繰り返しゲームにおける「しっぺ返し戦略」とその修正版の戦略について講義を行う。

実際に行った実験：
ステップ1：
学生たちを2人1組にする。もし学生の数が奇数であれば、教師は学生の1人と組む。もし偶数であれば、予め授業が始まる前に、他の学生に気付かれないように1人の学生に、「修正版しっぺ返し戦略」を取るように話しておく。
ステップ2：
ゲームのルールを説明する。特に、じゃんけんで「グー」（石）と「パー」（紙）のみを使うこと。そして以下のそれぞれの場合に、（　）内の点数が得られることを説明する：
[ケース1] グー（2）対グー（2）、[ケース2] グー（0）対パー（3）、
[ケース3] パー（3）対グー（0）、[ケース4] パー（1）対パー（1）
ステップ 3:
1回目は「グー」を出してお互いに2点ずつを得るようにさせる。それ以降は、グーかパーか好きな方を出してよいこととする。学生に紙を配り、毎回得た点数を書き込むようにさせる。
ステップ4:
毎回、教師は「一、二、三」と叫び、「三」で一斉に学生たちがグーかパーを相手に対して出すようにする。ここで、予め学生たちに何回じゃんけんを行うかを言わないことが重要で、教師が「止め」と叫んだときに止めるように言っておく。
（＊注）
「（修正版）しっぺ返し戦略」：プレーヤーAは、プレーヤーBが裏切るまで（パーを出すまで）、二人にとって最善な戦略（グー）を取り続ける。もしBが裏切ったら（パーを出したら）、Aはその次の回で裏切る（パーを出す）ことで、相手に裏切らないようにという信号を送る。本来の「しっぺ返し戦略」では、それでもBがパーを出し続けたら、Aもパーを出し続けてメッセージを送り続けるが、ここでの「修正版しっぺ返し戦略」では、Bの裏切りに対してAが一度パーを出した後の次の回でAはもとの戦略（グー）に戻る。そして、相手がグーに戻って裏切りをやめればそれ以降はグーのままで、もし相手がまた裏切ればその次の回で同じように裏切って警告を与え、またその次の回では元のグーに戻って相手の出方を見る戦略。
この実験でしっぺ返し戦略の修正版を採用した理由は、学生たちが必ずしも自分が得る点数を最大にすることを目的に行動するとは限らず、相手と自分の得る合計点を最大にしようとする可能性もあったからである。つまりこの場合は、相手の警告に気づきやすく、元の戦略に戻る可能性が高いと考えられたからである。
この実験の主な目的の一つは、できるだけ「囚人のジレンマ」を避けて、自分とともに、自分の組の合計点が最高になるようなよい戦略（必ずしも最善な戦略ではないかもしれないが）を見出すことである。

実験結果：実験が終わり次第判明。直ちに発表。
じゃんけんの回数： 12
最高（組）得点数： 38　（教師と学生の組）
他の組の（組）得点数： 37, 36, 35, 34, 31, 26
最高（個人）得点数： 23　（ただしこの組の総得点は34にとどまる）
他の個人の得点数： 22　（教師と組んだ学生）、
21、20、19、17、17、16（教師）、16、15、14、13、13、11

実験後の授業内容：
1) このゲームが以下のように「囚人のジレンマ」のゲームになっていることを説明：
[利得行列]..................プレーヤーB
.........................戦略1：グー戦略2：パー
プレーヤーA 戦略1：グー 2＼2.....0＼3
..............戦略2：パー 3＼0.....1＼1
2）一回限りのゲームでは、各プレーヤーにとって「グー対グー」というこの組にとって最善の選択から離れて、それぞれもっと利得を高めようとする強い誘因が働き、その結果としてこの組にとって最悪の選択に陥ってしまうことを説明する。
3) しかし繰り返しゲームでは、「修正版しっぺ返し戦略」を取ることで、相手に対して裏切らないようにというシグナルを送ることができる。実際にクラスでの実験で示されたように、この戦略は最悪の選択を避けるために非常に有効であることを説明する。
4) 実際のビジネスでも、「しっぺ返し戦略」やその修正版の戦略はよく採用されており、例えば競争相手が値引きするたびにそれと同じだけ値引きをするなどのビジネス戦略が取られる。学生たちに、他にどのような具体例があるかを考えさせる。
-----------------------------------------------------------------------
参考：実験結果の実例（3組のみ提示）

....回 1 2 3 4 5 6 7 8 9 10 11 12 Total Pair total
教師...2 0 1 2 2 0 2 1 2 0 1 3.....16......38（1番）
学生Ａ 2 3 1 2 2 3 2 1 2 3 1 0.....22（2番）

学生Ｂ 2 1 0 1 3 3 0 0 2 3 0 2.....17......37（2番）
学生Ｃ 2 1 3 1 0 0 3 3 2 0 3 2.....20

学生Ｄ 2 3 2 3 1 1 1 1 1 3 3 2.....23（1番）34
学生Ｅ 2 0 2 0 1 1 1 1 1 0 0 2.....11
-------------------------------------------------------------------------
CLASS EXPERIMENT IN GAME THEORY

Topic: Prisoner’s Dilemma in Repeated Game
Instructor: Takahiro Miyao, Visiting Professor in Global Business Program,
Akita International University (Kokusai Kyoyo Daigaku)
Course: Intermediate Microeconomics (ECN328/338)
Date/Time: November 26, 2009; 5:00-6:15pm (75 minutes)
Participants: 13 undergraduate (junior/senior) students in Global Business
Basic Idea:
Make the students form pairs and play the “janken game of paper and stone” repeatedly for each student to win as many points as possible. The instructor, pairing with a student, uses the “modified tit-for-tat strategy" (see below*) in order to show that his pair may well get the highest number of points than any other pair in class. After the game, the instructor explains the nature of prisoner’s dilemma and the tit for tat strategy (as well as its modified version) in repeated game.
One of the main purposes of this experiment is to find a good, if not the best, strategy to get the highest payoff for him(her)self as well as his/her pair, by avoiding the "prisonners' dilemma" as much as possible.
Actual Experiment:
Step 1:
Make the students form pairs. If there are an odd number of students, the instructor forms a pair with one student. In the case of an even number of students, the instructor should tell one of the students to use the “modified tit-for-tat strategy" before the class starts, so that no other students can notice it.
Step 2:
Explain the rule of the game, specifically, the “janken game of paper, scissors and stone,” where only paper and stone should be allowed to use. Basic rule: the student will get a point in ( ) in each of the following cases:
[Case 1] stone (2) vs. stone (2); [Case 2] stone (0) vs. paper (3),
[Case 3] paper (3) vs. stone (0); [Case 4] paper (1) vs. paper (1)
Step 3:
Tell all the students to choose “stone” in the first round so that they all will get 2 points, but from the second round on they can choose either stone or paper as they like. Give out a sheet of paper to the students and tell them to write down the points they get in each round.
Step 4:
For each round, the instructor says “one, two, three” and exactly when the instructor says “three!”, the students must show his/her choice of stone or paper to each other. It is important NOT to tell the students how many rounds to play, but to tell them to stop when the instructor says “stop!”.
(*) (Modifiied) Tit-for-Tat Strategy: Player A keeps adopting the best joint strategy until player B defects. If player B defects in a certain round, player A will punish player B by defecting in the next round. In the case of the usual tit-for-tat strategy, A will keep punishing B so long as B defects (showing "paper"), but in the case of the modified tit-for-tat strategy, A will return to the original strategy (showing "stone") to see if B returns to the original stategy too. If B does not return to the original strategy, then A will punish B in the next round, and then return to the original strategy to see how B will respond.
The modified version of the tit-for-tat strategy was adopted in this experiment, because individuals were not necessarily motivated to maximize his/her payoff, but could possibly be interested in maximizing the total payoff for the pair, so one might easily get a message from the opponent's warning sign.

Results in Class: Known and announced immediately after the experiment
Number of rounds: 12
Higher number of points for pair: 38 (the instructor’s pair)
Other pairs’ points: 37, 36, 35, 34, 31, 26
Higher number of points for individual: 23 (but total pair points 34)
Other individuals’ points: 22 (the student paring with the instructor),
21, 20, 19, 17, 17, 16 (the instructor), 16, 15, 14, 13, 13, 11

Lecture After the Experiment:
1) The game has the structure of “prisoner’s dilemma,” as shown below:
[Payoff Matrix].............................Player B
................................Strategy 1: Stone Strategy 2: Paper
Player A Strategy 1 Sone.........2＼2..............0＼3
............Strategy 2 Paper........3＼0..............1＼1
2) Explain that in one-time game situations there is a strong incentive for each player to defect from the stone-stone choice (best for the pair) to gain more for him(her)self, resulting in the worst for the pair.
3) In repeated game situation, however, the “modified tit-for-tat strategy" can be used to send a signal to the opponent not to defect. As actually shown in the experiment, this strategy is quite effective in avoiding the worst choice.
4) In reality, many businesses actually seem to be using the tit-for-tat strategy or its modified version such as matching a competitor’s price cut whenever it happens. Make the students think of other real-world examples.
---------------------------------------------------------------------------
APPENDIX: Selected Results in Experiment
.....Round 1 2 3 4 5 6 7 8 9 10 11 12 Total Pair total
Instructor.2 0 1 2 2 0 2 1 2 0 1 3......16.......38 (1st)
Student A 2 3 1 2 2 3 2 1 2 3 1 0......22 (2nd)

Student B 2 1 0 1 3 3 0 0 2 3 0 2......17.......37 (2nd)
Student C 2 1 3 1 0 0 3 3 2 0 3 2......20

Student D 2 3 2 3 1 1 1 1 1 3 3 2......23 (1st) 34
Student E 2 0 2 0 1 1 1 1 1 0 0 2......11

2009-11-29 21:04 nice!(0) コメント(0) トラックバック(3)