Lecture 2: Büchi condition

Teachers: Wieslaw Zielonka

Games and Graphs

  • G(V,E)
  • VVAVB
  • A: Alice, B: Bill
Paths:

the set of finite paths in G

Paths:

the set of infinite paths

p=v1v2i,(vi,vi+1)E
WAPaths:

the set of infinite paths winning for Alice

WB=PathsWAPaths:

the set of infinite paths winning for Bill

Strategy σ for Alice:

is a mapping σ:VVAV such that xv1vnVVA(vn,σ(x))E

(σ,τ)-strategy profile:

if σ (resp. τ) is strategy for Alice (resp. Bill).

A path pv1vn is consistent with the strategy σ of Alice:

if for each i such that viVA, vi+1=σ(v1vi)

If we fix an initial vertex vV, then there exists a unique infinite path p(v,σ,τ) which starts at v and is consistent with σ and τ.

σ is a winning strategy for Alice for the initial vertex v:

if each infinite path p strating at v and consistent with σ belongs to WA.

For a given initial vertex v, σ,τ, or none of them, may be a winning strategy.

The game is determined:

if for each initial vertex, one of the players has a winning strategy.

Memoryless/positional strategies

The strategy σ is memoryless (or positional):

iff for all xyvVVA: σ(x)=σ(v)

σ:VAV(v,σ(v))EvVA

Ex: If σ and τ are memoryless and the graph is finite: all the plays are “lasso” loops in the graph.

Normally, when considering games on graphs, one looks for a partition:

V=WinAvertices winning for AliceWinB(V(WinAWinB))

Moreover, usually: people look for memoryless strategy that are independent on the initial vertex.

OneStepA(X):

set of vertices such that, strating from one of these vertices, Alice can reach X in exactly one move.

OneStepA(X){vVAwX;(v,w)E}{vVBwV,(v,w)EwX}
OneStepA(X)=OneStepA(X)X

and:

{X0XXi+1OneStepA(Xi)

Then

XOneStepA(X)

and if G is finite, there exists k such that Xk+1=Xk.

ReachA(X)iXi

For all vV,

rank(v)min{ivXi}min

Let vReachA(X) such that rank(v)>0.

If

  • vVA, then there exists w s.t. (v,w)E and rank(w)<rank(v)
  • vVB, then for all w s.t. (v,w)E, rank(w)<rank(v)

Büchi condition

Büchi: Alice wins iff the play visits infinitely often a fixed set R of vertices.


If XY:

  • OneStepA(X)OneStepA(Y)
  • ReachA(X)ReachA(Y)

So the following mapping is clearly monotone: YOneStepA(ReachA(Y))

And on top of that:

ReachA(Y)YOneStepA(ReachA(Y))ReachA(Y)

NB: ReachA(Y)Y because there may exist some vY s.t. vOneStepA(ReachA(Y)), that is: for which Bill has a strategy to leave ReachA(Y) in one step.

ϕ(Y)OneStepA(ReachA(YRfixed set of vertices))

Let W s.t. ϕ(W)=W. Then Alice has a strategy to visit R infinitely often.

(Indeed:

ReachA(WR)(WR)W=OneStepA(ReachA((WR)))ReachA((WR))

so ReachA(WR)(WR)=, and ReachA(WR)WR)

Let

Y0V,Y1ϕ(Y0),,Yi+1=ϕ(Yi)

Then by induction: this sequence is decreasing: Yi+1Yii.

So iYi is a fixed point of ϕ. And it is the greatest fixed point. Indeed, if W is another such fixed point. Then WV, and by applying ϕ over and over, we get the result.

Yi+1(YiR)=ReachA(YiR)

Path consistent with Bill’s strategy at YiYi+1

p=v1
  • either i2, viR
  • or there exists j2 s.t. vjYic and all vertices v2,,vj1 do not belong to R.

Forgetting the first moment, the index corresponds to the maximal number of times that Alice can force the visit to R.

Example

Imagine a pie that two players share to eat.

  1. Player 1 proposes a proportion x1 of the pie to player 2
  2. Player can either accept, in which the proportions gotten are (1x1,x1), or rejects.
  3. etc… (as described below)
%3 1 1 2 2 1->2 P1 proposes x_1 (1-x_1, x_1) (1-x_1, x_1) 2->(1-x_1, x_1) P2 accepts 0 0 2->0 P2 rejects (0, 0) (0, 0) 0->(0, 0) 1 - δ 2' 2' 0->2' δ 1' 1' 2'->1' P2 proposes x_2 (x_2, 1-x_2) (x_2, 1-x_2) 1'->(x_2, 1-x_2) P1 accepts 0' 0' 1'->0' P1 rejects 0'->1 δ (0, 0)' (0, 0)' 0'->(0, 0)' 1-δ

With proba 1, the game stops.

Nash equilibria: for each 0x1, the proportion (x,1x) is a Nash equilibrium. Indeed: P1 never accepts less than x, and never proposes more than 1x. Symmetrically for P2.

Can we find a subgame-perfect equilibrium? (where each time P1 (resp. P2) plays, he/she makes the same proportion).

Backward induction: begin at node 1 and go backward (along the δ arrow):

  1. Expectation for each player: (z,1z)
  2. At 0, expectation: (1δ)×(0,0)+δ(z,1z)=(δz,δ(1z))
  3. At 2: P2 proposes (zδ,1zδ), so that P1 accepts and P2’s payoff is 1zδ>(1z)δ
  4. etc…
%25 1 | (1-(δ-zδ²), δ-zδ²) 1 | (1-(δ-zδ²), δ-zδ²) 2 2 1 | (1-(δ-zδ²), δ-zδ²)->2 P1 proposes x_1 (1-x_1, x_1) | (1-(δ-zδ²), δ-zδ²) (1-x_1, x_1) | (1-(δ-zδ²), δ-zδ²) 2->(1-x_1, x_1) | (1-(δ-zδ²), δ-zδ²) P2 accepts 0 | (zδ², δ-zδ²) 0 | (zδ², δ-zδ²) 2->0 | (zδ², δ-zδ²) P2 rejects (0, 0) (0, 0) 0 | (zδ², δ-zδ²)->(0, 0) 1 - δ 2' | (zδ, 1-zδ) 2' | (zδ, 1-zδ) 0 | (zδ², δ-zδ²)->2' | (zδ, 1-zδ) δ 1' 1' 2' | (zδ, 1-zδ)->1' P2 proposes x_2 (x_2, 1-x_2) | (zδ, 1-zδ) (x_2, 1-x_2) | (zδ, 1-zδ) 1'->(x_2, 1-x_2) | (zδ, 1-zδ) P1 accepts 0' | (zδ, (1-z)δ) 0' | (zδ, (1-z)δ) 1'->0' | (zδ, (1-z)δ) P1 rejects 1 1 0' | (zδ, (1-z)δ)->1 δ (0, 0)' (0, 0)' 0' | (zδ, (1-z)δ)->(0, 0)' 1-δ

So now:

(z,1z)=(1δ+zδ2,δzδ2)

Therefore:

z=1δ1δ2=11+δ

Leave a comment