1 / 29

Stackelberg Strategies

Stackelberg Strategies. Algorithmic Game Theory Course Co.RE.Lab. - N.T.U.A. TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A A A. Stackelberg Routing. In (classic) selfish routing all players act selfishly .

smithmarc
Télécharger la présentation

Stackelberg Strategies

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Stackelberg Strategies Algorithmic Game Theory Course Co.RE.Lab. - N.T.U.A. TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAAAAA

  2. Stackelberg Routing • In (classic) selfish routing all players act selfishly. • In Stackelberg routing there exist players willing to cooperate for social welfare (a fraction of the total players). • Both Selfish and Cooperative players are present. • Leader determines the paths of the coordinated players. • Selfish players (followers) minimize their own cost. • Nash Equilibria are considered as the possible outcomes of the game. • A Stackelberg Strategyis an algorithm that allocatespathstocoordinated players so as to lead selfish players to a good Nash Equilibrium.

  3. Example: Pigou’s Network c(x)=x s t One unit of flow is to be routed from s to t c(x)=1

  4. Example: Pigou’s Network c(x)=x Flow = ½ Optimal flow s t One unit of flow is to be routed from s to t c(x)=1 Flow = ½

  5. Example: Pigou’s Network c(x)=x Flow = ½ Optimal flow s t One unit of flow is to be routed from s to t c(x)=1 Flow = ½ (Classic) Nash flow x Flow = 1 s t 1

  6. Example: Pigou’s Network c(x)=x Flow = ½ Optimal flow s t One unit of flow is to be routed from s to t c(x)=1 Flow = ½ Nash flow when a fraction αof (coordinated) playersis sent through the lower edge (Classic) Nash flow x x Flow = 1 Flow = 1-α s t s t 1 1 Flow = α

  7. Example: Braess’s Network x 1 One unit of flow is to be routed from s to t 0 s t x 1

  8. Example: Braess’s Network ½ ½ x 1 One unit of flow is to be routed from s to t 0 s t ½ ½ x 1 Optimal flow

  9. Example: Braess’s Network ½ ½ x 1 One unit of flow is to be routed from s to t 0 s t ½ ½ x 1 Optimal flow (Classic) Nash flow 1 x 1 0 s t 1 1 x

  10. Example: Braess’s Network ½ ½ x 1 One unit of flow is to be routed from s to t 0 s t ½ ½ x 1 Optimal flow Nash flow when a fraction αof coordinated playersis sent through the lower edge (Classic) Nash flow α/2 1 x 1 x 1 1-α 0 0 s t s t 1 1 1 x x α/2

  11. Slightly more formal • We will consider single commodity networks. • An instance in such networks: • Assume that a fraction α of the players are cooperative. • A Stackelberg strategy assigns cooperative players to paths. • They induce a congestion • A new game is “created”: • Where

  12. In the “new” game • Selfish players choose paths (as usual), and Nash flows are considered as the possible outcomes of the game (as usual). • On Equilibrium, selfish players induce a congestion • The Price of Anarchy is

  13. The Central Questions • Given a Stackelberg routing instance, we can ask: • Among all Stackelberg strategies, can we characterize and/or compute the strategy inducing the Stackelberg equilibrium - i.e., the eq. of minimum total latency? • What is the worst-case ratio between the total latency of the Stackelberg eq. and that of the optimal assignment of users to paths?

  14. Finding best strategy: NP-hard Reduction from problem: Given n positive integers is there an satisfying: Given an instance of create an instance of stackelberg routing: • A network G with n+1parallel links • Demand: • ¼ of the players are followers • Cost functions:

  15. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency

  16. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 x s t 2x

  17. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1

  18. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1 4 In Nash Flow players are routed: • 4 to middle edge • 2 to lower edge x s t Flow=4 2x Flow=2 Nash Flow

  19. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1 4 In Nash Flow players are routed: • 4 to middle edge • 2 to lower edge x s t Flow=4 2x Flow=2 Nash Flow

  20. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1

  21. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1 4 Flow =1½ LLF controlling ¼ players, e.g. 1½ units, routes: • 1½ to upper edge x s t Flow=3 2x Flow=1½ Nash Flow

  22. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1 4 Flow =1½ LLF controlling ¼ players, e.g. 1½ units, routes: • 1½ to upper edge x s t Flow=3 2x Flow=1½ Nash Flow

  23. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1

  24. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1 4 Flow =3 LLF controlling ½ players, e.g. 3 units, routes: • 3 to upper edge x s t Flow=2 2x Flow=1 Nash Flow

  25. LLF Strategy • Largest Latency First (LLF): • Compute an optimal configuration • Assign coordinated players to optimal paths of largest latency 6 units to be routed. 4 Flow =3 x Opt routes: • 3 to upper edge • 2 to middle edge • 1 to lower edge Flow =2 s t 2x Flow = 1 4 Flow =3 LLF controlling ½ players, e.g. 3 units, routes: • 3 to upper edge x s t Flow=2 2x Flow=1 Nash Flow

  26. LLF in parallel links Let α be the fraction of the cooperative players. Theorem 1: In parallel links LLF induces an assignment of cost no more than 1/α times the OPT: Proof by induction: When LLF saturates a link we can restrict to the instance that has: • this link deleted and • fraction of players the “remainders” of the previous instance. Some problems: • LLF may fail to saturate any link. No problem: Let m be the “heaviest” link. If L is the cost of selfish players and x* is the optimal assignment, it is • When a link gets saturated selfish users could use it. No problem: There is an induced equilibrium that doesn’t use it.

  27. Networks with UnboundedPoA Theorem: Let and . There is an instance such that for any Stackelberg strategy inducing s, it is: Proof: The network is the following The demands are: (total flow=1) Cost functions: B=1, C=0 and A is

  28. LLF in parallel links Let oe denote the optimal congestion Lemma: The proof follows from the variational inequality, similar to the “classic” result.

  29. LLF in parallel links Let oe denote the optimal congestion Lemma: The proof follows from the variational inequality, similar to the “classic” result. Theorem 2: Proof: and . It is This is maximized for with maximum value

More Related