Understanding Duality in Game Theory: The Knapsack Problem and Its Implications
410 likes | 524 Vues
Explore the concept of duality in game theory through the lens of the Knapsack Problem. This insightful lecture illustrates a scenario where a thief must maximize the value of stolen goods while staying within weight limits. Learn about the linear programming formulation that governs decision-making for the thief and the corresponding dual problem faced by a crime syndicate. The interplay between the primal and dual problems exemplifies key concepts in optimization, showcasing how optimal solutions can be reached through strategic planning and inequality constraints.
Understanding Duality in Game Theory: The Knapsack Problem and Its Implications
E N D
Presentation Transcript
Lecture 4 Duality and gametheory
Knapsack problem – dualityillustration • A thiefrobs a jewelery shop with a knapsack • He cannotcarrytoo much weight • He canchooseamongwelldivisibleobjects (gold, silver, diamondsand) • A thiefwants to takethe most valuablegoodswithhim
The model • Parameters: W – knapsackmaximalweight N – number of goodsin a shop wi – goodi’sweight vi – goodi’svalue • Decisionvariables: xi – share of totalamount of good i taken to theknapsack • Objectivefunction: Maximizethetotalvalue • Constraints: • Cannottakemorethanavailable • Cannottakemorethantheknapsackcapacity • Cannottakethenegative (ifheis a thiefindeed)
The model • Formulate as an LP: Max
Problem of a thief (a primal problem) • Substitute N=3, W=4, w=(2,3,4) i v=(5,20,3) gold, diamondsand and silver. max p.w. A thief problem solution: (x1,x2,x3)=(0.5, 1, 0) Objectivefunctionvalue: 22.5
Analysis • Only one good will be takenpartially (gold). Itis a general ruleinallknapsackproblemswith N divisiblegoods. • Intuition: • Theoptimalsolutionisunique. • In order to uniquelydetermine3 unknowns, we need 3 independent linearequations. • So atleast3 constraintsshould be satisfiedas equalities. • One constraintistheknapsackweight, but anothertwoareaboutgoodsquantities0≤xi≤1. • Henceonly one goodmay be takeninfractionalamountinthe optimum.
Crime syndicate buys out thethief’s business • Thecrime syndicate wants to buy out thegoodsfromthethieftogetherwith his thief business (equipment etc. here: knapsack). • Theyproposeprices y1for gold, y2for diamondsand, y3for silverand y4for 1 kg knapsackcapacity. • But thethiefmayuse 2 kg knapsackcapacity and allthegold to generate 5 units of profit , so thepriceoffered for gold2y4+y1should be atleast5. Similarlywithothergoods. • The syndicate wants to minimizetheamountithas to paythethiefy1+y2+y3+4y4 • Thepricesshould not be negative, otherwisethethief will be insulted.
The syndicate problem (a dual problem) • The syndicate problem may be formulated as follows: min p.w. Syndicate problem solution: (y1,y2,y3,y4)=(0,12.5,0,2.5) Objectivefunctionvalue: 22.5
Thethief problem Isequivalent Becausee.g. Transforming: Becausee.g. Itisequivalent to thesybdicate problem
Syndicate problem optimalsolution: (y1,y2,y3,y4)=(0,12.5,0,2.5) dual prices Optimalobjectuvefunctionvalue: 22.5 Thief problem optimalsolution: (x1,x2,x3)=(0.5, 1, 0) Optimalobjectivefunctionvalue: 22.5
Rozwiązanieproblemusyndyka: (y1,y2,y3,y4)=(0,12.5,0,2.5) cenydualne Optymalnawartośćfunkcjicelu: 22.5 Rozwiązanieproblemuzłodzieja: (x1,x2,x3)=(0.5, 1, 0) Optymalnawartośćfunkcjicelu: 22.5
Matching/assignment http://mathsite.math.berkeley.edu/smp/smp.html
Zero-sum games • In zero-sum games, payoffsineachcell sum up to zero • Movement diagram
Zero-sum games • Minimax = maximin = value of thegame • Thegamemayhavemultiplesaddlepoints
Zero-sum games • Or itmayhave no saddlepoints • To findthevalue of suchgame, considermixedstrategies
Zero-sum games • Ifthereismorestrategies, youdon’tknowwhich one will be part of optimalmixedstrategy. • LetColumnmixedstrategy be (x,1-x) • Then Raw will try to maximize
Zero-sum games • Column will try to choose x to minimizetheupperenvelope
Zero-sum games • TranformintoLinearProgramming
Fishing on Jamaica • In the fifties, Davenport studied a village of 200 people on thesouthshore of Jamaica, whoseinhabitantsmadetheirliving by fishing.
Twenty-six fishing crews in sailing, dugout canoes fish this area [fishing grounds extend outward from shore about 22 miles] by setting fish pots, which are drawn and reset, weather and sea permitting, on three regular fishing days each week … The fishing grounds are divided into inside and outside banks. The inside banks lie from 5-15 miles offshore, while the outside banks all lie beyond … Because of special underwater contours and the location of one prominent headland, very strong currents set across the outside banks at frequent intervals … These currents are not related in any apparent way to weather and sea conditions of the local region. The inside banks are almost fully protected from the currents. [Davenport 1960]
Strategies • Therewere 26 woodencanoes. Thecaptains of thecanoesmightadopt 3 fishingstrategies: • IN – putallpots on theinside banks • OUT – putallpots on theoutside banks • IN-OUT) – putsomepots on theinside banks, somepots on theoutside
Advantages and disadvantages of fishingintheopensea Disadvantages • Ittakesmore time to reach, so fewerspotscan be set • Whenthecurrentisrunning, itisharmful to outsidepots • marksaredraggedaway • potsmay be smashedwhilemoving • changesintemeperaturemaykillfishinsidethepots Advanatages • Theoutside banks producehigherqualityfishbothinvariaties and insize. • If many outsidefishareavailable, theymaydrivetheinsidefishoffthe market. • The OUT and IN-OUT strategiesrequirebettercanoes. • Theircaptainsdominatethe sport of canoe racing, whichisprestigious and offerslargerewards.
Collecting data • Davenport collected the data concerning the fishermenaveragemonthly profit depending on the fishingstrategiestheyused to adopt.
Zero-sum game? Thecurrent’s problem • Thereis no saddle point • Mixedstrategy: • Assumethatthecurrentisvicious and playsstrategy FLOW withprobability p, and NO FLOW withprobability 1-p • Fishermen’sstrategy: IN with prob. q1, OUT with prob. q2, IN-OUT with prob. q3 • For every p, fishermenchoose q1,q2 and q3 thatmaximizes: • And theviciouscurrentchooses p, so thatthefishermenget min
Graphicalsolution of thecurrent’s problem Solution: p=0.31 Mixedstrategy of thecurrent
Thefishermen’s problem • Similarly: • For everyfishermen’sstrategy q1,q2 and q3, theviciouscurrentchooses p so thatthefishermenearntheleast: • Thefishermen will try to choose q1,q2 and q3 to maximizetheirpayoff:
Maximin andminimax Optimalstrategy for thefishermen Value of thegame Optimalstrategy for thecurrent
Forecast and observation Gametheorypredicts Observationshows No fishermenrisksfishingoutside Strategy69% IN, 31% IN-OUT [Payoff: 13.38] Current’s „strategy”: 25% FLOW, 75% NO FLOW • No fishermenrisksfishingoutside • Strategy67% IN, 33% IN-OUT [Payoff: 13.31] • Optimalcurrent’sstrategy31% FLOW, 69% NO FLOW The similarity is striking Davenport’s finding went unchallenged for several years Until …
Currentis not vicious • Kozelka 1969 and Read, Read 1970 pointed out a seriousflaw: • The currentis not a reasoningentityand cannotadjust to fishermenchangingtheirstrategies. • HencefishermenshoulduseExpected Value principle: • Expectedpayoff of the fishermen: • IN: 0.25 x 17.3 + 0.75 x 11.5 = 12.95 • OUT: 0.25 x (-4.4) + 0.75 x 20.6 = 14.35 • IN-OUT: 0.25 x 5.2 + 0.75 x 17.0 = 14.05 • Hence, all of the fishermenshouldfishOUTside. • Maybe, theyare not welladaptedafterall
Currentmay be viciousafterall • The currentdoes not reason, but itisveryrisky to fishoutside. • Evenif the currentruns 25% of the timeON AVERAGE, itmight run considerablymoreor less in the short run of a year. • Suppose one yearit ran 35% of the time. Expectedpayoffs: • IN: 0.35 x 17.3 + 0.65 x 11.5 = 13.53 • OUT: 0.35 x (-4.4) + 0.65 x 11.5 = 11.85 • IN-OUT: 0.35 x 5.2 + 0.65 x 17.0 = 12.87. • By treating the current as theiropponent, fishermenGUARANTEEthemselvespayoff of atleast13.31. • Fishermenpay 1.05 pounds as insurancepremium
Decisionmaking under uncertainty Regretmatrix
Decisionmaking under uncertainty Regretmatrix