Questions tagged [game-theory]

Question 1

The TVD-MI (Total Variation Distance–Mutual Information) framework provides a reference-free method for certifying LLM judges—models that evaluate outputs like code correctness or theorem proofs ...

Question 2

I'm trying to solve the game of chess with a Boolean Satisfiability Sovler. For that, I implement a C/C++ program with some non-deterministic values, convert this program to a Boolean Formula (CNF) ...

Question 3

This specific convergence rate with dominant logarithmic factors is indeed highly desirable for large-scale systems. While there's a rich body of work on stochastic approximation and equilibrium ...

Question 4

Suppose we have a set $N$ of $n$ players and a set $M$ of $m$ items. We are given a matrix $P_{n \times m}$ whose element $p_{ij} \geq 0$ $(i \in N, m \in M)$ denotes the valuation of good $j$ by ...

Question 5

Suppose there are $n$ players $N = \{p_1, \cdots, p_n\}$ and a set $M$ of $m \geq n$ items that are to be claimed. Suppose further that players have strict, additive, ordinal rankings $\succ_{p_j}$ ...

Question 6

I am trying to code a bot that can play ultimate tic tac toe and beat other bots https://en.wikipedia.org/wiki/Ultimate_tic-tac-toe. I am doing so using minimax with alpha-beta pruning to limited ...

Question 7

In fields such as game theory and reinforcement learning, it is standard to consider the regret-minimization strategy. I don't get the motivation for the definition. Yes, doing your best under worst-...

Question 8

I am studying from the MARL textbook by Albrecht, Christianos and Schäfer. They define a stochastic game in Sec 3.3 as the multi-agent version of an MDP. In Fig 3.3 (pg 50) they give an intuition for ...

Question 9

Can anyone point me to the details of a verified Nash equilibrium for Leduc Hold'em? I have an implementation of Counterfactual Regret Minimization (CFR) that I am testing with Leduc, and would like ...

Question 10

I'm trying to solve a variant of Voronoi Game. Given a Graph $G=(V, E)$, with uniform distance $d(E)=1$, a subset of the vertices $\hat{V} \subseteq V$ are vertices of interests. $k$ players take ...

Question 11

Solving $N$-player general sum matrix games is known to be PPAD-complete, and even for two players algorithms such as Lemke-Howson can take exponential time in the worst case. On the other hand, ...

Question 12

I'm currently working on a problem to come up with ideas to solve a stable flow problem but unlike the traditional stable flow problem where every node has preferences on its incoming and outgoing ...

Question 13

Consider the following "game". There is an ant on a strip of length N. The ant can perform the following actions: move left, move right, and paint the cell he is on with white or black. The ...

Question 14

One classic fair division allocation problem is to allocate indivisible goods among agents with respect to envy freeness (no agent envies another agent). In my settings, all the goods are divisible (...

Question 15

I found the following question in my book and I have no clue on what the answer should be: What is the condition on search graph so that BFS Algorithm for graph and BFS Algorithm for tree generate ...

Stack Exchange Network

Questions tagged [game-theory]

Can We Design Incentive-Compatible LLM Judge Certification Beyond Shared Information Invariants? [closed]

How to model adversarial moves (e.g. chess) with Boolean Satisfiability?

Can stochastic optimization algorithms achieve convergence in $\Theta(N^{m^{1/s}} / \log^k N)$ with $m > 1, s > 1, k \gg m$? [closed]

Show that in every pure SPNE, the first agent weakly prefers their own bundle to any other

Show that both the standard and bottom-up greedy strategies give the same outcomes

need board evaluation heuristics for ultimate tic tac toe

Why does it make sense to minimize regret?

In multi-agent RL stochastic games (multi-agent version of an MDP), why does the Nash equilibrium policy only depend on the state, not history?

Nash equilibrium details for Leduc Hold'em

Multiplayer one-round one-move voronoi game on graphs

Examples of challenging general-sum matrix games

Stable Flow Problem with one sided preferences

Game of permutations as a minimization problem

Which good should I split?

BFS on a graph and BFS on a tree

Hot Network Questions