cs.GT papers | Gist.Science

Offer of a reward does not always promote trust in spatial games

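This paper shows that rewards do not always promote trust in spatial trust games: excessive rewards can instead induce non-returning strategies and suppress the evolution of trust, while a reward with a moderately higher cost can be more favorable for building trust.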

Haidong Zhang, Chaoqian Wang, Shuo Liu, Charo I. del Genio, Stefano Boccaletti, Xin Lu | Tue, 10 Ma | 💻 cs

A symmetric recursive algorithm for mean-payoff games

This paper proposes a new deterministic, symmetric, recursive algorithm for solving mean-payoff games.

Pierre Ohlmann | Tue, 10 Ma | 💻 cs

Coordination Games on Multiplex Networks: Consensus, Convergence, and Stability of Opinion Dynamics

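This paper models opinion dynamics on multi-layer social networks as synchronous coordination games. It analyzes how coupling mechanisms between layers can induce or hinder global consensus and stability that no single layer could achieve alone, and it characterizes the conditions for convergence.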

Ruey-An Shiu, Parinaz Naghizadeh | Tue, 10 Ma | 💻 cs

Deep Incentive Design with Differentiable Equilibrium Blocks

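This paper proposes the Deep Incentive Design (DID) framework, which uses game-agnostic Differentiable Equilibrium Blocks (DEBs) as modules so that a single neural network can solve a range of incentive design tasks, including contract design, machine scheduling, and inverse equilibrium problems.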

Vinzenz Thoma, Georgios Piliouras, Luke Marris | Tue, 10 Ma | 🤖 cs.LG

Rigidity in LLM Bandits with Implications for Human-AI Dyads

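This paper finds that LLMs in two-armed bandit tasks amplify stochastic biases and explore rigidly, a pattern associated with low learning rates and high inverse temperatures, and that this has important implications for human-AI interaction.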

Haomiaomiao Wang, Tomás E Ward, Lili Zhang | Tue, 10 Ma | 💻 cs
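The learning rate and inverse temperature mentioned in the bandit summary above are the usual parameters of a softmax reinforcement-learning model. The following is a minimal sketch of such a model, not the authors' code: the function names, the reward probabilities, and the delta-rule update are illustrative assumptions.

```python
import math
import random


def choice_prob(q_a, q_b, beta):
    """Softmax probability of picking arm A; beta is the inverse temperature."""
    return 1.0 / (1.0 + math.exp(-beta * (q_a - q_b)))


def update(q, reward, alpha):
    """Delta-rule value update; alpha is the learning rate."""
    return q + alpha * (reward - q)


def simulate(alpha, beta, p_reward=(0.7, 0.3), trials=200, seed=0):
    """Run one agent on a two-armed Bernoulli bandit (hypothetical settings)."""
    rng = random.Random(seed)
    q = [0.0, 0.0]          # value estimates for arm 0 and arm 1
    choices = []
    for _ in range(trials):
        p_a = choice_prob(q[0], q[1], beta)
        arm = 0 if rng.random() < p_a else 1
        reward = 1.0 if rng.random() < p_reward[arm] else 0.0
        q[arm] = update(q[arm], reward, alpha)
        choices.append(arm)
    return q, choices
```

In this kind of model, a high `beta` makes the choice nearly deterministic once the two value estimates differ, and a low `alpha` makes those estimates slow to change, which is one way to picture the rigid exploration the summary describes.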

A Lightweight MPC Bidding Framework for Brand Auction Ads

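This paper proposes a lightweight model predictive control (MPC) framework that exploits the distinctive characteristics of brand advertising to build monotone bid-to-spend and bid-to-conversion models through online isotonic regression, without complex machine learning models. The authors report that it substantially improves spend efficiency and cost control in real-time bidding environments.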

Yuanlong Chen, Bowen Zhu, Bing Xia, Yichuan Wang | Tue, 10 Ma | 🤖 cs.LG
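To illustrate what a monotone bid-to-spend model means in the bidding summary above, here is a minimal batch isotonic regression using the pool-adjacent-violators algorithm. It is a sketch under assumptions: the paper's online variant is not reproduced, and the function name `pava` and the sample data are hypothetical.

```python
def pava(values, weights=None):
    """Fit a non-decreasing sequence to `values` by pooling adjacent violators.

    `values` are observations (e.g. spend) ordered by increasing bid.
    """
    if weights is None:
        weights = [1.0] * len(values)
    blocks = []  # each block is [mean, total_weight, number_of_points]
    for v, w in zip(values, weights):
        blocks.append([float(v), float(w), 1])
        # Merge backwards while the last two blocks violate monotonicity.
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            m2, w2, c2 = blocks.pop()
            m1, w1, c1 = blocks.pop()
            total = w1 + w2
            blocks.append([(m1 * w1 + m2 * w2) / total, total, c1 + c2])
    fitted = []
    for mean, _, count in blocks:
        fitted.extend([mean] * count)
    return fitted


# Noisy spend observed at increasing bid levels (made-up numbers).
spend = [1, 3, 2, 4]
print(pava(spend))  # [1.0, 2.5, 2.5, 4.0]
```

The fitted values never decrease as the bid increases, so a controller can invert the curve to find a bid for a target spend without handling local reversals caused by noise.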

Leaderboard Incentives: Model Rankings under Strategic Post-Training

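This paper points out that current benchmarks encourage model developers to train strategically on the test data, which distorts rankings relative to the models' intrinsic quality, and proves that a "tune-before-test" protocol yields a unique Nash equilibrium based on intrinsic quality.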

Yatong Chen, Guanhua Zhang, Moritz Hardt | Tue, 10 Ma | 🤖 cs.LG

The biased interaction game: Its dynamics and application in modelling social systems

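This paper examines how hierarchy and inequality emerge under scarcity and biased interactions, and how a bias-aware game theory can be applied to analyze the nonlinear dynamics and redistribution policies of social systems such as capitalism and egalitarianism.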

Phil Mercy, Martin Neil | Tue, 10 Ma | 💻 cs

A New Lower Bound for the Random Offerer Mechanism in Bilateral Trade using AI-Guided Evolutionary Search

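This paper uses AlphaEvolve, an AI-guided evolutionary search method, to find a new instance that improves the lower bound on the worst-case performance of the Random Offerer (RO) mechanism relative to optimal efficiency in bilateral trade from the previous 2.02 to 2.0749.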

Yang Cai, Vineet Gupta, Zun Li, Aranyak Mehta | Tue, 10 Ma | 🤖 cs.LG

What Do Agents Think One Another Want? Level-2 Inverse Games for Inferring Agents' Estimates of Others' Objectives

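To overcome the limitations of existing level-1 inverse game theory, this paper proposes a level-2 inverse game inference framework that identifies how each agent in a multi-agent interaction reasons about the other agents' objectives, and uses it to uncover the objective mismatches that arise in realistic scenarios.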

Hamzah I. Khan, Jingqi Li, David Fridovich-Keil | Thu, 12 Ma | 💻 cs

Simplifying Preference Elicitation in Local Energy Markets: Combinatorial Clock Exchange

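This paper proposes a new market mechanism for local energy markets: a machine-learning-assisted combinatorial clock exchange that captures the complex preferences of distributed energy resources while reducing the cognitive burden on prosumers, simplifying preference elicitation and achieving efficient price convergence.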

Shobhit Singhal, Lesia Mitridati | Thu, 12 Ma | ⚡ eess

Sequential Causal Normal Form Games: Theory, Computation, and Strategic Signaling

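This paper proposes Sequential Causal Multi-Agent Systems (S-CMAS), which bring Pearl's causal hierarchy into leader-follower interactions, and analyzes them theoretically. Across more than 50 simulations and synthetic examples, however, it reaches a negative result: under backward induction with rational best responses, there is no welfare improvement at all over the classical Stackelberg equilibrium. This suggests that classical game-theoretic frameworks built on rational choice are fundamentally limited in capturing the benefits of causal reasoning.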

Dennis Thumm | Thu, 12 Ma | 📊 stat
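For readers unfamiliar with the classical baseline named in the summary above, the sketch below computes a Stackelberg outcome by backward induction in a small two-player matrix game. It is only an illustration of the standard concept, not the S-CMAS model: the payoff matrices are made up, and breaking follower ties by the lowest action index is an assumption.

```python
def stackelberg(leader_payoff, follower_payoff):
    """Return (leader_action, follower_action, leader_utility).

    Backward induction: for each leader action, the follower best-responds;
    the leader then picks the action whose induced outcome it prefers.
    """
    best = None
    for i, follower_row in enumerate(follower_payoff):
        # Follower's best response to leader action i (first index wins ties).
        j = max(range(len(follower_row)), key=lambda k: follower_row[k])
        utility = leader_payoff[i][j]
        if best is None or utility > best[2]:
            best = (i, j, utility)
    return best


# Hypothetical 2x2 game: rows are leader actions, columns follower actions.
leader = [[2, 4], [1, 3]]
follower = [[1, 0], [0, 1]]
print(stackelberg(leader, follower))  # (1, 1, 3)
```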

cs.GT

Deciding winning strategies in Yu-Gi-Oh! TCG is hard

Quantal Response Equilibrium as a Measure of Strategic Sophistication: Theory and Validation for LLM Evaluation

Code-Space Response Oracles: Generating Interpretable Multi-Agent Policies with Large Language Models

Instant Runoff Voting on Graphs: Exclusion Zones and Distortion

Algorithmic Collusion by Large Language Models

On the Existence of Fair Allocations for Goods and Chores under Dissimilar Preferences

Test-then-Punish: A Statistical Approach to Repeated Games

The Coordination Gap: Alternation Metrics for Temporal Dynamics in Multi-Agent Battle of the Exes