Topics
- LLM Routing and Aggregation
- LLM-based Agent
- LLM-based MAS
- LLM Reasoning and Reinforcement Learning
- Multi-Agent Learning in Games
- Cooperation in Social Dilemmas
- LLM Meets Psychology
LLM Routing and Aggregation
Leveraging multiple LLMs to improve collective performance and efficiency.
- Beyond Gemini-3-Pro: Revisiting LLM Routing and Aggregation at Scale (arXiv)
- LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing (arXiv)
- Beyond GPT-5: Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing (paper)
- ICL-Router: In-Context Learned Model Representations for LLM Routing (arXiv)
- Do We Truly Need So Many Samples? Multi-LLM Repeated Sampling Efficiently Scales Test-Time Compute (arXiv)
- The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants (arXiv)
- Open-Source LLMs Collaboration Beats Closed-Source LLMs: A Scalable Multi-Agent System (arXiv)
- Learning Compact Representations of LLM Abilities via Item Response Theory (arXiv)
- Decouple and Orthogonalize: A Data-Free Framework for LoRA Merging (arXiv)
- Nature-Inspired Population-Based Evolution of Large Language Models (arXiv)
- A Comprehensive Survey of LLM-Driven Collective Intelligence: Past, Present, and Future (paper)
LLM-based Agent
Agent architectures, memory systems, workflow generation, and long-horizon task execution for LLM-based agents.
- Single-Agent Scaling Fails Multi-Agent Intelligence: Towards Foundation Models with Native Multi-Agent Intelligence (arXiv)
- If Multi-Agent Debate is the Answer, What is the Question? (arXiv)
- MemVerse: Multimodal Memory for Lifelong Learning Agents (arXiv)
- EvoFlow: Evolving Diverse Agentic Workflows On The Fly (arXiv)
- PerPilot: Personalizing VLM-based Mobile Agents via Memory and Exploration (arXiv)
- Organizing, Orchestrating, and Benchmarking Agent Skills at Ecosystem Scale (arXiv)
- InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery (arXiv)
- Hands-on LLM-based Agents: A Tutorial for General Audiences (paper)
LLM-based MAS
Multi-agent mechanisms built with LLM-based agents, focusing on norms, reputation, and social interaction.
- Beyond the Tragedy of the Commons: Building A Reputation System for Generative Multi-agent Systems (arXiv)
- Emergence of Social Norms in Generative Agent Societies: Principles and Architecture (arXiv)
- OASIS: Open Agent Social Interaction Simulations with One Million Agents (arXiv)
- An AI Researchers' Perspective: At the Crossroad of LLMs, Agent-Based Modeling, and Complex Systems (paper)
LLM Reasoning and Reinforcement Learning
Methods that improve LLM reasoning, mostly through the lens of reinforcement learning.
- Reinforcement Learning for Large Language Models via Group Preference Reward Shaping (paper)
- Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model (arXiv)
- The Path of Self-Evolving Large Language Models: Achieving Data-Efficient Learning via Intrinsic Feedback (arXiv)
- Scaling Physical Reasoning with the PHYSICS Dataset (arXiv)
- ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning (arXiv)
- PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System (arXiv)
- MARTI-MARS: Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation (arXiv)
Multi-Agent Learning in Games
Theoretical analyses of multi-agent learning dynamics and equilibrium behavior.
- A Formal Model for Multiagent Q-Learning on Graphs (paper)
- Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning (paper)
- Payoff Control in Multichannel Games: Influencing Opponent Learning Evolution (paper)
- Regret Minimization in Population Network Games: Vanishing Heterogeneity and Convergence to Equilibria (paper)
- The Stochastic Evolutionary Dynamics of Softmax Policy Gradient in Games (paper)
- A Pair-Approximation Method for Modelling the Dynamics of Multi-Agent Stochastic Games (record)
- Learning by Reusing Previous Advice: A Memory-Based Teacher-Student Framework (paper)
- The Best of Both Worlds in Network Population Games: Reaching Consensus and Convergence to Equilibrium (paper)
- Individual-Level Inverse Reinforcement Learning for Mean Field Games (paper)
- Modelling the Dynamics of Multi-Agent Q-Learning: The Stochastic Effects of Local Interaction and Incomplete Information (paper)
- Modelling the Dynamics of Regret Minimization in Large Agent Populations: A Master Equation Approach (paper)
- The Dynamics of Q-Learning in Population Games: A Physics-Inspired Continuity Equation Model (paper)
- Dynamics of Q-Learning in Networked Stochastic Games (paper)
Cooperation in Social Dilemmas
Understanding cooperation emergence in social dilemma games.
- A Successful Strategy for Multichannel Iterated Prisoner's Dilemma (paper)
- Multi-Agent, Human-Agent and Beyond: A Survey on Cooperation in Social Dilemmas (paper)
- Emergence of Punishment in Social Dilemma with Environmental Feedback (record)
- Facilitating Cooperation in Human-Agent Hybrid Populations through Autonomous Agents (paper)
- How Committed Individuals Shape Social Dynamics: A Survey on Coordination Games and Social Dilemma Games (paper)
- Beyond a Binary Theorizing of Prosociality (paper)
- Large Language Models Overcome the Machine Penalty When Acting Fairly but Not When Acting Selfishly or Altruistically (arXiv)
LLM Meets Psychology
- Large Language Models are Near-Optimal Decision-Makers with a Non-Human Learning Behavior (arXiv)