PEOPLEJOIN is a benchmark for evaluating LM-mediated collaborative problem solving. Given a user request, PEOPLEJOIN agents must identify teammates who might be able to assist, converse with these teammates to gather information, and finally compile a useful answer or summary for the original user. PEOPLEJOIN comprises two evaluation domains: PEOPLEJOIN-QA, focused on questions about tabular data, and PEOPLEJOIN-DOCCREATION, focused on document creation tasks. The two domains are adapted from existing NLP benchmarks for database question answering and multi-document summarization; here, however, the information needed to complete these tasks is distributed across synthetic “organizations” of 2–20 users, simulating natural multi-user collaboration scenarios. We implemented several popular LM agent architectures, and report their accuracy and efficiency at completing tasks.
- Install Python 3.11.
  One way is to use pyenv (for Linux/macOS) or pyenv-win (for Windows).
  On Linux/macOS, run
    pyenv install --list | grep '^ *3.11' | tail -n1
  to discover the most recent patch release of Python 3.11.
  On Windows, run
    pyenv install --list
  and manually check for the latest patch release of Python 3.11.
  Then run
    pyenv install 3.11.X
  where X is the latest patch release available.
- Install Poetry 1.5 or later, following the Poetry installation instructions.
  On Windows, you can use the following command in PowerShell:
    (Invoke-WebRequest -Uri https://install.python-poetry.org -UseBasicParsing).Content | python
- Configure Poetry to use your Python 3.11 installation.
  - If using the pyenv setup above, run
    poetry env use $(pyenv prefix 3.11.X)/bin/python
  - Otherwise, run
    poetry env use <path to your Python 3.11 binary>
- Run
    poetry install
  to install the dependencies.
- Run
    poetry shell
  to activate the venv.
- Run
    make backend
  to start a back-end server at http://127.0.0.1:8000/.
  - To use the Makefile on Windows, you will have to install make. One way to do that is through Chocolatey.
  - To install Chocolatey, run the following command in an administrative PowerShell session:
Set-ExecutionPolicy Bypass -Scope Process -Force; [System.Net.ServicePointManager]::SecurityProtocol = [System.Net.ServicePointManager]::SecurityProtocol -bor 3072; iex ((New-Object System.Net.WebClient).DownloadString('https://community.chocolatey.org/install.ps1'))
  - Once installed, restart PowerShell and run
    choco install make
    You can verify your installation using
    make -v
  - You can provide a config name, such as
    make backend AGENT_CONF=src/async_collab/scenarios/people_join_qa/agent_configs/spider_sample.json
- Make any changes to the LLM API interface as needed in
    src/async_collab/llm/llm_client_service.py
- See
    workspace/
  for experiment scripts.
- src/: Python code for the project.
- tests/: Unit tests for code in src/.
- data/: Relevant data files.
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.
@misc{jhamtani2025peoplejoin,
title={LM Agents for Coordinating Multi-User Information Gathering},
author={Harsh Jhamtani and Jacob Andreas and Benjamin Van Durme},
year={2025}
}