MCP Playground | Paper | Docs
MS-Agent is a lightweight framework designed to empower agents with autonomous exploration capabilities. It provides a flexible and extensible architecture that allows developers to create agents capable of performing complex tasks, such as code generation, data analysis, and tool calling for general purposes with MCP (Model Calling Protocol) support.
- Multi-Agent for general purpose: Chat with agent with tool-calling capabilities based on MCP.
- Deep Research: To enable advanced capabilities for autonomous exploration and complex task execution.
- Code Generation: Supports code generation tasks with artifacts.
- Lightweight and Extensible: Easy to extend and customize for various applications.
[WARNING] For historical archive versions, please refer to: https://github.com/modelscope/ms-agent/tree/0.8.0
| WeChat Group | 
|---|
|  | 
- 
🚀Sep 22, 2025: Release MS-Agent v1.3.0, which includes the following updates: - 🔥 Support Code Scratch
- Support Memoryfor building agents with long-term and short-term memory
- Enhance the DeepResearch workflow
- Support RAY for accelerating document information extraction
- Support Anthropic API format for LLMs
 
- 
🚀Aug 28, 2025: Release MS-Agent v1.2.0, which includes the following updates: - DocResearch now supports pushing to ModelScope、HuggingFace、GitHubfor easy sharing of research reports. Refer to Doc Research for more details.
- DocResearch now supports exporting the Markdown report to HTML、PDF、PPTXandDOCXformats, refer to Doc Research for more details.
- DocResearch now supports TXTfile processing and file preprocessing, refer to Doc Research for more details.
 
- DocResearch now supports pushing to 
- 
🚀July 31, 2025: Release MS-Agent v1.1.0, which includes the following updates: - 🔥 Support Doc Research, demo: DocResearchStudio
- Add General Web Search Enginefor Agentic Insight (DeepResearch)
- Add Max Continuous Runsfor Agent chat with MCP.
 
- 
🚀July 18, 2025: Release MS-Agent v1.0.0, improve the experience of Agent chat with MCP, and update the readme for Agentic Insight. 
- 
🚀July 16, 2025: Release MS-Agent v1.0.0rc0, which includes the following updates: - Support for Agent chat with MCP (Model Context Protocol)
- Support for Deep Research (Agentic Insight), refer to: Report_Demo, Script_Demo
- Support for MCP-Playground
- Add callback mechanism for Agent chat
 
Archive
- 🔥🔥🔥Aug 8, 2024: A new graph based code generation tool CodexGraph is released by Modelscope-Agent, it has been proved effective and versatile on various code related tasks, please check example.
- 🔥🔥Aug 1, 2024: A high efficient and reliable Data Science Assistant is running on Modelscope-Agent, please find detail in example.
- 🔥July 17, 2024: Parallel tool calling on Modelscope-Agent-Server, please find detail in doc.
- 🔥June 17, 2024: Upgrading RAG flow based on LLama-index, allow user to hybrid search knowledge by different strategies and modalities, please find detail in doc.
- 🔥June 6, 2024: With Modelscope-Agent-Server, Qwen2 could be used by OpenAI SDK with tool calling ability, please find detail in doc.
- 🔥June 4, 2024: Modelscope-Agent supported Mobile-Agent-V2arxiv,based on Android Adb Env, please check in the application.
- 🔥May 17, 2024: Modelscope-Agent supported multi-roles room chat in the gradio.
- May 14, 2024: Modelscope-Agent supported image input in RolePlayagents with latest OpenAI modelGPT-4o. Developers can experience this feature by specifying theimage_urlparameter.
- May 10, 2024: Modelscope-Agent launched a user-friendly Assistant API, and also provided aTools APIthat executes utilities in isolated, secure containers, please find the document
- Apr 12, 2024: The Ray version of multi-agent solution is on modelscope-agent, please find the document
- Mar 15, 2024: Modelscope-Agent and the AgentFabric (opensource version for GPTs) is running on the production environment of modelscope studio.
- Feb 10, 2024: In Chinese New year, we upgrade the modelscope agent to version v0.3 to facilitate developers to customize various types of agents more conveniently through coding and make it easier to make multi-agent demos. For more details, you can refer to #267 and #293 .
- Nov 26, 2023: AgentFabric now supports collaborative use in ModelScope's Creation Space, allowing for the sharing of custom applications in the Creation Space. The update also includes the latest GTE text embedding integration.
- Nov 17, 2023: AgentFabric released, which is an interactive framework to facilitate creation of agents tailored to various real-world applications.
- Oct 30, 2023: Facechain Agent released a local version of the Facechain Agent that can be run locally. For detailed usage instructions, please refer to Facechain Agent.
- Oct 25, 2023: Story Agent released a local version of the Story Agent for generating storybook illustrations. It can be run locally. For detailed usage instructions, please refer to Story Agent.
- Sep 20, 2023: ModelScope GPT offers a local version through gradio that can be run locally. You can navigate to the demo/msgpt/ directory and execute bash run_msgpt.sh.
- Sep 4, 2023: Three demos, demo_qwen, demo_retrieval_agent and demo_register_tool, have been added, along with detailed tutorials provided.
- Sep 2, 2023: The preprint paper associated with this project was published.
- Aug 22, 2023: Support accessing various AI model APIs using ModelScope tokens.
- Aug 7, 2023: The initial version of the modelscope-agent repository was released.
# For the basic functionalities
pip install ms-agent
# For the deep research functionalities
pip install 'ms-agent[research]'git clone https://github.com/modelscope/ms-agent.git
cd ms-agent
pip install -e .Warning
As the project has been renamed to ms-agent, for versions v0.8.0 or earlier, you can install using the following command:
pip install modelscope-agent<=0.8.0To import relevant dependencies using modelscope_agent:
from modelscope_agent import ...This project supports interaction with models via the MCP (Model Context Protocol). Below is a complete example showing how to configure and run an LLMAgent with MCP support.
✅ Chat with agents using the MCP protocol: MCP Playground
By default, the agent uses ModelScope's API inference service. Before running the agent, make sure to set your ModelScope API key.
export MODELSCOPE_API_KEY={your_modelscope_api_key}You can find or generate your API key at https://modelscope.cn/my/myaccesstoken.
import asyncio
from ms_agent import LLMAgent
# Configure MCP servers
mcp = {
  "mcpServers": {
    "fetch": {
      "type": "streamable_http",
      "url": "https://mcp.api-inference.modelscope.net/{your_mcp_uuid}/mcp"
    }
  }
}
async def main():
    # Use json to configure MCP
    llm_agent = LLMAgent(mcp_config=mcp)   # Run task
    await llm_agent.run('Introduce modelscope.cn')
if __name__ == '__main__':
    # Start
    asyncio.run(main())💡 Tip: You can find available MCP server configurations at modelscope.cn/mcp.
For example: https://modelscope.cn/mcp/servers/@modelcontextprotocol/fetch.
Replace the url in mcp["mcpServers"]["fetch"] with your own MCP server endpoint.
Memory
We support memory by using mem0 in version v1.3.0! 🎉
Below is a simple example to get you started. For more comprehensive test cases, please refer to the test_case.
Before running the agent, ensure that you have set your ModelScope API key for LLM.
pip install mem0ai
export MODELSCOPE_API_KEY={your_modelscope_api_key}
export DASHSCOPE_API_KEY={your_dashscope_api_key}You can obtain or generate your API keys at:
Example Usage
This example demonstrates how the agent remembers user preferences across sessions using persistent memory:
import uuid
import asyncio
from omegaconf import OmegaConf
from ms_agent.agent.loader import AgentLoader
async def main():
    random_id = str(uuid.uuid4())
    default_memory = OmegaConf.create({
        'memory': [{
            'path': f'output/{random_id}',
            'user_id': 'awesome_me'
        }]
    })
    agent1 = AgentLoader.build(config_dir_or_id='ms-agent/simple_agent', config=default_memory)
    agent1.config.callbacks.remove('input_callback')  # Disable interactive input for direct output
    await agent1.run('I am a vegetarian and I drink coffee every morning.')
    del agent1
    print('========== Data preparation completed, starting test ===========')
    agent2 = AgentLoader.build(config_dir_or_id='ms-agent/simple_agent', config=default_memory)
    agent2.config.callbacks.remove('input_callback')  # Disable interactive input for direct output
    res = await agent2.run('Please help me plan tomorrow’s three meals.')
    print(res)
    assert 'vegan' in res[-1].content.lower() and 'coffee' in res[-1].content.lower()
asyncio.run(main())This project provides a framework for Deep Research, enabling agents to autonomously explore and execute complex tasks.
- 
Autonomous Exploration - Autonomous exploration for various complex tasks 
- 
Multimodal - Capable of processing diverse data modalities and generating research reports rich in both text and images. 
- 
Lightweight & Efficient - Support "search-then-execute" mode, completing complex research tasks within few minutes, significantly reducing token consumption. 
Here is a demonstration of the Agentic Insight framework in action, showcasing its capabilities in handling complex research tasks efficiently.
- 
User query 
- 
- Chinese:
 
在计算化学这个领域,我们通常使用Gaussian软件模拟各种情况下分子的结构和性质计算,比如在关键词中加入'field=x+100'代表了在x方向增加了电场。但是,当体系是经典的单原子催化剂时,它属于分子催化剂,在反应环境中分子的朝向是不确定的,那么理论模拟的x方向电场和实际电场是不一致的。
请问:通常情况下,理论计算是如何模拟外加电场存在的情况?
- 
- English:
 
In the field of computational chemistry, we often use Gaussian software to simulate the structure and properties of molecules under various conditions. For instance, adding 'field=x+100' to the keywords signifies an electric field applied along the x-direction. However, when dealing with a classical single-atom catalyst, which falls under molecular catalysis, the orientation of the molecule in the reaction environment is uncertain. This means the x-directional electric field in the theoretical simulation might not align with the actual electric field.
So, how are external electric fields typically simulated in theoretical calculations?
agentic_insight_case_chem_0718a.mp4
For more details, please refer to Deep Research.
This project provides a framework for Doc Research, enabling agents to autonomously explore and execute complex tasks related to document analysis and research.
- 🔍 Deep Document Research - Support deep analysis and summarization of documents
- 📝 Multiple Input Types - Support multi-file uploads and URL inputs
- 📊 Multimodal Reports - Support text and image reports in Markdown format
- 🚀 High Efficiency - Leverage powerful LLMs for fast and accurate research, leveraging key information extraction techniques to further optimize token usage
- ⚙️ Flexible Deployment - Support local run and ModelScope Studio
- 💰 Free Model Inference - Free LLM API inference calls for ModelScope users, refer to ModelScope API-Inference
1. ModelScope Studio DocResearchStudio
2. Local Gradio Application
- Research Report for UniME: Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
For more details, refer to Doc Research
This project provides a framework for Code Scratch, enabling agents to autonomously generate code projects.
- 🎯 Complex Code Generation - Support for complex code generation tasks, especially React frontend and Node.js backend
- 🔧 Customizable Workflows - Enable users to freely develop their own code generation workflows tailored to specific scenarios
- 🏗️ Three-Phase Architecture - Design & Coding Phase followed by Refine Phase for robust code generation and error fixing
- 📁 Intelligent File Grouping - Automatically groups related code files to minimize dependencies and reduce bugs
- 🔄 Auto Compilation & Fixing - Automatic npm compilation with intelligent error analysis and iterative fixing
AI Workspace Homepage
Generate a complete ai workspace homepage with the following command:
PYTHONPATH=. openai_api_key=your-api-key openai_base_url=your-api-url python ms_agent/cli/cli.py run --config projects/code_scratch --query 'Build a comprehensive AI workspace homepage' --trust_remote_code trueThe generated code will be output to the output folder in the current directory.
Architecture Workflow:
- Design Phase: Analyze requirements → Generate PRD & module design → Create implementation tasks
- Coding Phase: Execute coding tasks in intelligent file groups → Generate complete code structure
- Refine Phase: Auto-compilation → Error analysis → Iterative bug fixing → Human evaluation loop
For more details, refer to Code Scratch.
- A news collection agent ms-agent/newspaper
This project is licensed under the Apache License (Version 2.0).

