[Feature] Add agent for camp3 (InternLM#1567)

fanqiNO1 · web-flow · commit f7fc6384a827 · 2024-08-07T10:17:50.000+08:00
* [Feature] Add agent for camp3
diff --git a/README.md b/README.md
@@ -38,7 +38,7 @@
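 ||Level name|Materials|Reward|
 |:-----|:----|:----|:-----|
 |Level 1| Exploring the capability boundaries of InternLM models | Task, docs, video | 100 yuan in compute credits |
-|Level 2| Customize your own agent with Lagent | Task, docs, video | 100 yuan in compute credits |
+|Level 2| Customize your own agent with Lagent | [Task](docs/L2/Lagent/task.md), [docs](docs/L2/Lagent/readme.md), video | 100 yuan in compute credits |
 |Level 3| Advanced quantization and deployment with LMDeploy | Task, docs, video | 100 yuan in compute credits |
 |Level 4| Deploying and fine-tuning the InternVL multimodal model | Task, docs, video | 100 yuan in compute credits |
 |Level 5| HuixiangDou: an enterprise-grade knowledge-base Q&A tool | Task, docs, video | 100 yuan in compute credits |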
diff --git a/docs/L2/Lagent/readme.md b/docs/L2/Lagent/readme.md
@@ -1 +1,214 @@
+![head](https://github.com/user-attachments/assets/4a9cd258-07c0-41db-a192-1442df9dd098)
 
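+# Customize Your Own Agent with Lagent
+
+## Introduction to Lagent
+
+Lagent is a lightweight open-source agent framework designed to let users efficiently build agents based on large language models. It also provides a set of typical tools that extend what large language models can do.
+
+Lagent currently supports several classic agent paradigms, including AutoGPT and ReAct, as well as the following tools:
+
+- Arxiv search
+- Bing Maps
+- Google Scholar search
+- Google search
+- Interactive IPython interpreter
+- IPython interpreter
+- PPT
+- Python interpreter
+
+Its basic structure is shown below: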
+
+![image](https://github.com/InternLM/lagent/assets/24351120/cefc4145-2ad8-4f80-b88b-97c05d1b9d3e)
+
+## Environment Setup
+
+For the dev machine, choose the 30% A100 option and the Cuda12.2-conda image.
+
+First, set up a working environment for Lagent.
+
+```bash
+# Create the environment
+conda create -n agent_camp3 python=3.10 -y
+# Activate the environment
+conda activate agent_camp3
+# Install torch
+conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=12.1 -c pytorch -c nvidia -y
+# Install the other dependencies
+pip install termcolor==2.4.0
+pip install lmdeploy==0.5.2
+```
+
+Next, install lagent from source.
+
+```bash
+# Create a directory to hold the code
+mkdir -p /root/agent_camp3
+cd /root/agent_camp3
+git clone https://github.com/InternLM/lagent.git
+cd lagent && git checkout 81e7ace && pip install -e . && cd ..
+```
+
+## Using the Lagent Web Demo
+
+Next, we use the Lagent Web Demo to try out the agent capabilities of InternLM2.5-7B-Chat.
+
+First, deploy InternLM2.5-7B-Chat with LMDeploy and start an API server.
+
+```bash
+conda activate agent_camp3
+lmdeploy serve api_server /share/new_models/Shanghai_AI_Laboratory/internlm2_5-7b-chat --model-name internlm2_5-7b-chat
+```
+
+![lmdeploy server](https://github.com/user-attachments/assets/7765bce4-50bc-4204-b217-ab1bb5a269ef)
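
Before moving on, it can help to confirm that the API server is answering and that the served model name matches the `--model-name` passed above. The following is a minimal sketch, not part of the original tutorial. It assumes the server listens on `127.0.0.1:23333` (the port used later in this document) and exposes an OpenAI-style `GET /v1/models` endpoint returning `{"data": [{"id": ...}]}`. `model_ids` and `list_models` are hypothetical helper names.

```python
import json
import urllib.request


def model_ids(payload: dict) -> list[str]:
    """Extract the model names from an OpenAI-style model listing."""
    return [entry["id"] for entry in payload.get("data", []) if "id" in entry]


def list_models(base_url: str = "http://127.0.0.1:23333",
                timeout: float = 5.0) -> list[str]:
    """Ask the running server which models it serves.

    Assumption: the server answers GET /v1/models with a JSON body shaped
    like {"data": [{"id": "<model name>"}, ...]}.
    """
    with urllib.request.urlopen(f"{base_url}/v1/models", timeout=timeout) as resp:
        return model_ids(json.load(resp))
```

If the server was started as shown above, calling `list_models()` on the dev machine would be expected to include `internlm2_5-7b-chat`. A connection error usually means the server has not finished starting.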
+
+Then, in another terminal window, start the Lagent Web Demo.
+
+```bash
+cd /root/agent_camp3/lagent
+conda activate agent_camp3
+streamlit run examples/internlm2_agent_web_demo.py
+```
+
+![lagent web demo](https://github.com/user-attachments/assets/1b7af814-78d9-458a-8c3b-eaa39a588638)
+
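+Once both servers have fully started (as shown in the screenshots below), run the following command in PowerShell on your **local** machine to set up port forwarding:
+
+```bash
+ssh -CNg -L 8501:127.0.0.1:8501 -L 23333:127.0.0.1:23333 root@ssh.intern-ai.org.cn -p <your SSH port>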
+```
+
+| LMDeploy api_server | Lagent Web Demo | 
+| --- | --- |
+| ![LMDeploy done](https://github.com/user-attachments/assets/820f4ceb-4337-484f-997b-001d5532816a) | ![Lagent done](https://github.com/user-attachments/assets/b9ccff2b-6e05-4c4e-b85b-860f8b9e2f41) |
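
Before opening the browser, it may be worth checking that both forwarded ports accept connections on the local machine. The following is a minimal sketch using only the Python standard library. `port_open` and `check_forwarded_ports` are hypothetical helper names, and the port numbers are taken from the `ssh` command above.

```python
import socket


def port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def check_forwarded_ports(ports=(8501, 23333), host="127.0.0.1") -> dict:
    """Map each port to whether it currently accepts connections.

    8501 is the port forwarded for the Web Demo and 23333 the one for the
    LMDeploy api_server, per the ssh command above.
    """
    return {port: port_open(host, port) for port in ports}
```

A `False` entry suggests that the tunnel is not up or that the corresponding server has not finished starting on the dev machine.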
+
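+Next, open `localhost:8501` in your local browser, set the **模型名称** (model name) field to `internlm2_5-7b-chat`, and set the **模型 ip** (model IP) field to `127.0.0.1:23333`.
+
+> [!IMPORTANT]
+> Press Enter after typing each value to confirm it!
+
+Then select `ArxivSearch` in the plugin selection field and enter the instruction “帮我搜索一下 MindSearch 论文” (“Help me search for the MindSearch paper”).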
+
+![Web Demo](https://github.com/user-attachments/assets/34ac1001-8bfa-4d2a-8346-d871a0e0f03c)
+
+Finally, you can see that the model has replied with the relevant information.
+
+![result](https://github.com/user-attachments/assets/d21b64c2-acf4-48e1-a1e5-73775e6b36d4)
+
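+## Building a Custom Agent with Lagent
+
+In this section, we walk through customizing your own agent with Lagent.
+
+Lagent's documentation on tools is available at https://lagent.readthedocs.io/zh-cn/latest/tutorials/action.html .
+
+Defining a custom tool in Lagent takes the following steps:
+
+1. Inherit from the `BaseAction` class
+2. For a simple tool, implement the `run` method; for a toolkit, implement each sub-tool's functionality
+3. Decorating a simple tool's `run` method with `tool_api` is optional; every sub-tool method in a toolkit must be decorated with `tool_api`
+
+Below, we implement a tool that calls the MagicMaker API to perform text-to-image generation.
+
+First, create the tool file: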
+
+```bash
+cd /root/agent_camp3/lagent
+touch lagent/actions/magicmaker.py
+```
+
+Then copy the following code into `/root/agent_camp3/lagent/lagent/actions/magicmaker.py`:
+
+```python
+import json
+import requests
+
+from lagent.actions.base_action import BaseAction, tool_api
+from lagent.actions.parser import BaseParser, JsonParser
+from lagent.schema import ActionReturn, ActionStatusCode
+
+
+class MagicMaker(BaseAction):
+    styles_option = [
+        'dongman',  # anime
+        'guofeng',  # traditional Chinese style
+        'xieshi',   # realistic
+        'youhua',   # oil painting
+        'manghe',   # blind-box figurine
+    ]
+    aspect_ratio_options = [
+        '16:9', '4:3', '3:2', '1:1',
+        '2:3', '3:4', '9:16'
+    ]
+
+    def __init__(self,
+                 style='guofeng',
+                 aspect_ratio='4:3'):
+        super().__init__()
+        if style in self.styles_option:
+            self.style = style
+        else:
+            raise ValueError(f'The style must be one of {self.styles_option}')
+        
+        if aspect_ratio in self.aspect_ratio_options:
+            self.aspect_ratio = aspect_ratio
+        else:
+            raise ValueError(f'The aspect ratio must be one of {self.aspect_ratio_options}')
+    
+    @tool_api
+    def generate_image(self, keywords: str) -> dict:
+        """Run magicmaker and get the generated image according to the keywords.
+
+        Args:
+            keywords (:class:`str`): the keywords to generate image
+
+        Returns:
+            :class:`dict`: the generated image
+                * image (str): path to the generated image
+        """
+        try:
+            response = requests.post(
+                url='https://magicmaker.openxlab.org.cn/gw/edit-anything/api/v1/bff/sd/generate',
+                data=json.dumps({
+                    "official": True,
+                    "prompt": keywords,
+                    "style": self.style,
+                    "poseT": False,
+                    "aspectRatio": self.aspect_ratio
+                }),
+                headers={'content-type': 'application/json'}
+            )
+        except Exception as exc:
+            return ActionReturn(
+                errmsg=f'MagicMaker exception: {exc}',
+                state=ActionStatusCode.HTTP_ERROR)
+        image_url = response.json()['data']['imgUrl']
+        return {'image': image_url}
+
+```
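
The tool above indexes `response.json()['data']['imgUrl']` directly, so an error body from the service would raise a `KeyError` or `TypeError` outside the `try` block. The following is a minimal sketch of how the payload construction and the response handling could be separated and made defensive. `build_payload` and `extract_image_url` are hypothetical helpers, not part of Lagent or MagicMaker. The field names and the response shape are taken from the code above, not from any API documentation.

```python
import json

# Option lists copied from the MagicMaker class above.
STYLES = ('dongman', 'guofeng', 'xieshi', 'youhua', 'manghe')
ASPECT_RATIOS = ('16:9', '4:3', '3:2', '1:1', '2:3', '3:4', '9:16')


def build_payload(keywords: str, style: str = 'guofeng',
                  aspect_ratio: str = '4:3') -> str:
    """Serialize the request body in the same shape the tool above sends."""
    if style not in STYLES:
        raise ValueError(f'The style must be one of {STYLES}')
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f'The aspect ratio must be one of {ASPECT_RATIOS}')
    return json.dumps({
        'official': True,
        'prompt': keywords,
        'style': style,
        'poseT': False,
        'aspectRatio': aspect_ratio,
    })


def extract_image_url(body: dict):
    """Return body['data']['imgUrl'] if present, otherwise None.

    Assumption: a successful response looks like {"data": {"imgUrl": ...}},
    as implied by the tool code above.
    """
    data = body.get('data')
    if isinstance(data, dict):
        return data.get('imgUrl')
    return None
```

With helpers like these, `generate_image` could return an `ActionReturn` carrying an error message when `extract_image_url` yields `None`, instead of raising.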
+
+Finally, modify `/root/agent_camp3/lagent/examples/internlm2_agent_web_demo.py` to register the custom tool.
+
+1. On the line after `from lagent.actions import ActionExecutor, ArxivSearch, IPythonInterpreter`, add `from lagent.actions.magicmaker import MagicMaker`
+2. Add `MagicMaker()` on line 27.
+
+```diff
+from lagent.actions import ActionExecutor, ArxivSearch, IPythonInterpreter
++ from lagent.actions.magicmaker import MagicMaker
+from lagent.agents.internlm2_agent import INTERPRETER_CN, META_CN, PLUGIN_CN, Internlm2Agent, Internlm2Protocol
+
+...
+        action_list = [
+            ArxivSearch(),
++             MagicMaker(),
+        ]
+```
+
+Now launch the Web Demo and give it a try! Enable both tools, then enter “请帮我生成一幅山水画” (“Please generate a landscape painting for me”).
+
+![instruction](https://github.com/user-attachments/assets/699308cd-6b17-4515-a42e-d120bd8e9a2b)
+
+![result](https://github.com/user-attachments/assets/c62cea67-1b9f-4a45-ba7f-6c5836d6db7e)
+
+Then try “帮我搜索一下 MindSearch 论文” (“Help me search for the MindSearch paper”) again.
+
+![result](https://github.com/user-attachments/assets/03a39808-db97-4321-883e-7a0446e95343)
diff --git a/docs/L2/Lagent/task.md b/docs/L2/Lagent/task.md
@@ -0,0 +1,17 @@
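+# Customize Your Own Agent with Lagent
+
+Document your reproduction process and include screenshots.
+
+## Basic task (completing this task clears the level)
+
+- Deploy an agent with the Lagent Web Demo and invoke it successfully.
+
+## Advanced task (not required to clear the level)
+
+- Customize an agent with Lagent, deploy and invoke it successfully with the Lagent Web Demo, and document the reproduction process with screenshots.
+
+## Submission (completing the task and submitting the materials counts as clearing the level)
+
+- The assignment consists of a single task; completing it counts as clearing the level.
+- Publish your write-up on any social platform such as Zhihu or CSDN and submit its link through the form below. Once a teaching assistant has reviewed it, you will receive a reward of 100 compute credits!
+- Submission form: https://aicarrier.feishu.cn/share/base/form/shrcnZ4bQ4YmhEtMtnKxZUcf1vd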