Modelscope Hub | Paper | Demo
中文 | English
Modelscope-Agent is a customizable and scalable Agent framework. A single agent has abilities such as role-playing, LLM calling, tool usage, planning, and memory. It mainly has the following characteristics:
RolePlay
agents with latest OpenAI model GPT-4o
. Developers can experience this feature by specifying the image_url
parameter.Assistant API
, and also provided a Tools API
that executes utilities in isolated, secure containers, please find the documentbash run_msgpt.sh
.clone repo and install dependency:
git clone https://github.com/modelscope/modelscope-agent.git
cd modelscope-agent && pip install -r requirements.txt
The ModelScope Notebook offers a free-tier that allows ModelScope user to run the FaceChain application with minimum setup, refer to ModelScope Notebook
# Step1: 我的notebook -> PAI-DSW -> GPU环境
# Step2: Download the [demo file](https://github.com/modelscope/modelscope-agent/blob/master/demo/demo_qwen_agent.ipynb) and upload it to the GPU.
# Step3: Execute the demo notebook in order.
The agent incorporates an LLM along with task-specific tools, and uses the LLM to determine which tool or tools to invoke in order to complete the user's tasks.
To start, all you need to do is initialize an RolePlay
object with corresponding tasks
# 配置环境变量;如果您已经提前将api-key提前配置到您的运行环境中,可以省略这个步骤
import os
os.environ['DASHSCOPE_API_KEY']=YOUR_DASHSCOPE_API_KEY
os.environ['AMAP_TOKEN']=YOUR_AMAP_TOKEN
# 选用RolePlay 配置agent
from modelscope_agent.agents.role_play import RolePlay # NOQA
role_template = '你扮演一个天气预报助手,你需要查询相应地区的天气,并调用给你的画图工具绘制一张城市的图。'
llm_config = {'model': 'qwen-max', 'model_server': 'dashscope'}
# input tool name
function_list = ['amap_weather', 'image_gen']
bot = RolePlay(
function_list=function_list, llm=llm_config, instruction=role_template)
response = bot.run('朝阳区天气怎样?')
text = ''
for chunk in response:
text += chunk
Result
# 第一次调用llm的输出
Action: amap_weather
Action Input: {"location": "朝阳区"}
# 第二次调用llm的输出
目前,朝阳区的天气状况为阴天,气温为1度。
Action: image_gen
Action Input: {"text": "朝阳区城市风光", "resolution": "1024*1024"}
# 第三次调用llm的输出
目前,朝阳区的天气状况为阴天,气温为1度。同时,我已为你生成了一张朝阳区的城市风光图,如下所示:
![](https://dashscope-result-sh.oss-cn-shanghai.aliyuncs.com/1d/45/20240204/3ab595ad/96d55ca6-6550-4514-9013-afe0f917c7ac-1.jpg?Expires=1707123521&OSSAccessKeyId=LTAI5tQZd8AEcZX6KZV4G8qL&Signature=RsJRt7zsv2y4kg7D9QtQHuVkXZY%3D)
An Agent
object consists of the following components:
LLM
: A large language model that is responsible to process your inputs and decide calling tools.function_list
: A list consists of available tools for agents.Currently, configuration of Agent
may contain following arguments:
llm
: The llm config of this agent
function_list
: A list of tools
storage_path
: If not specified otherwise, all data will be stored here in KV pairs by memoryinstruction
: the system instruction of this agentname
: the name of agentdescription
: the description of agent, which is used for multi_agentkwargs
: other potential parametersAgent
, as a base class, cannot be directly initialized and called. Agent subclasses need to inherit it. They must implement function _run
, which mainly includes three parts: generation of messages/propmt, calling of llm(s), and tool calling based on the results of llm. We provide an implement of these components in RolePlay
for users, and you can also custom your components according to your requirement.
from modelscope_agent import Agent
class YourCustomAgent(Agent):
def _run(self, user_request, **kwargs):
# Custom your workflow
LLM is core module of agent, which ensures the quality of interaction results.
Currently, configuration of `` may contain following arguments:
model
: The specific model name will be passed directly to the model service provider.model_server
: provider of model services.BaseChatModel
, as a base class of llm, cannot be directly initialized and called. The subclasses need to inherit it. They must implement function _chat_stream
and _chat_no_stream
, which correspond to streaming output and non-streaming output respectively.
Optionally implement chat_with_functions
and chat_with_raw_prompt
for function calling and text completion.
Currently we provide the implementation of three model service providers: dashscope (for qwen series models), zhipu (for glm series models) and openai (for all openai api format models). You can directly use the models supported by the above service providers, or you can customize your llm.
For more information please refer to docs/modules/llm.md
Tool
We provide several multi-domain tools that can be configured and used in the agent.
You can also customize your tools with set the tool's name, description, and parameters based on a predefined pattern by inheriting the base tool. Depending on your needs, call() can be implemented. An example of a custom tool is provided in demo_register_new_tool
You can pass the tool name or configuration you want to use to the agent.
# by tool name
function_list = ['amap_weather', 'image_gen']
bot = RolePlay(function_list=function_list, ...)
# by tool configuration
from langchain.tools import ShellTool
function_list = [{'terminal':ShellTool()}]
bot = RolePlay(function_list=function_list, ...)
# by mixture
function_list = ['amap_weather', {'terminal':ShellTool()}]
bot = RolePlay(function_list=function_list, ...)
image_gen
: Wanx Image Generation. DASHSCOPE_API_KEY needs to be configured in the environment variable.code_interpreter
: Code Interpreterweb_browser
: Web Browsingamap_weather
: AMAP Weather. AMAP_TOKEN needs to be configured in the environment variable.wordart_texture_generation
: Word art texture generation. DASHSCOPE_API_KEY needs to be configured in the environment variable.web_search
: Web Searching. []qwen_vl
: Qwen-VL image recognition. DASHSCOPE_API_KEY needs to be configured in the environment variable.style_repaint
: Character style redrawn. DASHSCOPE_API_KEY needs to be configured in the environment variable.image_enhancement
: Chasing shadow-magnifying glass. DASHSCOPE_API_KEY needs to be configured in the environment variable.text-address
: Geocoding. MODELSCOPE_API_TOKEN needs to be configured in the environment variable.speech-generation
: Speech generation. MODELSCOPE_API_TOKEN needs to be configured in the environment variable.video-generation
: Video generation. MODELSCOPE_API_TOKEN needs to be configured in the environment variable.Please refer the multi-agent readme.
If you would like to learn more about the practical details of Agent, you can refer to our articles and video tutorials:
We appreciate your enthusiasm in participating in our open-source ModelScope-Agent project. If you encounter any issues, please feel free to report them to us. If you have built a new Agent demo and are ready to share your work with us, please create a pull request at any time! If you need any further assistance, please contact us via email at [email protected] or communication group!
Facechain is an open-source project for generating personalized portraits in various styles using facial images uploaded by users. By integrating the capabilities of Facechain into the modelscope-agent framework, we have greatly simplified the usage process. The generation of personalized portraits can now be done through dialogue with the Facechain Agent.
FaceChainAgent Studio Application Link: https://modelscope.cn/studios/CVstudio/facechain_agent_studio/summary
You can run it directly in a notebook/Colab/local environment: https://www.modelscope.cn/my/mynotebook
! git clone -b feat/facechain_agent https://github.com/modelscope/modelscope-agent.git
! cd modelscope-agent && ! pip install -r requirements.txt
! cd modelscope-agent/demo/facechain_agent/demo/facechain_agent && ! pip install -r requirements.txt
! pip install http://dashscope-cn-beijing.oss-cn-beijing.aliyuncs.com/zhicheng/modelscope_agent-0.1.0-py3-none-any.whl
! PYTHONPATH=/mnt/workspace/modelscope-agent/demo/facechain_agent && cd modelscope-agent/demo/facechain_agent/demo/facechain_agent && python app_v1.0.py
This project is licensed under the Apache License (Version 2.0).