llama3 playground
1.0.0
A fully functional, ready-to-run environment for fine-tuning the Llama 3 model with a custom dataset and running inference on the fine-tuned models.
Note: This has so far only been tested on NVIDIA RTX 2080 and NVIDIA Tesla T4 GPUs. It has not been tested with other GPU classes or on CPUs.
Run this command on your host machine to check which NVIDIA GPU you have installed.
nvidia-smi
This should display your GPU information.
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.171.04             Driver Version: 535.171.04   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  NVIDIA GeForce RTX 2080        Off | 00000000:01:00.0  On |                  N/A |
| 22%   38C    P8              17W / 215W |    197MiB /  8192MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
git clone https://github.com/amithkoujalgi/llama3-playground.git
cd llama3-playground
bash build.sh
bash run.sh
This will start the Docker container with the following services.
| Service | Externally accessible endpoint | Internal port | Description |
|---|---|---|---|
| Supervisor | http://localhost:8884 | 9001 | For running training on a custom dataset and viewing the logs of the trainer process |
| FastAPI server | http://localhost:8883/docs | 8070 | For accessing the APIs of the model server |
| JupyterLab server | http://localhost:8888/lab | 8888 | For accessing the JupyterLab interface to browse the container and update/experiment with the code |
Note: All the processes (OCR, training, and inference) use the GPU, so running more than one of them at the same time will lead to out-of-memory (OOM) errors. To work around this, the system is designed to run only one process at any given point in time (i.e., only one instance of OCR, training, or inference can run at a time).
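If you build your own automation on top of the playground, one way to honor this single-process constraint is to wrap every GPU-bound step in an exclusive file lock. The sketch below is a hypothetical illustration, not the playground's actual mechanism; the lock-file path and helper names are made up:

```python
import fcntl

# Hypothetical lock file; the playground's real coordination mechanism may differ.
LOCK_PATH = "/tmp/llama3-playground.gpu.lock"

def acquire_gpu_lock():
    """Return a locked file handle, or None if another process holds the GPU."""
    fd = open(LOCK_PATH, "w")
    try:
        # Non-blocking exclusive lock: fail fast instead of queuing up.
        fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        return fd
    except BlockingIOError:
        fd.close()
        return None

def release_gpu_lock(fd):
    """Release the lock so the next OCR/training/inference run can start."""
    fcntl.flock(fd, fcntl.LOCK_UN)
    fd.close()
```

A caller would check `acquire_gpu_lock()` before launching OCR, training, or inference, and skip (or retry later) if it returns `None`.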
Feel free to update the code according to your needs.
To run training, go to a terminal and type:
playground --train
Go to a terminal and type:
playground -l
This generates models under /app/data/trained-models/. The trainer script produces 2 models, one of them with the lora-adapters suffix.

To run OCR:
cd /app/llama3_playground/core
python ocr.py -f "/app/sample.pdf"
To understand what the options mean, go to JupyterLab and run python ocr.py -h.
To run inference with RAG:
cd /app/llama3_playground/core
python infer_rag.py \
  -m "llama-3-8b-instruct-custom-1720802202" \
  -d "/app/data/ocr-runs/123/text-result.txt" \
  -q "What is the employer name, address, telephone, TIN, tax year end, type of business, plan name, Plan Sequence Number, Trust ID, Account number, is it a new plan or existing plan as true or false, are elective deferrals and roth deferrals allowed as true or false, are loans permitted as true or false, are life insurance investments permitted and what is the eligibility Service Requirement selected?" \
  -t 256 \
  -e "Alibaba-NLP/gte-base-en-v1.5" \
  -p "There are checkboxes in the text that denote the value as selected if the text is [Yes], and unselected if the text is [No]. The checkbox option's value can either be before the selected value or after. Keep this in context while responding and be very careful and precise in picking these values. Always respond as JSON. Keep the responses precise and concise."
To understand what the options mean, go to JupyterLab and run python infer_rag.py -h.
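If you script these runs from Python (for instance from a JupyterLab notebook inside the container), the invocation above can be assembled programmatically. The helper below is a convenience sketch, not part of the repository; pass the resulting list to `subprocess.run(cmd, cwd="/app/llama3_playground/core")`:

```python
def build_infer_rag_cmd(model, doc, question, max_new_tokens=256,
                        embedding_model="Alibaba-NLP/gte-base-en-v1.5",
                        prompt=None):
    """Assemble the infer_rag.py command line shown above.

    This is an illustrative wrapper around the CLI flags documented by
    `python infer_rag.py -h`; the defaults here simply echo the example.
    """
    cmd = ["python", "infer_rag.py",
           "-m", model,               # trained model name
           "-d", doc,                 # OCR'd text document to retrieve from
           "-q", question,            # the question to answer
           "-t", str(max_new_tokens), # generation budget
           "-e", embedding_model]     # embedding model for retrieval
    if prompt:
        cmd += ["-p", prompt]         # optional system/prompt guidance
    return cmd
```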
If you don't have the NVIDIA Container Toolkit installed on your host, you will need to install it.
# Configure the production repository
curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
  && curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
    sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
    sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
# Optionally, configure the repository to use experimental packages
sed -i -e '/experimental/ s/^#//g' /etc/apt/sources.list.d/nvidia-container-toolkit.list
# Update the packages list from the repository
sudo apt-get update
# Install the NVIDIA Container Toolkit packages
sudo apt-get install -y nvidia-container-toolkit
For other environments, refer to this.
curl --silent -X 'POST' \
  'http://localhost:8883/api/infer/sync/ctx-text' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
    "model_name": "llama-3-8b-instruct-custom-1720690384",
    "context_data": "You are a magician who goes by the name Magica",
    "question_text": "Who are you?",
    "prompt_text": "Respond in a musical and Shakespearean tone",
    "max_new_tokens": 50
  }' | jq -r ".data.response"
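The same call can be made from Python using only the standard library. The payload fields and the `.data.response` path mirror the curl example above; the helper names themselves are illustrative, and the call obviously requires the playground container to be running on localhost:8883:

```python
import json
from urllib.request import Request, urlopen

API_URL = "http://localhost:8883/api/infer/sync/ctx-text"

def make_infer_payload(model_name, context_data, question_text,
                       prompt_text="", max_new_tokens=50):
    """Build the JSON body expected by the sync inference endpoint."""
    return {
        "model_name": model_name,
        "context_data": context_data,
        "question_text": question_text,
        "prompt_text": prompt_text,
        "max_new_tokens": max_new_tokens,
    }

def infer(payload):
    """POST the payload and return the model's response text (.data.response)."""
    req = Request(API_URL,
                  data=json.dumps(payload).encode("utf-8"),
                  headers={"Content-Type": "application/json",
                           "accept": "application/json"})
    with urlopen(req) as resp:
        return json.load(resp)["data"]["response"]
```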
curl -X 'POST' \
  'http://localhost:8883/api/ocr/sync/pdf' \
  -H 'accept: application/json' \
  -H 'Content-Type: multipart/form-data' \
  -F 'file=@your_file.pdf;type=application/pdf'
This returns true if an OCR process is running, and false otherwise.

curl -X 'GET' \
  'http://localhost:8883/api/ocr/status' \
  -H 'accept: application/json'
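When scripting OCR jobs, you can poll this status endpoint until the GPU is free. A small standard-library sketch, assuming the endpoint returns a bare JSON boolean as described above; the retry parameters and the injectable `probe` argument are my own additions for testability:

```python
import json
import time
from urllib.request import urlopen

STATUS_URL = "http://localhost:8883/api/ocr/status"

def ocr_busy():
    """Return True while an OCR process is running on the server."""
    with urlopen(STATUS_URL) as resp:
        return json.load(resp) is True

def wait_until_ocr_free(poll_seconds=5, timeout_seconds=600, probe=None):
    """Block until the status endpoint reports False, or raise on timeout.

    `probe` defaults to the real HTTP check; injecting a fake makes the
    loop testable without a running server.
    """
    probe = probe or ocr_busy
    deadline = time.monotonic() + timeout_seconds
    while time.monotonic() < deadline:
        if not probe():
            return
        time.sleep(poll_seconds)
    raise TimeoutError("OCR still busy after %s seconds" % timeout_seconds)
```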
References: