llama classification下載 - llama classification原始碼下載

llama classification

Ai源碼

v1.1.1

下載

使用 LLaMA 進行文字分類

此儲存庫提供了使用 LLaMA 進行文字分類的基本程式碼庫。

我使用什麼系統來開發？

設備：Nvidia 1xV100 GPU
設備內存：34G
主機記憶體：252G

如果您需要有關硬體的其他信息，請提出問題。

如何使用

實驗裝置

從此處從官方 LLaMA 儲存庫取得檢查點。
1-1.我假設檢查點位於專案根方向，內容安排如下。

 checkpoints
├── llama
│   ├── 7B
│   │   ├── checklist.chk
│   │   ├── consolidated.00.pth
│   │   └── params.json
│   └── tokenizer.model

準備你的Python環境。我建議使用 anaconda 來隔離本機的 CUDA 版本。

conda create -y -n llama-classification python=3.8
conda activate llama-classification
conda install cudatoolkit=11.7 -y -c nvidia
conda list cudatoolkit # to check what cuda version is installed (11.7)
pip install -r requirements.txt

方法：直接

Direct就是比較條件機率p(y|x) 。

使用以下腳本預處理 Huggingface 資料集中的資料。從現在開始，我們使用 ag_news 資料集。

python run_preprocess_direct_ag_news.py
python run_preprocess_direct_ag_news.py --sample=False --data_path=real/inputs_direct_ag_news.json # Use it for full evaluation

使用 LLaMA 計算條件機率並預測類別的推理。

torchrun --nproc_per_node 1 run_evaluate_direct_llama.py 
    --data_path samples/inputs_direct_ag_news.json 
    --output_path samples/outputs_direct_ag_news.json 
    --ckpt_dir checkpoints/llama/7B 
    --tokenizer_path checkpoints/llama/tokenizer.model

Calibration是用標定法改進直接法。

使用以下命令進行校準。

torchrun --nproc_per_node 1 run_evaluate_direct_calibrate_llama.py 
    --direct_input_path samples/inputs_direct_ag_news.json 
    --direct_output_path samples/outputs_direct_ag_news.json 
    --output_path samples/outputs_direct_calibrate_ag_news.json 
    --ckpt_dir checkpoints/llama/7B 
    --tokenizer_path checkpoints/llama/tokenizer.model

方式：管道

Channel是比較條件機率p(x|y) 。

使用以下腳本預處理 Huggingface 資料集中的資料。從現在開始，我們使用 ag_news 資料集。

python run_preprocess_channel_ag_news.py
python run_preprocess_channel_ag_news.py --sample=False --data_path=real/inputs_channel_ag_news.json # Use it for full evaluation

使用 LLaMA 計算條件機率並預測類別的推理。

torchrun --nproc_per_node 1 run_evaluate_channel_llama.py 
    --data_path samples/inputs_channel_ag_news.json 
    --output_path samples/outputs_channel_ag_news.json 
    --ckpt_dir checkpoints/llama/7B 
    --tokenizer_path checkpoints/llama/tokenizer.model

方法：純生成

若要使用generate模式進行評估，您可以使用預處理的直接版本。

torchrun --nproc_per_node 1 run_evaluate_generate_llama.py 
    --data_path samples/inputs_direct_ag_news.json 
    --output_path samples/outputs_generate_ag_news.json 
    --ckpt_dir checkpoints/llama/7B 
    --tokenizer_path checkpoints/llama/tokenizer.model

實驗

數據集	範例數	k	方法	準確性	推理時間
農業新聞	7600	1	直接的	0.7682	00:38:40
農業新聞	7600	1	直接+校準	0.8567	00:38:40
農業新聞	7600	1	頻道	0.7825	00:38:37

待辦事項列表

最後評論

我非常感謝 LLaMA 專案團隊發布檢查點及其高效的推理程式碼。該存儲庫中的大部分工作都是基於官方存儲庫完成的。
對於讀者來說，請不要猶豫提出問題或拉取請求。你可以給我..
- 有關其他功能請求的任何問題
- 有關具體實施的任何問題
- 關於研究方向的任何討論

引文

如果您使用我的程式碼庫進行研究，我們將歡迎引用我的工作。

 @software{Lee_Simple_Text_Classification_2023,
    author = {Lee, Seonghyeon},
    month = {3},
    title = {{Simple Text Classification Codebase using LLaMA}},
    url = {https://github.com/github/sh0416/llama-classification},
    version = {1.1.0},
    year = {2023}
}

展開

附加信息

版本 v1.1.1
類型 Ai源碼
更新時間 2024-12-10
大小 2.5MB
來自於 Github

相關應用

node llama cpp

2024-11-11
llama models

2024-11-10
LLaMA Factory

2024-11-02
程式碼駱駝

2023-10-30
Code Llama大模型

2023-08-25
駱駝2

2023-08-17

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
node telegram bot api

Ai源碼

v0.50.0
typebot.io

Ai源碼

v3.1.2
python wechaty getting started

Ai源碼

1.0.0
waymo open dataset

其他源碼

December 2023 Update
termwind

其他類別

v2.3.0
wp functions

其他類別

1.0.0

相關資訊全部