xTuring下載 - xTuring原始碼下載

xTuring

其他源碼

0.1.8

下載

隨機指標.ai

建構、修改和控制您自己的個人化法學碩士

xTuring提供對開源 LLM 的快速、高效且簡單的微調，例如 Mistral、LLaMA、GPT-J 等。透過提供易於使用的介面來根據您自己的資料和應用程式微調 LLM，xTuring 讓建置、修改和控制 LLM 變得簡單。整個過程可以在您的電腦內部或您的私有雲中完成，確保資料隱私和安全。

借助xTuring您可以，

從不同來源獲取數據並將其預處理為法學碩士可以理解的格式
從單一 GPU 擴展到多個 GPU，以實現更快的微調
利用記憶體高效方法（即 INT4、LoRA 微調）將硬體成本降低高達 90%
探索不同的微調方法並對它們進行基準測試以找到性能最佳的模型
根據明確定義的指標評估微調模型以進行深入分析

安裝

pip install xturing

快速入門

 from xturing . datasets import InstructionDataset
from xturing . models import BaseModel

# Load the dataset
instruction_dataset = InstructionDataset ( "./examples/models/llama/alpaca_data" )

# Initialize the model
model = BaseModel . create ( "llama_lora" )

# Finetune the model
model . finetune ( dataset = instruction_dataset )

# Perform inference
output = model . generate ( texts = [ "Why LLM models are becoming so important?" ])

print ( "Generated output by the model: {}" . format ( output ))

您可以在此處找到資料資料夾。

？什麼是新的？

我們很高興地宣布xTuring庫的最新增強功能：

LLaMA 2整合- 您可以在不同的配置中使用和微調LLaMA 2模型：現成的、現成的 INT8 精度、 LoRA 微調、 LoRA INT8 精度微調和LoRA 微調使用GenericModel包裝器以 INT4 精度進行調整和/或您可以使用xturing.models中的Llama2類別來測試和微調模型。

 from xturing . models import Llama2
model = Llama2 ()

## or
from xturing . models import BaseModel
model = BaseModel . create ( 'llama2' )

Evaluation - 現在您可以評估任何資料集上的任何Causal Language Model 。目前支援的指標是perplexity 。

 # Make the necessary imports
from xturing . datasets import InstructionDataset
from xturing . models import BaseModel

# Load the desired dataset
dataset = InstructionDataset ( '../llama/alpaca_data' )

# Load the desired model
model = BaseModel . create ( 'gpt2' )

# Run the Evaluation of the model on the dataset
result = model . evaluate ( dataset )

# Print the result
print ( f"Perplexity of the evalution: { result } " )

INT4 Precision - 您現在可以使用GenericLoraKbitModel來使用和微調具有INT4 Precision的任何 LLM。

 # Make the necessary imports
from xturing . datasets import InstructionDataset
from xturing . models import GenericLoraKbitModel

# Load the desired dataset
dataset = InstructionDataset ( '../llama/alpaca_data' )

# Load the desired model for INT4 bit fine-tuning
model = GenericLoraKbitModel ( 'tiiuae/falcon-7b' )

# Run the fine-tuning
model . finetune ( dataset )

CPU 推理- CPU（包括筆記型電腦 CPU）現在已完全具備處理 LLM 推理的能力。我們整合了英特爾® Extension for Transformers，透過僅使用權重量化演算法壓縮模型來節省內存，並利用其在英特爾平台上高度優化的核心來加速推理。

 # Make the necessary imports
from xturing . models import BaseModel

# Initializes the model: quantize the model with weight-only algorithms
# and replace the linear with Itrex's qbits_linear kernel
model = BaseModel . create ( "llama2_int8" )

# Once the model has been quantized, do inferences directly
output = model . generate ( texts = [ "Why LLM models are becoming so important?" ])
print ( output )

批次整合- 透過調整 .generate() 和 .evaluate() 函數中的“batch_size”，您可以加快結果的速度。使用大於 1 的“batch_size”通常可以提高處理效率。

 # Make the necessary imports
from xturing . datasets import InstructionDataset
from xturing . models import GenericLoraKbitModel

# Load the desired dataset
dataset = InstructionDataset ( '../llama/alpaca_data' )

# Load the desired model for INT4 bit fine-tuning
model = GenericLoraKbitModel ( 'tiiuae/falcon-7b' )

# Generate outputs on desired prompts
outputs = model . generate ( dataset = dataset , batch_size = 10 )

建議探索 Llama LoRA INT4 工作範例以了解其應用。

若要獲得更深入的了解，請考慮檢查儲存庫中提供的 GenericModel 工作範例。

CLI 遊樂場

$ xturing chat -m " <path-to-model-folder> "

使用者介面遊樂場

 from xturing . datasets import InstructionDataset
from xturing . models import BaseModel
from xturing . ui import Playground

dataset = InstructionDataset ( "./alpaca_data" )
model = BaseModel . create ( "<model_name>" )

model . finetune ( dataset = dataset )

model . save ( "llama_lora_finetuned" )

Playground (). launch () ## launches localhost UI

教學

準備你的資料集
Cerebras-GPT 使用 LoRA 和 INT8 進行微調
Cerebras-GPT 使用 LoRA 進行微調
LLaMA 使用 LoRA 和 INT8 進行微調
LLaMA 使用 LoRA 進行微調
LLaMA 微調
GPT-J 使用 LoRA 和 INT8 進行微調
GPT-J 使用 LoRA 進行微調
使用 LoRA 進行 GPT-2 微調

表現

以下是不同微調技術在 LLaMA 7B 模型上的效能比較。我們使用 Alpaca 資料集進行微調。該資料集包含 52K 指令。

硬體:

4 個 A100 40GB GPU、335GB CPU 內存

微調參數：

 {
  'maximum sequence length' : 512 ,
  'batch size' : 1 ,
}

拉馬-7B	DeepSpeed + CPU 卸載	LoRA + DeepSpeed	LoRA + DeepSpeed + CPU 卸載
圖形處理器	33.5GB	23.7GB	21.9GB
中央處理器	190GB	10.2GB	14.9GB
時間/紀元	21小時	20分鐘	20分鐘

透過在其他 GPU 上提交您的效能結果來為此做出貢獻，方法是在您的硬體規格、記憶體消耗和每個週期的時間方面提出問題。

？微調模型檢查點

我們已經微調了一些模型，您可以將其用作基礎或開始使用。以下是加載它們的方法：

 from xturing . models import BaseModel
model = BaseModel . load ( "x/distilgpt2_lora_finetuned_alpaca" )

模型	數據集	小路
DistilGPT-2 LoRA	羊駝毛	`x/distilgpt2_lora_finetuned_alpaca`
駱駝洛拉	羊駝毛	`x/llama_lora_finetuned_alpaca`

支援型號

以下是xTuring的BaseModel類別支援的所有模型及其對應的載入鍵的清單。

模型	鑰匙
盛開	盛開
大腦	大腦
蒸餾GPT-2	蒸餾GPT2
獵鷹7B	鷸
卡拉狄加	卡拉狄加
GPT-J	格普特吉
GPT-2	總蛋白2
駱駝	駱駝
拉瑪2	駱駝2
OPT-1.3B	選擇

上述是法學碩士的基本變體。以下是取得LoRA 、 INT8 、 INT8 + LoRA和INT4 + LoRA版本的範本。

版本	範本
洛拉	<模型密鑰>_lora
INT8	<模型鍵>_int8
INT8+LoRA	<模型金鑰>_lora_int8

** 為了載入任何模型的INT4+LoRA版本，您需要使用xturing.models中的GenericLoraKbitModel類別。下面是如何使用它：

 model = GenericLoraKbitModel ( '<model_path>' )

model_path可以替換為您的本機目錄或任何 HuggingFace 庫模型，例如facebook/opt-1.3b 。

？路線圖

？幫助與支持

如果您有任何疑問，可以在此儲存庫上建立問題。

您也可以加入我們的 Discord 伺服器並在#xturing頻道中開始討論。

執照

該項目根據 Apache License 2.0 獲得許可 - 有關詳細信息，請參閱許可證文件。

？貢獻

作為快速發展領域的開源項目，我們歡迎各種貢獻，包括新功能和更好的文件。請閱讀我們的貢獻指南，以了解如何參與。

展開

附加信息

版本 0.1.8
類型其他源碼
更新時間 2024-12-04
大小 24.7MB
來自於 Github

相關應用

waymo open dataset

2024-11-18
SmartTube

2024-12-14
Sunamu

2024-12-14
MySchedule.py

2024-12-15
viptools for eslam

2024-12-15
VITAident

2024-12-15

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
waymo open dataset

其他源碼

December 2023 Update
SmartTube

其他源碼

24.71 Stable
Sunamu

其他源碼

Release 2.2.0
waymo open dataset

其他源碼

December 2023 Update
wp functions

其他類別

1.0.0
termwind

其他類別

v2.3.0

相關資訊全部