llm toysダウンロード - llm toysソースコードダウンロード

llmおもちゃ

小規模 (7B 以下) の、実稼働対応の微調整された LLM で、さまざまな便利なタスクに対応します。

サポートされているタスク: 言い換え、文章のトーンの変更、ダイローグからの要約とトピックの生成、~~検索拡張QA(WIP)~~ 。

量子化された 3B および 7B モデルで LoRA を微調整します。 3B モデルは特定のタスクに対して微調整されていますが、7B モデルはすべてのタスクに対して微調整されています。

目標は、非常に控えめな消費者向けハードウェア上でこれらすべてのモデルを微調整して使用できるようにすることです。

インストール

pip install llm-toys

CUDA 対応の GPU がないと動作しない可能性があります
bitsandbytes で「インストールされている bitsandbytes のバージョンは GPU サポートなしでコンパイルされました」というメッセージが表示された場合は、ここ bitsandbytes-foundation/bitsandbytes#112 を参照してください。
または試してください
cp <path_to_your_venv>/lib/python3.10/site-packages/bitsandbytes/libbitsandbytes_cpu.so <path_to_your_venv>/lib/python3.10/site-packages/bitsandbytes/libbitsandbytes_cuda117.so
インストールされたパッケージではなく、ソースディレクトリからのトランスフォーマーと peft パッケージを使用していることに注意してください。 4 ビットビットサンドバイト量子化は、トランスフォーマーとペフトのメインブランチでのみ機能していました。 Transformers バージョン 4.31.0 と peft バージョン 0.4.0 が pypi に公開されたら、公開されたバージョンを使用します。

利用可能なモデル

モデル	サイズ	タスク	コラボ
llm-toys/RedPajama-INCITE-Base-3B-v1-paraphrase-tone	3B	言い換え、トーンチェンジ	ノート
llm-toys/RedPajama-INCITE-Base-3B-v1-dialogue-summary-topic	3B	対話の要約とトピックの生成	ノート
llm-toys/falcon-7b-paraphrase-tone-dialogue-summary-topic	7B	言い換え、口調の変更、会話の要約、トピックの生成	ノート

使用法

タスク固有の 3B モデル

言い換え

 from llm_toys . tasks import Paraphraser

paraphraser = Paraphraser ()
paraphraser . paraphrase ( "Hey, can yuo hepl me cancel my last order?" )
# "Could you kindly assist me in canceling my previous order?"

音色変化

 paraphraser . paraphrase ( "Hey, can yuo hepl me cancel my last order?" , tone = "casual" )
# "Hey, could you help me cancel my order?"

paraphraser . paraphrase ( "Hey, can yuo hepl me cancel my last order?" , tone = "professional" )
# "I would appreciate guidance on canceling my previous order."

paraphraser . paraphrase ( "Hey, can yuo hepl me cancel my last order?" , tone = "witty" )
# "Hey, I need your help with my last order. Can you wave your magic wand and make it disappear?"

対話の要約とトピックの生成

 from llm_toys . tasks import SummaryAndTopicGenerator

summary_topic_generator = SummaryAndTopicGenerator ()
summary_topic_generator . generate_summary_and_topic (
    """
    #Person1#: I'm so excited for the premiere of the latest Studio Ghibli movie!
    #Person2#: What's got you so hyped?
    #Person1#: Studio Ghibli movies are pure magic! The animation, storytelling, everything is incredible.
    #Person2#: Which movie is it?
    #Person1#: It's called "Whisper of the Wind." It's about a girl on a magical journey to save her village.
    #Person2#: Sounds amazing! I'm in for the premiere.
    #Person1#: Great! We're in for a visual masterpiece and a heartfelt story.
    #Person2#: Can't wait to be transported to their world.
    #Person1#: It'll be an unforgettable experience, for sure!
    """ . strip ()
)
# {"summary": "#Person1# is excited for the premiere of the latest Studio Ghibli movie.
#              #Person1# thinks the animation, storytelling, and heartfelt story will be unforgettable.
#              #Person2# is also excited for the premiere.",
#  "topic": "Studio ghibli movie"}

一般的な7Bモデル

 from llm_toys . tasks import GeneralTaskAssitant
from llm_toys . config import TaskType

gta = GeneralTaskAssitant ()
gta . complete ( TaskType . PARAPHRASE_TONE , "Hey, can yuo hepl me cancel my last order?" )
# "Could you assist me in canceling my previous order?"

gta . complete ( TaskType . PARAPHRASE_TONE , "Hey, can yuo hepl me cancel my last order?" , tone = "casual" )
# "Hey, can you help me cancel my last order?"

gta . complete ( TaskType . PARAPHRASE_TONE , "Hey, can yuo hepl me cancel my last order?" , tone = "professional" )
# "I would appreciate if you could assist me in canceling my previous order."

gta . complete ( TaskType . PARAPHRASE_TONE , "Hey, can yuo hepl me cancel my last order?" , tone = "witty" )
# "Oops! Looks like I got a little carried away with my shopping spree. Can you help me cancel my last order?"

chat = """
#Person1#: I'm so excited for the premiere of the latest Studio Ghibli movie!
#Person2#: What's got you so hyped?
#Person1#: Studio Ghibli movies are pure magic! The animation, storytelling, everything is incredible.
#Person2#: Which movie is it?
#Person1#: It's called "Whisper of the Wind." It's about a girl on a magical journey to save her village.
#Person2#: Sounds amazing! I'm in for the premiere.
#Person1#: Great! We're in for a visual masterpiece and a heartfelt story.
#Person2#: Can't wait to be transported to their world.
#Person1#: It'll be an unforgettable experience, for sure!
""" . strip ()
gta . complete ( TaskType . DIALOGUE_SUMMARY_TOPIC , chat )
# {"summary": "#Person1# tells #Person2# about the upcoming Studio Ghibli movie.
#              #Person1# thinks it's magical and #Person2#'s excited to watch it.",
#  "topic": "Movie premiere"}

トレーニング

データ

言い換えとトーンの変更: パッセージとその言い換えバージョン、およびカジュアル、プロフェッショナル、機知に富んだなどのさまざまなトーンのパッセージが含まれています。パッセージのトーンを言い換えたり変更したりするためにモデルに使用されます。データは gpt-35-turbo を使用して生成されました。トレーニングパッセージの小さなサンプルも、quora quesions と square_2 データセットからピックアップされています。
ダイアログの概要とトピックの生成: ダイアログとその概要とトピックが含まれます。トレーニングデータは、Dialogsum データセットのトレーニング分割からの約 1,000 レコードです。また、開発分割からの約 20 個のサンプルも含まれています。サンプリングでは、長い概要とトピックを持つデータポイントが優先されました。元のラベル付きトピックは単なる単語であり、十分に説明的ではなかったため、最終トレーニングデータでは一部 (~30) のトピックが手動で編集されたことに注意してください。

サンプルトレーニングスクリプト

すべてのオプションを確認するには

python llm_toys/train.py --help

言い換えおよびトーン変更モデルをトレーニングするには

python llm_toys/train.py 
    --task_type paraphrase_tone 
    --model_name meta-llama/Llama-2-7b 
    --max_length 128 
    --batch_size 8 
    --gradient_accumulation_steps 1 
    --learning_rate 1e-4 
    --num_train_epochs 3 
    --eval_ratio 0.05

評価

言い換えとトーンチェンジ

WIP

対話の要約とトピックの生成

評価は、Dialogsum テスト分割からの 500 レコードに対して行われます。

 # llm-toys/RedPajama-INCITE-Base-3B-v1-dialogue-summary-topic
{ "rouge1" : 0.453 , "rouge2" : 0.197 , "rougeL" : 0.365 , "topic_similarity" : 0.888 }

# llm-toys/falcon-7b-paraphrase-tone-dialogue-summary-topic
{ 'rouge1' : 0.448 , 'rouge2' : 0.195 , 'rougeL' : 0.359 , 'topic_similarity' : 0.886 }