Play with LLMs
Sharing how to train and evaluate large language models, and how to build fun LLM applications based on RAG, Agents, and Chains.
Ready to Use | I code this so you don't have to!
- Mistral-8x7b-Instruct: stable JSON-format output using llama.cpp grammar
- Mistral-8x7b-Instruct CoT Agent: think step by step
- Mistral-8x7b-Instruct ReAct Agent with tool calls
- Llama3-8b-Instruct: playing around with transformers, vLLM, and llama.cpp
- Llama3-8b-Instruct: CoT with vLLM
- Llama3-8b-Instruct: ReAct with tool calls, implemented entirely in Chinese
- Chinese-Llama3-8b: DPO fine-tuning to make Llama3 more willing to respond in Chinese
- llama-cpp-convert-GGUF: quantize a model, convert it to GGUF format, and upload it to Hugging Face
- Advanced ReAct
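
For readers new to the ReAct pattern that several of the notebooks above are built around, here is a minimal sketch of the Thought / Action / Observation loop. It is not taken from the notebooks: the `add` tool, the `fake_llm` stand-in, the prompt labels, and `react_loop` itself are all hypothetical names chosen for illustration, and a real setup would replace `fake_llm` with a call to an actual model.

```python
import re
from typing import Callable, Dict

# Hypothetical tool registry; the tool name and behavior are illustrative only.
TOOLS: Dict[str, Callable[[str], str]] = {
    "add": lambda s: str(sum(int(x) for x in s.split(","))),
}

# Assumed output format: "Action: <tool>" followed by "Action Input: <args>".
ACTION_RE = re.compile(r"Action:\s*(\w+)\s*\nAction Input:\s*(.+)")
FINAL_RE = re.compile(r"Final Answer:\s*(.+)", re.DOTALL)


def react_loop(llm: Callable[[str], str], question: str, max_steps: int = 5) -> str:
    """Alternate model replies and tool observations until a final answer appears."""
    prompt = f"Question: {question}\n"
    for _ in range(max_steps):
        reply = llm(prompt)
        prompt += reply + "\n"
        final = FINAL_RE.search(reply)
        if final:
            return final.group(1).strip()
        action = ACTION_RE.search(reply)
        if action is None:
            break  # the model produced neither an action nor an answer
        name, arg = action.group(1), action.group(2).strip()
        tool = TOOLS.get(name)
        observation = tool(arg) if tool else f"unknown tool: {name}"
        # Feed the tool result back so the next model call can use it.
        prompt += f"Observation: {observation}\n"
    return "no answer"


def fake_llm(prompt: str) -> str:
    """Stand-in for a real model: request the tool once, then report its result."""
    observations = re.findall(r"Observation:\s*(.+)", prompt)
    if observations:
        return f"Thought: I have the result.\nFinal Answer: {observations[-1]}"
    return "Thought: I should add the numbers.\nAction: add\nAction Input: 17, 3"


if __name__ == "__main__":
    print(react_loop(fake_llm, "What is 17 + 3?"))
```

The loop stays independent of any inference backend: anything that maps a prompt string to a reply string can be passed in as `llm`.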
Deep Dive into LLMs | Pretraining, Fine-tuning, and RLHF
- qlora-finetune-Baichuan-7B
Showcase
| Mixtral 8x7b ReAct | Llama3-8b ReAct |
|---|---|
| | |