Welcome to the GitHub repository for the ODSC workshop on LLMOps. This workshop is designed to help you unlock the full potential of LLMs through quantization, distillation, fine-tuning, Kubernetes, and so much more!
Most of these case studies are from my book: Quick Start Guide to LLMs
For more details and to join the workshop, click here.
Dive into the practical side with our comprehensive notebooks. They will guide you step by step through the two case studies covered in the workshop, making for an interactive, hands-on learning experience.
Here are the slides for the workshop.
Quantizing Llama-3 dynamically - Using bitsandbytes to quantize a model in real time as it loads. We will compare the model before and after quantization.
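To give a feel for what happens under the hood, here is a minimal, dependency-free sketch of the absmax int8 idea behind dynamic quantizers like bitsandbytes (the real library quantizes tensors block-wise on the GPU; this toy version works on a plain Python list):

```python
# Absmax int8 quantization sketch: scale each weight by 127 / max(|w|),
# round to an int8 value, and keep the scale so the weights can be
# approximately recovered (dequantized) later.

def quantize_absmax(weights):
    """Quantize a list of floats to int8 values plus a per-tensor scale."""
    absmax = max(abs(w) for w in weights)
    scale = absmax / 127 if absmax else 1.0
    return [round(w / scale) for w in weights], scale

def dequantize(q, scale):
    """Recover approximate float weights from the int8 values."""
    return [v * scale for v in q]

weights = [0.1, -0.5, 0.25, 1.0]
q, scale = quantize_absmax(weights)
approx = dequantize(q, scale)  # close to the original weights
```

The gap between `weights` and `approx` is the quantization error we inspect in the notebook when comparing model behavior before and after quantization.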
See how to load a pre-quantized version of Llama to compare speed and memory usage:
Working with GGUF (no GPU)
Working with GGUF (with a GPU)
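For the speed and memory comparison, a generic measurement helper like the one below is one way to do it on the Python side. This is a sketch, not the notebook's code: `tracemalloc` only sees Python-heap allocations (CPU-side, not GPU memory), and `fake_load` is a stand-in for whichever model-loading call you are profiling:

```python
import time
import tracemalloc

def profile(fn, *args, **kwargs):
    """Run fn once and return (result, seconds, peak_heap_bytes).

    Note: tracemalloc tracks Python-heap allocations only, so this
    measures CPU-side memory, not GPU memory.
    """
    tracemalloc.start()
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed = time.perf_counter() - start
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return result, elapsed, peak

# Stand-in "loader" so the sketch runs anywhere; swap in your real
# model-loading call (e.g. a GGUF load) to compare the two setups.
def fake_load():
    return [0.0] * 100_000  # allocate something measurable

model, seconds, peak = profile(fake_load)
```

Running `profile` once per loading path (quantized vs. full-precision) gives directly comparable load times and peak allocations.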
Evaluating LLMs with Rubrics - Exploring a rubric prompt to evaluate generative output.
Evaluating Alignment (time permitting) - Seeing how an LLM can judge an agent's responses.
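To illustrate the rubric idea, here is a minimal sketch of a rubric-style evaluation prompt. This is not the workshop's exact rubric; the criteria and template are illustrative, and the resulting prompt would be sent to a judge LLM:

```python
# A rubric prompt asks a judge LLM to score a response on named
# criteria, returning one integer score per criterion.

RUBRIC_TEMPLATE = """You are grading an AI assistant's answer.

Question: {question}
Answer: {answer}

Score the answer from 1 (poor) to 5 (excellent) on each criterion:
- Accuracy: is the answer factually correct?
- Relevance: does it address the question?
- Clarity: is it easy to follow?

Respond with one line per criterion, e.g. "Accuracy: 4"."""

def build_rubric_prompt(question: str, answer: str) -> str:
    """Fill the rubric template for one (question, answer) pair."""
    return RUBRIC_TEMPLATE.format(question=question, answer=answer)

prompt = build_rubric_prompt("What is 2 + 2?", "4")
```

Because the judge is instructed to emit one `Criterion: score` line per criterion, its output is easy to parse into per-criterion numbers for aggregation across a test set.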
Here are some notebooks that I reference during the workshop but won't have time to get into:
If you enjoyed the case studies, please consider giving my book a 5-star rating on Amazon, as it really helps me as an author!