# sft demos

1.0.0
This repository contains demos for fine-tuning LLMs (large language models) such as Meta's llama-3. In particular, it focuses on short-form instruction following training.
Note: For training runs, organized by base model, see the `_peft` folder.
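To give a sense of what those runs involve, below is a minimal LoRA sketch using the `peft` library. The base model, rank, and target modules are illustrative assumptions, not the exact settings of any run in `_peft`.

```python
# Minimal LoRA setup (sketch). Hyperparameters and the base model here are
# placeholders, not the exact values used in the _peft runs.
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B",  # assumed base model, for illustration
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
config = LoraConfig(
    r=16,                                 # adapter rank
    lora_alpha=32,                        # adapter scaling
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only a small fraction of weights train
```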
A few examples:
Note: For evaluation runs, see the `_eval` folder.
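For reference, scores like the ones below can be reproduced with the `lm-eval` harness. This is a sketch: the task list, few-shot setting, and batch size are assumptions, not necessarily the exact `_eval` configuration.

```python
# Sketch: scoring a model with the lm-evaluation-harness Python API.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=dfurman/CalmeRys-78B-Orpo-v0.1,dtype=bfloat16",
    tasks=["ifeval"],  # illustrative; the leaderboard covers several tasks
    num_fewshot=0,
    batch_size=8,
)
print(results["results"])
```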
For example:
As of October 2024, dfurman/CalmeRys-78B-Orpo-v0.1 is the top-ranking model on the Open LLM Leaderboard.
| Metric | Value |
|---|---|
| Avg. | 50.78 |
| IFEval (0-shot) | 81.63 |
| BBH (3-shot) | 61.92 |
| MATH Lvl 5 (4-shot) | 37.92 |
| GPQA (0-shot) | 20.02 |
| MuSR (0-shot) | 36.37 |
| MMLU-PRO (5-shot) | 66.80 |
Note: Use the code below to get started with text generation (inference). You'll need a GPU-enabled machine.
```python
!pip install -qU transformers accelerate bitsandbytes
!huggingface-cli download dfurman/CalmeRys-78B-Orpo-v0.1
```
```python
import torch
import transformers
from transformers import AutoTokenizer, BitsAndBytesConfig

if torch.cuda.get_device_capability()[0] >= 8:
    !pip install -qqq flash-attn
    attn_implementation = "flash_attention_2"
    torch_dtype = torch.bfloat16
else:
    attn_implementation = "eager"
    torch_dtype = torch.float16

## quantize if necessary
# bnb_config = BitsAndBytesConfig(
#     load_in_4bit=True,
#     bnb_4bit_quant_type="nf4",
#     bnb_4bit_compute_dtype=torch_dtype,
#     bnb_4bit_use_double_quant=True,
# )

model = "dfurman/CalmeRys-78B-Orpo-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    model_kwargs={
        "torch_dtype": torch_dtype,
        # "quantization_config": bnb_config,
        "device_map": "auto",
        "attn_implementation": attn_implementation,
    },
)
```
question = "Is the number 9.11 larger than 9.9?"
messages = [
{ "role" : "system" , "content" : "You are a helpful assistant that thinks step by step." },
{ "role" : "user" , "content" : question },
]
prompt = tokenizer . apply_chat_template ( messages , tokenize = False , add_generation_prompt = True )
# print("***Prompt:n", prompt)
outputs = pipeline (
prompt , max_new_tokens = 1000 , do_sample = True , temperature = 0.7 , top_k = 50 , top_p = 0.95
)
print ( "***Generation:" )
print ( outputs [ 0 ][ "generated_text" ][ len ( prompt ) :])
```
***Generation:
To compare these two numbers, it's important to look at their decimal places after the whole number part, which is 9 in both cases. Comparing the tenths place, 9.11 has a '1' and 9.9 has a '9'. Since '9' is greater than '1', 9.9 is larger than 9.11.
```
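Note that the example samples with `temperature=0.7`, so the output varies between runs. If you want a repeatable answer for comparison purposes, a greedy variant (a small sketch, not part of the original demo) works:

```python
# Greedy decoding: deterministic output, useful for repeatable sanity checks.
outputs = pipeline(prompt, max_new_tokens=1000, do_sample=False)
print(outputs[0]["generated_text"][len(prompt):])
```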
question = """The bakers at the Beverly Hills Bakery baked 200 loaves of bread on Monday morning.
They sold 93 loaves in the morning and 39 loaves in the afternoon.
A grocery store then returned 6 unsold loaves back to the bakery.
How many loaves of bread did the bakery have left?
Respond as succinctly as possible. Format the response as a completion of this table:
|step|subquestion|procedure|result|
|:---|:----------|:--------|:-----:|"""
messages = [
{ "role" : "system" , "content" : "You are a helpful assistant." },
{ "role" : "user" , "content" : question },
]
prompt = tokenizer . apply_chat_template ( messages , tokenize = False , add_generation_prompt = True )
# print("***Prompt:n", prompt)
outputs = pipeline ( prompt , max_new_tokens = 1000 , do_sample = True , temperature = 0.7 , top_k = 50 , top_p = 0.95 )
print ( "***Generation:" )
print ( outputs [ 0 ][ "generated_text" ][ len ( prompt ):])
```
***Generation:
|1|Calculate total sold|Add morning and afternoon sales|132|
|2|Subtract sold from total|200 - 132|68|
|3|Adjust for returns|Add returned loaves to remaining|74|
```
question = "What's a good recipe for a spicy margarita?"
messages = [
{ "role" : "system" , "content" : "You are a helpful assistant." },
{ "role" : "user" , "content" : question },
]
prompt = tokenizer . apply_chat_template ( messages , tokenize = False , add_generation_prompt = True )
# print("***Prompt:n", prompt)
outputs = pipeline ( prompt , max_new_tokens = 1000 , do_sample = True , temperature = 0.7 , top_k = 50 , top_p = 0.95 )
print ( "***Generation:" )
print ( outputs [ 0 ][ "generated_text" ][ len ( prompt ):])
***Generation:
To make a Spicy Margarita, you'll need to incorporate a chili or pepper element into your classic margarita recipe. Here’s a simple way to do it:
### Ingredients:
- 2 oz tequila (blanco or reposado)
- 1 oz fresh lime juice
- 1/2 oz triple sec (Cointreau or Grand Marnier)
- 1/2 oz agave syrup or simple syrup
- 1-2 slices of jalapeño (or more depending on how spicy you like it)
- Salt and/or chili powder for rimming the glass
- Ice
- Lime wheel for garnish
### Instructions:
1. **Muddle Jalapeño**: In a shaker, muddle the jalapeño slices slightly. This will release the oils and heat from the peppers.
2. **Add Remaining Ingredients**: Add the tequila, lime juice, triple sec, and agave syrup or simple syrup.
3. **Shake and Strain**: Fill the shaker with ice and shake vigorously until cold. Strain into a salt and/or chili powder rimmed glass filled with ice.
4. **Garnish and Serve**: Garnish with a lime wheel and enjoy.
If you prefer a smoother spiciness that doesn't overpower the drink, you could also consider making a jalapeño-infused tequila by leaving the jalapeño slices in the bottle of tequila for several hours to a couple of days, adjusting the time based on desired level of spiciness. Then use this infused tequila instead of regular tequila in the recipe above.
Another variation is to use a spicy syrup. To make this, combine equal parts water and sugar with a few sliced jalapeños in a saucepan. Bring to a boil, stirring occasionally to dissolve the sugar. Reduce heat and simmer for about 5 minutes. Let cool, strain out the jalapeños, then store in a sealed container in the refrigerator until ready to use. Use this spicy syrup instead of regular syrup in the recipe.
As always, adjust the quantity of jalapeño or the type of chili used to suit your taste. Enjoy responsibly!
```
Base models:

Datasets:

Compute providers:
To set up a local environment:

```sh
python3 -m venv .venv
source .venv/bin/activate
pip3 install -r requirements.txt
```