LLaMa2lang Download - LLaMa2lang Quellcode-Download

Jetzt mit LLaMa3-Unterstützung

LLaMa2lang v0.6

Dieses Repository enthält praktische Skripte zur Feinabstimmung von LLaMa3-8B (oder einem anderen Basismodell) für den Chat in jeder Sprache (außer Englisch). Der Grund dafür ist, dass LLaMa3 hauptsächlich auf englischen Daten trainiert wird und zwar in gewissem Umfang für andere Sprachen funktioniert, seine Leistung im Vergleich zu Englisch jedoch schlecht ist.

Kombinieren Sie die Kraft der Feinabstimmung mit der Kraft von RAG – schauen Sie sich unser RAG Me Up-Repository auf RAG an, das zusätzlich zu Ihren mit LLaMa2Lang getunten Modellen verwendet werden kann.

TL;DR

 pip install -r requirements.txt

# Translate OASST1 to target language
python translate.py m2m target_lang checkpoint_location

# Combine the checkpoint files into a dataset
python combine_checkpoints.py input_folder output_location

# Finetune
python finetune.py tuned_model dataset_name instruction_prompt

# Optionally finetune with DPO (RLHF)
python finetune_dpo.py tuned_model dataset_name instruction_prompt

# Run inference
python run_inference.py model_name instruction_prompt input

Was es bewirkt

Der Prozess, den wir befolgen, um ein Basismodell wie LLaMa3 für eine bestimmte Sprache abzustimmen, ist wie folgt:

Laden Sie einen Datensatz, der Fragen und Antworten/Anweisungspaare enthält.
Übersetzen Sie den gesamten Datensatz in eine bestimmte Zielsprache.
Laden Sie den übersetzten Datensatz und extrahieren Sie Threads, indem Sie rekursiv nur Eingabeaufforderungen mit ihren jeweiligen Antworten mit dem höchsten Rang auswählen, bis hin zu nachfolgenden Eingabeaufforderungen usw.
Wandeln Sie die Threads in Eingabeaufforderungen um, die einer vorgegebenen Vorlage folgen (anpassbar).
Verwenden Sie QLoRA und PEFT, um die Anweisungsfeinabstimmung eines Basisfundamentmodells für diesen Datensatz zu optimieren.
- Verwenden Sie QLoRA und PEFT zur Feinabstimmung mit DPO, um die Kapazitäten des Modells noch weiter zu erweitern und ihm bevorzugte Antworten gegenüber abgelehnten beizubringen. Beachten Sie, dass Ihr Basisdatensatz über diese Informationen verfügen muss.
- Alternativ zu DPO können Sie dasselbe mit ORPO erreichen
Führen Sie die Inferenz mit dem neu trainierten Modell aus.

Unterstützte Paradigmen

Übersetzung

OPUS
M2M
MADLAD
mBART
NLLB
Nahtlos (nur groß)
Tower Instruct (Kann Rechtschreibfehler korrigieren)

Basisdatensätze

Folgendes wurde getestet, aber möglicherweise funktionieren noch mehr

OASST1
OASST2

Unterstützte Fundamentmodelle

LLaMa3
LLaMa2
Mistral
(Inoffiziell) Mixtral 8x7B

Roadmap

[L2L-6] Untersuchen Sie die Interoperabilität mit anderen Bibliotheken (Axolotl, llamacpp, unsloth)
[L2L-7] Erlauben Sie verschiedene Quantisierungen neben QLoRA (GGUF, GPTQ, AWQ)
[L2L-10] Unterstützt die Erweiterung des Tokenizers und des Vokabulars

Kosten und Laufzeit

Der obige Prozess kann vollständig auf einer kostenlosen Google Colab T4-GPU ausgeführt werden. Der letzte Schritt kann jedoch nur mit ausreichend kurzen Kontextfenstern und einem Stapel von höchstens 2 erfolgreich ausgeführt werden. Darüber hinaus dauert die Übersetzung in Schritt 2 insgesamt etwa 36 Stunden für eine bestimmte Sprache und sollte daher bei Bedarf in mehreren Schritten ausgeführt werden Ich möchte bei einer kostenlosen Google Colab-GPU bleiben.

Unsere fein abgestimmten Modelle für Schritt 5 wurden mit einem A40 auf broad.ai erstellt und kosteten uns weniger als einen Dollar für jedes Modell, die Fertigstellung dauerte etwa 1,5 Stunden.

Verwendung

Stellen Sie sicher, dass Pytorch installiert ist und für Ihre Umgebung funktioniert (Verwendung von CUDA vorzuziehen): https://pytorch.org/get-started/locally/
Klonen Sie das Repo und installieren Sie die Anforderungen.

pip install -r requirements.txt

Übersetzen Sie Ihren Basisdatensatz in die gewünschte Zielsprache.

 usage: translate.py [-h] [--quant8] [--quant4] [--base_dataset BASE_DATASET] [--base_dataset_text_field BASE_DATASET_TEXT_FIELD] [--base_dataset_lang_field BASE_DATASET_LANG_FIELD]
                    [--checkpoint_n CHECKPOINT_N] [--batch_size BATCH_SIZE] [--max_length MAX_LENGTH] [--cpu] [--source_lang SOURCE_LANG]
                    {opus,mbart,madlad,m2m,nllb,seamless_m4t_v2,towerinstruct} ... target_lang checkpoint_location

Translate an instruct/RLHF dataset to a given target language using a variety of translation models

positional arguments:
  {opus,mbart,madlad,m2m,nllb,seamless_m4t_v2,towerinstruct}
                        The model/architecture used for translation.
    opus                Translate the dataset using HelsinkiNLP OPUS models.
    mbart               Translate the dataset using mBART.
    madlad              Translate the dataset using Google's MADLAD models.
    m2m                 Translate the dataset using Facebook's M2M models.
    nllb                Translate the dataset using Facebook's NLLB models.
    seamless_m4t_v2     Translate the dataset using Facebook's SeamlessM4T-v2 multimodal models.
    towerinstruct       Translate the dataset using Unbabel's Tower Instruct. Make sure your target language is in the 10 languages supported by the model.
  target_lang           The target language. Make sure you use language codes defined by the translation model you are using.
  checkpoint_location   The folder the script will write (JSONized) checkpoint files to. Folder will be created if it doesn't exist.

options:
  -h, --help            show this help message and exit
  --quant8              Optional flag to load the translation model in 8 bits. Decreases memory usage, increases running time
  --quant4              Optional flag to load the translation model in 4 bits. Decreases memory usage, increases running time
  --base_dataset BASE_DATASET
                        The base dataset to translate, defaults to OpenAssistant/oasst1
  --base_dataset_text_field BASE_DATASET_TEXT_FIELD
                        The base dataset's column name containing the actual text to translate. Defaults to text
  --base_dataset_lang_field BASE_DATASET_LANG_FIELD
                        The base dataset's column name containing the language the source text was written in. Defaults to lang
  --checkpoint_n CHECKPOINT_N
                        An integer representing how often a checkpoint file will be written out. To start off, 400 is a reasonable number.
  --batch_size BATCH_SIZE
                        The batch size for a single translation model. Adjust based on your GPU capacity. Default is 10.
  --max_length MAX_LENGTH
                        How much tokens to generate at most. More tokens might be more accurate for lengthy input but creates a risk of running out of memory. Default is unlimited.
  --cpu                 Forces usage of CPU. By default GPU is taken if available.
  --source_lang SOURCE_LANG
                        Source language to select from OASST based on lang property of dataset

Wenn Sie weitere Parameter für die verschiedenen Übersetzungsmodelle wünschen, führen Sie Folgendes aus:

 python translate.py [MODEL] -h

Stellen Sie sicher, dass Sie zuerst modellspezifische Parameter angeben, bevor Sie allgemeine Parameter aus der obigen Liste angeben. Beispielaufrufe:

 # Using M2M with 4bit quantization and differen batch sizes to translate Dutch
python translate.py m2m nl ./output_nl --quant4 --batch_size 20

# Using madlad 7B with 8bit quantization for German with different max_length
python translate.py madlad --model_size 7b de ./output_de --quant8 --batch_size 5 --max_length 512

# Be sure to use target language codes that the model you use understands
python translate.py mbart xh_ZA ./output_xhosa
python translate.py nllb nld_Latn ./output_nl

Kombinieren Sie die JSON-Arrays aus den Dateien der Prüfpunkte zu einem Huggingface-Datensatz und schreiben Sie ihn dann entweder auf die Festplatte oder veröffentlichen Sie ihn auf Huggingface. Das Skript versucht standardmäßig, auf die Festplatte zu schreiben und auf die Veröffentlichung auf Huggingface zurückzugreifen, wenn der Ordner nicht auf der Festplatte vorhanden ist. Stellen Sie für die Veröffentlichung auf Huggingface sicher, dass Sie Ihre Umgebungsvariable HF_TOKEN gemäß der Dokumentation eingerichtet haben.

 usage: combine_checkpoints.py [-h] input_folder output_location

Combine checkpoint files from translation.

positional arguments:
  input_folder     The checkpoint folder used in translation, with the target language appended.
                   Example: "./output_nl".
  output_location  Where to write the Huggingface Dataset. Can be a disk location or a Huggingface
                   Dataset repository.

options:
  -h, --help       show this help message and exit

Wandeln Sie die übersetzten Nachrichten in Chat-/Anweisungs-/Eingabeaufforderungsthreads um und optimieren Sie die Anweisungen eines Gründungsmodells mithilfe von LoRA und PEFT.

 usage: finetune.py [-h] [--base_model BASE_MODEL] [--base_dataset_text_field BASE_DATASET_TEXT_FIELD] [--base_dataset_rank_field BASE_DATASET_RANK_FIELD] [--base_dataset_id_field BASE_DATASET_ID_FIELD] [--base_dataset_parent_field BASE_DATASET_PARENT_FIELD]
                   [--base_dataset_role_field BASE_DATASET_ROLE_FIELD] [--quant8] [--noquant] [--max_seq_length MAX_SEQ_LENGTH] [--num_train_epochs NUM_TRAIN_EPOCHS] [--batch_size BATCH_SIZE] [--threads_output_name THREADS_OUTPUT_NAME] [--thread_template THREAD_TEMPLATE]
                   [--padding PADDING]
                   tuned_model dataset_name instruction_prompt

Finetune a base instruct/chat model using (Q)LoRA and PEFT

positional arguments:
  tuned_model           The name of the resulting tuned model.
  dataset_name          The name of the dataset to use for fine-tuning. This should be the output of the combine_checkpoints script.
  instruction_prompt    An instruction message added to every prompt given to the chatbot to force it to answer in the target language. Example: "You are a generic chatbot that always answers in English."

options:
  -h, --help            show this help message and exit
  --base_model BASE_MODEL
                        The base foundation model. Default is "NousResearch/Meta-Llama-3-8B-Instruct".
  --base_dataset_text_field BASE_DATASET_TEXT_FIELD
                        The dataset's column name containing the actual text to translate. Defaults to text
  --base_dataset_rank_field BASE_DATASET_RANK_FIELD
                        The dataset's column name containing the rank of an answer given to a prompt. Defaults to rank
  --base_dataset_id_field BASE_DATASET_ID_FIELD
                        The dataset's column name containing the id of a text. Defaults to message_id
  --base_dataset_parent_field BASE_DATASET_PARENT_FIELD
                        The dataset's column name containing the parent id of a text. Defaults to parent_id
  --base_dataset_role_field BASE_DATASET_ROLE_FIELD
                        The dataset's column name containing the role of the author of the text (eg. prompter, assistant). Defaults to role
  --quant8              Finetunes the model in 8 bits. Requires more memory than the default 4 bit.
  --noquant             Do not quantize the finetuning. Requires more memory than the default 4 bit and optional 8 bit.
  --max_seq_length MAX_SEQ_LENGTH
                        The maximum sequence length to use in finetuning. Should most likely line up with your base model's default max_seq_length. Default is 512.
  --num_train_epochs NUM_TRAIN_EPOCHS
                        Number of epochs to use. 2 is default and has been shown to work well.
  --batch_size BATCH_SIZE
                        The batch size to use in finetuning. Adjust to fit in your GPU vRAM. Default is 4
  --threads_output_name THREADS_OUTPUT_NAME
                        If specified, the threads created in this script for finetuning will also be saved to disk or HuggingFace Hub.
  --thread_template THREAD_TEMPLATE
                        A file containing the thread template to use. Default is threads/template_fefault.txt
  --padding PADDING     What padding to use, can be either left or right.

6.1 [OPTIONAL] Feinabstimmung mit DPO (ähnlich wie RLHF)

 usage: finetune_dpo.py [-h] [--base_model BASE_MODEL] [--base_dataset_text_field BASE_DATASET_TEXT_FIELD] [--base_dataset_rank_field BASE_DATASET_RANK_FIELD] [--base_dataset_id_field BASE_DATASET_ID_FIELD] [--base_dataset_parent_field BASE_DATASET_PARENT_FIELD] [--quant8]
                       [--noquant] [--max_seq_length MAX_SEQ_LENGTH] [--max_prompt_length MAX_PROMPT_LENGTH] [--num_train_epochs NUM_TRAIN_EPOCHS] [--batch_size BATCH_SIZE] [--threads_output_name THREADS_OUTPUT_NAME] [--thread_template THREAD_TEMPLATE] [--max_steps MAX_STEPS]
                       [--padding PADDING]
                       tuned_model dataset_name instruction_prompt

Finetune a base instruct/chat model using (Q)LoRA and PEFT using DPO (RLHF)

positional arguments:
  tuned_model           The name of the resulting tuned model.
  dataset_name          The name of the dataset to use for fine-tuning. This should be the output of the combine_checkpoints script.
  instruction_prompt    An instruction message added to every prompt given to the chatbot to force it to answer in the target language. Example: "You are a generic chatbot that always answers in English."

options:
  -h, --help            show this help message and exit
  --base_model BASE_MODEL
                        The base foundation model. Default is "NousResearch/Meta-Llama-3-8B-Instruct".
  --base_dataset_text_field BASE_DATASET_TEXT_FIELD
                        The dataset's column name containing the actual text to translate. Defaults to text
  --base_dataset_rank_field BASE_DATASET_RANK_FIELD
                        The dataset's column name containing the rank of an answer given to a prompt. Defaults to rank
  --base_dataset_id_field BASE_DATASET_ID_FIELD
                        The dataset's column name containing the id of a text. Defaults to message_id
  --base_dataset_parent_field BASE_DATASET_PARENT_FIELD
                        The dataset's column name containing the parent id of a text. Defaults to parent_id
  --quant8              Finetunes the model in 8 bits. Requires more memory than the default 4 bit.
  --noquant             Do not quantize the finetuning. Requires more memory than the default 4 bit and optional 8 bit.
  --max_seq_length MAX_SEQ_LENGTH
                        The maximum sequence length to use in finetuning. Should most likely line up with your base model's default max_seq_length. Default is 512.
  --max_prompt_length MAX_PROMPT_LENGTH
                        The maximum length of the prompts to use. Default is 512.
  --num_train_epochs NUM_TRAIN_EPOCHS
                        Number of epochs to use. 2 is default and has been shown to work well.
  --batch_size BATCH_SIZE
                        The batch size to use in finetuning. Adjust to fit in your GPU vRAM. Default is 4
  --threads_output_name THREADS_OUTPUT_NAME
                        If specified, the threads created in this script for finetuning will also be saved to disk or HuggingFace Hub.
  --thread_template THREAD_TEMPLATE
                        A file containing the thread template to use. Default is threads/template_fefault.txt
  --max_steps MAX_STEPS
                        The maximum number of steps to run DPO for. Default is -1 which will run the data through fully for the number of epochs but this will be very time-consuming.
  --padding PADDING     What padding to use, can be either left or right.

6.1 [OPTIONAL] Feinabstimmung mit ORPO (ähnlich RLHF)

 usage: finetune_orpo.py [-h] [--base_model BASE_MODEL] [--base_dataset_text_field BASE_DATASET_TEXT_FIELD] [--base_dataset_rank_field BASE_DATASET_RANK_FIELD] [--base_dataset_id_field BASE_DATASET_ID_FIELD] [--base_dataset_parent_field BASE_DATASET_PARENT_FIELD] [--quant8]
                        [--noquant] [--max_seq_length MAX_SEQ_LENGTH] [--max_prompt_length MAX_PROMPT_LENGTH] [--num_train_epochs NUM_TRAIN_EPOCHS] [--batch_size BATCH_SIZE] [--threads_output_name THREADS_OUTPUT_NAME] [--thread_template THREAD_TEMPLATE] [--max_steps MAX_STEPS]
                        [--padding PADDING]
                        tuned_model dataset_name instruction_prompt

Finetune a base instruct/chat model using (Q)LoRA and PEFT using ORPO (RLHF)

positional arguments:
  tuned_model           The name of the resulting tuned model.
  dataset_name          The name of the dataset to use for fine-tuning. This should be the output of the combine_checkpoints script.
  instruction_prompt    An instruction message added to every prompt given to the chatbot to force it to answer in the target language. Example: "You are a generic chatbot that always answers in English."

options:
  -h, --help            show this help message and exit
  --base_model BASE_MODEL
                        The base foundation model. Default is "NousResearch/Meta-Llama-3-8B-Instruct".
  --base_dataset_text_field BASE_DATASET_TEXT_FIELD
                        The dataset's column name containing the actual text to translate. Defaults to text
  --base_dataset_rank_field BASE_DATASET_RANK_FIELD
                        The dataset's column name containing the rank of an answer given to a prompt. Defaults to rank
  --base_dataset_id_field BASE_DATASET_ID_FIELD
                        The dataset's column name containing the id of a text. Defaults to message_id
  --base_dataset_parent_field BASE_DATASET_PARENT_FIELD
                        The dataset's column name containing the parent id of a text. Defaults to parent_id
  --quant8              Finetunes the model in 8 bits. Requires more memory than the default 4 bit.
  --noquant             Do not quantize the finetuning. Requires more memory than the default 4 bit and optional 8 bit.
  --max_seq_length MAX_SEQ_LENGTH
                        The maximum sequence length to use in finetuning. Should most likely line up with your base model's default max_seq_length. Default is 512.
  --max_prompt_length MAX_PROMPT_LENGTH
                        The maximum length of the prompts to use. Default is 512.
  --num_train_epochs NUM_TRAIN_EPOCHS
                        Number of epochs to use. 2 is default and has been shown to work well.
  --batch_size BATCH_SIZE
                        The batch size to use in finetuning. Adjust to fit in your GPU vRAM. Default is 4
  --threads_output_name THREADS_OUTPUT_NAME
                        If specified, the threads created in this script for finetuning will also be saved to disk or HuggingFace Hub.
  --thread_template THREAD_TEMPLATE
                        A file containing the thread template to use. Default is threads/template_fefault.txt
  --max_steps MAX_STEPS
                        The maximum number of steps to run ORPO for. Default is -1 which will run the data through fully for the number of epochs but this will be very time-consuming.
  --padding PADDING     What padding to use, can be either left or right.

Führen Sie die Inferenz mit dem neu erstellten QLoRA-Modell aus.

 usage: run_inference.py [-h] model_name instruction_prompt input

Script to run inference on a tuned model.

positional arguments:
  model_name          The name of the tuned model that you pushed to Huggingface in the previous
                      step.
  instruction_prompt  An instruction message added to every prompt given to the chatbot to force
                      it to answer in the target language.
  input               The actual chat input prompt. The script is only meant for testing purposes
                      and exits after answering.

options:
  -h, --help          show this help message and exit

Auswahl des richtigen Übersetzungsmodells

Woher weiß ich, welches Übersetzungsmodell ich für meine Zielsprache wählen soll?

Wir bieten Ihnen benchmark.py -Skript, das dabei hilft, einigermaßen gute Vermutungen anzustellen (der von uns verwendete Datensatz ist derselbe, auf dem die OPUS-Modelle trainiert werden, sodass die Ergebnisse immer positiv für OPUS sind). Informationen zur Verwendung finden Sie in der Hilfe dieses Skripts unten. Modelle werden in 4-Bit-Quantisierung geladen und auf einer kleinen Stichprobe der Teilmenge der OPUS-Bücher ausgeführt.

Stellen Sie sicher, dass Sie die in Ihrem Basisdatensatz am häufigsten vorkommenden Sprachen als Quellsprache und Ihre Zielübersetzungssprache als Zielsprache verwenden. Stellen Sie beispielsweise sicher, dass für OASST1 mindestens en und es als Quellsprachen ausgeführt werden.

 usage: benchmark.py [-h] [--cpu] [--start START] [--n N] [--max_length MAX_LENGTH] source_language target_language included_models

Benchmark all the different translation models for a specific source and target language to find out which performs best. This uses 4bit quantization to limit GPU usage. Note:
the outcomes are indicative - you cannot assume corretness of the BLEU and CHRF scores but you can compare models against each other relatively.

positional arguments:
  source_language       The source language you want to test for. Check your dataset to see which occur most prevalent or use English as a good start.
  target_language       The source language you want to test for. This should be the language you want to apply the translate script on. Note: in benchmark, we use 2-character
                        language codes, in constrast to translate.py where you need to specify whatever your model expects.
  included_models       Comma-separated list of models to include. Allowed values are: opus, m2m_418m, m2m_1.2b, madlad_3b, madlad_7b, madlad_10b, madlad_7bbt, mbart,
                        nllb_distilled600m, nllb_1.3b, nllb_distilled1.3b, nllb_3.3b, seamless

options:
  -h, --help            show this help message and exit
  --cpu                 Forces usage of CPU. By default GPU is taken if available.
  --start START         The starting offset to include sentences from the OPUS books dataset from. Defaults to 0.
  --n N                 The number of sentences to benchmark on. Defaults to 100.
  --max_length MAX_LENGTH
                        How much tokens to generate at most. More tokens might be more accurate for lengthy input but creates a risk of running out of memory. Default is 512.

Datensätze und Modelle

Wir haben bereits zahlreiche Datensätze und Modelle erstellt und werden dies auch weiterhin tun. Möchten Sie zur Demokratisierung von LLMs beitragen? Klonen Sie das Repo, erstellen Sie Datensätze und Modelle für andere Sprachen und erstellen Sie dann eine PR.

Übersetzte oasst1-Datensätze


Niederländisch UnderstandLing/oasst1_nl	Spanisch verstehenLing/oasst1_es	Französisch verstehenLing/oasst1_fr	Deutsch verstehenLing/oasst1_de
Katalanisch xaviviro/oasst1_ca	Portugiesisch verstehenLing/oasst1_pt	Arabisch HeshamHaroon/oasst-arabisch	Italienisch verstehenLing/oasst1_it
Russisch verstehenLing/oasst1_ru	Hindi UnderstandLing/oasst1_hi	Chinesisch verstehenLing/oasst1_zh	Polnische Chrystianer/oasst1_pl
Japanisch verstehenLing/oasst1_jap	Baskisch xezpeleta/oasst1_eu	Bengali UnderstandLing/oasst1_bn	Türkisch verstehenLing/oasst1_tr

Sprachspezifische ❗LLaMa3-8B❗ Chat-Modelladapter

Stellen Sie sicher, dass Sie Zugriff auf das LLaMa3-8B-Modell von Meta haben, und legen Sie Ihren HF_TOKEN fest, bevor Sie diese Modelle verwenden.


UnderstandLing/Llama-3-8B-Instruct-nl Niederländisch	UnderstandLing/Llama-3-8B-Instruct-es Spanisch	UnderstandLing/Llama-3-8B-Instruct-fr Französisch	UnderstandLing/Llama-3-8B-Instruct-de Deutsch
UnderstandLing/Llama-3-8B-Instruct-pt Portugiesisch	UnderstandLing/Llama-3-8B-Instruct-it Italienisch	UnderstandLing/Llama-3-8B-Instruct-hi Hindi	UnderstandLing/Llama-3-8B-Instruct-ru Russisch

Übersetzte LLaMa2-Thread-Chat-Eingabeaufforderungsdatensätze


Niederländisch UnderstandLing/oasst1_nl_threads	Spanisch verstehenLing/oasst1_es_threads	Französisch verstehenLing/oasst1_fr_threads	German UnderstandLing/oasst1_de_threads
Katalanisch xaviviro/oasst1_ca_threads	Portugiesisch UnderstandLing/oasst1_pt_threads	Arabisch HeshamHaroon/oasst-arabic_threads	Italienisch verstehenLing/oasst1_it_threads
Russisch verstehenLing/oasst1_ru_threads	Hindi UnderstandLing/oasst1_hi_threads	Chinesisch verstehenLing/oasst1_zh_threads	Polnische Chrystianer/oasst1_pl_threads
Japanisch verstehenLing/oasst1_jap_threads	Baskisch xezpeleta/oasst1_eu_threads	Bengali UnderstandLing/oasst1_bn_threads	Türkisch UnderstandLing/oasst1_tr_threads

Sprachspezifische LLaMa2-7B-Chat-Modelladapter


UnderstandLing/llama-2-7b-chat-nl Niederländisch	Ling/llama-2-7b-chat-es Spanisch verstehen	UnderstandLing/llama-2-7b-chat-fr Französisch	UnderstandLing/llama-2-7b-chat-de Deutsch
xaviviro/llama-2-7b-chat-ca Katalanisch	Ling/llama-2-7b-chat-pt Portugiesisch verstehen	HeshamHaroon/llama-2-7b-chat-ar Arabisch	VerstehenLing/llama-2-7b-chat-it Italienisch
VerstehenLing/llama-2-7b-chat-ru Russisch	UnderstandLing/llama-2-7b-chat-hi Hindi	Ling/llama-2-7b-chat-zh Chinesisch verstehen	chrystians/llama-2-7b-chat-pl-polish-polski Polnisch
xezpeleta/llama-2-7b-chat-eu Baskisch	UnderstandLing/llama-2-7b-chat-bn Bengali	VerstehenLing/llama-2-7b-chat-tr Türkisch

Sprachspezifische Mistral-Chat-Modelladapter


UnderstandLing/Mistral-7B-Instruct-v0.2-nl Niederländisch	UnderstandLing/Mistral-7B-Instruct-v0.2-es Spanisch	UnderstandLing/Mistral-7B-Instruct-v0.2-de Deutsch

Sprachspezifische LLaMa2-13B-Chat-Modelladapter


UnderstandLing/llama-2-13b-chat-nl Niederländisch	Ling/llama-2-13b-chat-es Spanisch verstehen	UnderstandLing/llama-2-13b-chat-fr Französisch

Sprachspezifische Mixtral-8x7B-Chat-Modelladapter


UnderstandLing/Mixtral-8x7B-Instruct-nl Niederländisch

Empirische Leistung

Niederländisch

<s>[INST] <<SYS>> Je bent een generieke chatbot die altijd in het Nederlands antwoord geeft. <</SYS>> Wat is de hoofdstad van Nederland? [/INST] Amsterdam</s>

<s>[INST] <<SYS>> Je bent een generieke chatbot die altijd in het Nederlands antwoord geeft. <</SYS>> Wat is de hoofdstad van Nederland? [/INST] Amsterdam</s><s>[INST] Hoeveel inwoners heeft die stad? [/INST] 850 duizend inwoners (2023)</s>

<s>[INST] <<SYS>> Je bent een generieke chatbot die altijd in het Nederlands antwoord geeft. <</SYS>> Wie is de minister-president van Nederland? [/INST] Mark Rutte is sinds 2010 minister-president van Nederland. Hij is meerdere keren herkozen.</s>

FAQ

F: Warum übersetzen Sie zuerst den vollständigen OASST1/2-Datensatz? Wäre es nicht schneller, nur Threads mit dem höchsten Rang zu übersetzen?
A: Während Sie hinsichtlich der Durchlaufzeit ziemlich viel gewinnen können, wenn Sie zuerst die Threads erstellen und sie dann übersetzen, stellen wir der Community vollständige OASST1/2-Übersetzungen zur Verfügung, da wir glauben, dass sie für sich genommen nützlich sein können.
F: Wie gut funktionieren die Feinabstimmungen im Vergleich zu Vanilla LLaMa3?
A: Obwohl wir keine formellen Benchmarks haben, ist es schwierig, wenn nicht sogar unmöglich, LLaMa3 dazu zu bringen, von Anfang an durchweg eine andere Sprache als Englisch zu sprechen. Die nicht-englische Sprache, die daraus entsteht, ist oft grammatikalisch fehlerhaft. Unsere Feinabstimmungen zeigen dieses Verhalten nicht.
F: Kann ich andere Frameworks zur Feinabstimmung verwenden?
A: Ja, das können Sie. Wir verwenden Axolotl für Schulungen zu Multi-GPU-Setups.
F: Kann ich verschiedene Übersetzungsmodelle kombinieren?
A: Auf jeden Fall, wir glauben, dass es sogar die Leistung steigern könnte, wenn die Übersetzung von mehreren Modellen durchgeführt wird. Sie können dies erreichen, indem Sie eine Übersetzung frühzeitig stoppen und von den Prüfpunkten aus fortfahren, indem Sie das Übersetzungsskript mit einem anderen Übersetzungsmodell erneut ausführen.

Finanzierung

Wir suchen aktiv nach Mitteln, um KI zu demokratisieren und ihre Anwendungen voranzutreiben. Kontaktieren Sie uns unter [email protected], wenn Sie investieren möchten.

Expandieren