This repository contains all the necessary items needed to build your own LLM from scratch. Just follow the instructions. Inspired from Karpathy's nanoGPT and Shakespeare generator, I made this repository to build my own LLM. It has everything from data collection for the Model to architecture file, tokenizer and train file.
This repo contains:
Before setting up SmallLanguageModel, ensure that you have the following prerequisites installed:
Follow these steps to train your own tokenizer or generate outputs from the trained model:
Clone this repository:
git clone https://github.com/shivendrra/SmallLanguageModel-project
cd SLM-clone
Install Dependencies:
pip install requirements.txt
Train: Read the training.md for more information. Follow it.
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change. Please make sure to update tests as appropriate.
MIT License. Check out License.md for more info.