This is a guide on how to build a Telegram Bot backed by an LLM (i.e. llama2-chat, llama2-chat-32k, vicuna). The bot is hosted on a free tier EC2 instance, the llm inference is hosted on Beam Cloud as a serverless REST API, which is free for the first 10 hours of compute. The whole thing is quite slow, but this is just a starting point.
You can follow this guide to build a Python Telegram Bot:
How to Create a Telegram Bot using Python
Here I will give you the main steps:
You can now start a conversation with your bot by searching for the username on Telegram.
As for hosting the llm inference the best option I found for now is Beam Cloud. Their compute prices are among the cheapest and they offer 10 hours of free compute with nice GPUs. The offer free storage, which is highly appreciated.
The chatbot is built using langchain and huggingface. So if you want to the Llama 2 family of models you will need to require access to the models. It is very easy to do and they are really quick at approving the request.
TODO I used a couple of sources to put together langchain and HF, I will add them ASAP.
If you want to use gated models you will need to set an hugging face token. This is built in the code, I will fix it in the next days.
This is a guide to generate the token:
HuggingFace User access tokens
Once you have created your account, no payment method required, go to the dashboard and under the Settings tab on the right menu you can find the Secrets. If you are using a model like llama 2 that requires an hugging face token then you need to set the HF_TOKEN variable with the hugging face token.
Then you can do everything locally. Move to the lm subdirectory.
cd ./src/telegram_llm_bot/shared/llm/beam
Follow the Beam installation guide Beam Installation.
Inside the app.py file you can modify the following variables or leave them as they are. I will soon move them to a configuration file:
HF_CACHE = "./models"
MODEL_ID = "meta-llama/Llama-2-7b-chat-hf"
APP_NAME = "travel-guru"
GPU = "T4"
MEMORY = "16Gi"
You are ready to deploy the app:
beam deploy app.py
The app should be up and running now. Go to the Beam Dashboard and under the Apps tab you can find your app.
You can host your bot for free on a free tier EC2 instance. This is a guide you can follow:
Tutorial: Get started with Amazon EC2 Linux instances
During the creation of the instance you have to remember to create a key pair that you will use to connect via ssh to your instance remotely.
I recommend to set Ubuntu as OS.
Once you set the key pair, the .pem will be automatically downloaded.
Now you can connect to the ec2 instance via command line using ssh:
ssh -i "{filename}.pem" ubuntu@{address}.{region}.compute.amazonaws.com
Clone this repository on the ec2 instance. We will only need the bot folder, we do need the rest, so I will probably separate it from the rest in the future, for now this is not a big problem:
git clone https://github.com/ma2za/telegram-llm-bot.git
Move to the bot directory
cd telegram-llm-bot
Create a .env file to set the environment variables common to all your bots
touch .env
Via nano modify the content of the .env with the following content.
MONGO_HOST=telegram-mongo
MONGO_PORT=27017
This is required to set up a MongoDB database to store the conversations.
Create another .env file specific for a bot to set the environment variables
touch ./src/telegram_llm_bot/bots/base_chatbot/.env
Via nano modify the content of the .env with the following content.
TELEGRAM_BOT_TOKEN =
BEAM_TOKEN =
BEAM_URL = https://apps.beam.cloud/{something}
SETTINGS_FILE=telegram_llm_bot.bots.base_chatbot.settings
BOT_NAME=travel-guru
TELEGRAM_BOT_TOKEN is the token we received earlier from the BotFather.
BEAM_TOKEN: under API Keys in the Beam app dashboard you can generate a Beam token.
BEAM_URL is obtained from the overview of the app where you can click on Call API and there you can easily find out the url
We can finally use docker compose to build images and run the containers.
Install Docker and Docker compose. Here is the official guide:
Install Docker Engine on Ubuntu
Build, create and start the containers:
sudo docker compose up -d --build
We are done here!
The system prompts are contained in config.yml.
You are ready to chat!