This project utilizes Ollama LLM to create video text ideas based on user prompts. It takes a prompt from the user and generates a story, which is then converted into audio using AI. Subsequently, captions are generated from the audio, and finally, these captions are merged into a video.
git clone https://github.com/ccallazans/ai-video-generator.git
cd ai-video-generator
make start
Once the containers are up and running, you can access the AI video story generator through its provided interface or API endpoints. The first time may be slow because it has to download the Ollama model and Python Torch requirements, but subsequent runs will be faster because Docker keeps it in cache.
curl --location 'http://localhost:8080/api/v1/generate'
--header 'Content-Type: application/json'
--data '{
"message": "Tell me a story about Bahia"
}'
The generated videos are saved on the "generated" folder.
Install Ollama: Instructions on: https://ollama.com/download
Create a Python 3 environment and install its dependencies:
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
Run the Golang application:
go run cmd/*
Feel free to explore and modify the project according to your needs! If you encounter any issues or have suggestions for improvement, please don't hesitate to open an issue or submit a pull request.