berto_ollama_whisper

πŸš– fully local taxi-driver language assistant

πŸ“ overview

this project provides a fully local language assistant called berto. berto can listen to your audio, transcribe it using whisper, and then interact with you in spanish using an ai model served locally via ollama. it also has text-to-speech capabilities to provide audio responses. πŸ—£οΈ

🌟 features

πŸŽ₯ demo

demo gif

watch the demo video here

πŸš€ setup

1. install dependencies πŸ“¦

first, you need to install the required packages. you can do this by running the following command to install all dependencies from the requirements.txt file:

pip install -r requirements.txt

2. download and install ollama πŸ› οΈ

this project requires ollama to serve the ai model (llama2-uncensored). you can download ollama from the Ollama website and install it on your local machine.

once installed, you need to download the model:

ollama run llama2-uncensored

make sure ollama is running on localhost:11434 to handle the requests.

3. run the script 🎬

to run the assistant, execute the following script:

python bertosito_chat.py

this will start a conversation with berto, who will transcribe your spoken audio and respond based on the conversation using the ai model hosted on ollama.

πŸ’‘ how it works

πŸ“‚ file structure

πŸ“ notes


enjoy interacting with berto! πŸŽ‰