Skip to content
Prompt Engineering
0:13:41
5 010
255
24
Last update : 18/09/2024

🗣️ Your Personal AI Assistant: Going Local with Verbi 🤖

Want the power of an AI assistant without relying on the cloud? This is your guide to building a local voice assistant using Verbi, an open-source project that puts you in control.

🗝️ Why Go Local? 🤔

Imagine having a voice assistant that:

  • Respects your privacy: Your conversations stay on your device. 🤫
  • Works offline: No internet? No problem! 📶🚫
  • Responds instantly: Lightning-fast responses without server lag. ⚡

That’s the power of going local!

🏗️ Building Your Local Voice Assistant: The 3 Pillars 🏛️

Think of your voice assistant like a relay race:

  1. 👂 Speech to Text (Whisper API): Converts your spoken words into text.
  2. 🧠 Language Model (OLAMMA): Understands the text and generates responses.
  3. 🗣️ Text to Speech (Mello TTS): Transforms the responses back into spoken words.

Let’s break down how to set up each component locally:

1. 👂 From Sounds to Words: Fast Whisper API 💨

  • What it does: Transcribes your voice into text with impressive accuracy.
  • Why it’s cool: Built on the Whisper model, known for its speed and efficiency.
  • How to set it up:
    • Clone the Fast Whisper API repository.
    • Install the necessary packages.
    • Run the API to start the server.

💡Pro Tip: Create a virtual environment to keep your project dependencies organized.

2. 🧠 The Brain: Running LLMs Locally with OLAMMA 🧠

  • What it does: Acts as the brain of your assistant, understanding your requests and generating responses.
  • Why it’s cool: Lets you run powerful language models like LLaMA on your own hardware.
  • How to set it up:
    • Install OLAMMA on your machine.
    • Download your desired language model (e.g., LLaMA 38B).
    • Run the OLAMMA client to start the server.

🤯 Fun Fact: LLaMA stands for “Large Language Model Meta AI.”

3. 🗣️ Giving Your Assistant a Voice: Mello TTS 🎶

  • What it does: Converts text responses into natural-sounding speech.
  • Why it’s cool: Offers a variety of voices to choose from, making your assistant more personable.
  • How to set it up:
    • Clone the Mello TTS repository.
    • Install Mello TTS as a package.
    • Download the speech model files.
    • Run the Mello TTS API endpoint.

💡Pro Tip: Experiment with different speaker IDs to find a voice you like!

🧩 Putting It All Together: Configuring Verbi ⚙️

  • 1. Configuration is Key: Open the config.py file in Verbi and update the following:
    • Transcription Model: Set to fastwhisperapi
    • Response Model: Set to llama
    • Text to Speech Model: Set to mellowtts
  • 2. Start Talking: Run the run_voice_assistant.py script.

🎉 Congratulations! You’ve built your very own local voice assistant!

🧰 Resource Toolbox 🧰

This setup provides a solid foundation for a local voice assistant. Remember, the world of open-source is constantly evolving, so keep exploring and experimenting to create an assistant that truly meets your needs!

Other videos of

Play Video
Prompt Engineering
0:05:56
175
15
3
Last update : 17/01/2025
Play Video
Prompt Engineering
0:08:46
149
8
2
Last update : 16/01/2025
Play Video
Prompt Engineering
0:17:31
187
10
2
Last update : 15/01/2025
Play Video
Prompt Engineering
0:25:53
262
20
0
Last update : 08/01/2025
Play Video
Prompt Engineering
0:15:29
288
27
2
Last update : 18/11/2024
Play Video
Prompt Engineering
0:15:36
1 404
72
7
Last update : 13/11/2024
Play Video
Prompt Engineering
0:08:55
12 183
213
29
Last update : 30/10/2024
Play Video
Prompt Engineering
0:18:55
2 004
139
6
Last update : 21/10/2024
Play Video
Prompt Engineering
0:10:22
3 088
133
9
Last update : 19/10/2024