Whatsapp Speech To Text

This is a speech-to-text application for WhatsApp that uses Whisper and whatsapp-web.js and runs in Docker.

Description

Once authenticated on WhatsApp Web, the worker uses Whisper to transcribe every voice message that you reply to with the command !tran. Currently, it only transcribes messages from contacts saved in your contact book.

Originally, the program used Google Cloud Speech, but it now uses Whisper, which is a lightweight, open-source speech recognition engine.

If you do not want to host the model directly on your computer, you can use the main_openai_api branch, which uses the OpenAI API to transcribe the audio.

If you want to contribute, just send a pull request.

Usage

Just reply to the voice message you want to transcribe with !tran
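
The reply check described above could look roughly like the following. This is a minimal sketch, not the project's actual code: the helper names isTranscribeCommand and findVoiceMessageToTranscribe are hypothetical, and trimming whitespace around the command is an assumption. Only the message fields used here (body, hasQuotedMsg, getQuotedMessage(), hasMedia, type) come from whatsapp-web.js.

```javascript
// Hypothetical helpers; only the message fields are whatsapp-web.js API.
const COMMAND = '!tran';

// True when the message text is the transcription command.
// Trimming surrounding whitespace is an assumption of this sketch.
function isTranscribeCommand(body) {
  return typeof body === 'string' && body.trim() === COMMAND;
}

// Returns the quoted voice message to transcribe, or null when the incoming
// message is not a "!tran" reply to an audio message.
async function findVoiceMessageToTranscribe(message) {
  if (!isTranscribeCommand(message.body) || !message.hasQuotedMsg) {
    return null;
  }
  const quoted = await message.getQuotedMessage();
  // whatsapp-web.js reports voice notes as type "ptt" and audio files as "audio".
  const isAudio =
    quoted.hasMedia && (quoted.type === 'ptt' || quoted.type === 'audio');
  return isAudio ? quoted : null;
}
```

In a real worker, a function like this would presumably be called from a client event handler, and the returned message would then be downloaded and sent to the Whisper API.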

Running the server

To build the images, run docker-compose build.
To run the containers, run docker-compose up (do not detach; the QR code is displayed in the terminal).

Configuration

To choose the model you want to use, edit the variable MODEL_VERSION under x-shared-variables in docker-compose.yml. Default model: tiny
To configure the path and the API address, edit the environment variables in docker-compose.yml. The default values are:
- HOST_ADDRESS=whisper_api
- CHROME_DATA_PATH="/app/data/"
If you want to use the code outside Docker, you will need to edit the environment variables in the index.js file so that they point to your API address.
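
As an illustration of how these two variables might be resolved in Node.js, here is a minimal sketch. The loadConfig function and the DEFAULTS object are hypothetical names, not taken from index.js; only the variable names and their default values come from the list above.

```javascript
// Defaults copied from the docker-compose.yml values listed above.
const DEFAULTS = {
  HOST_ADDRESS: 'whisper_api',
  CHROME_DATA_PATH: '/app/data/',
};

// Build the runtime configuration from an environment object, falling back
// to the defaults when a variable is unset or empty.
// loadConfig is a hypothetical helper, not part of the project.
function loadConfig(env = process.env) {
  return {
    hostAddress: env.HOST_ADDRESS || DEFAULTS.HOST_ADDRESS,
    chromeDataPath: env.CHROME_DATA_PATH || DEFAULTS.CHROME_DATA_PATH,
  };
}

// Example: point the worker at an API running on the local machine.
const config = loadConfig({ HOST_ADDRESS: 'localhost' });
console.log(config.hostAddress, config.chromeDataPath);
// prints "localhost /app/data/"
```

Passing the environment as a parameter keeps the function easy to try outside Docker without exporting any variables.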

If you are using a GPU, add the following to the whisper_api service and adjust it to your needs:

    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]

To set the message header for the automatic response, edit the variables responseMsgHeader and responseMsgHeaderError in node/index.js.

TODO

~~Only transcribe if the audio is replied with "!tran"~~
~~Send "!tran" from my chat and also transcribe the audio. For now only messages sent by contacts will be transcribed.~~
Save the models locally
Maybe use https://github.com/ahmetoner/whisper-asr-webservice as the api
Add environment file.

Bugs

~~For now, files that are older than the session can't be fetched. A solution might be to retrieve the file with some function and cache it.~~
- ~~UPDATE: Because the whatsapp-web.js library cannot retrieve messages by ID, this bug cannot be fixed for now. There may be another solution, but I don't see it.~~
  - UPDATE 2: The bug has been fixed using the fetchMessages() function from whatsapp-web.js; the function that handles this is called downloadQuotedMedia().
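
The idea behind the fix can be sketched as follows. This is an assumption-laden illustration, not the project's actual downloadQuotedMedia(): the signature, the limit parameter, and its default of 50 are hypothetical. The calls chat.fetchMessages({ limit }), message.id._serialized and message.downloadMedia() are part of whatsapp-web.js.

```javascript
// Sketch only: the project's real downloadQuotedMedia() may differ.
// `limit` and its default value are hypothetical.
async function downloadQuotedMedia(quotedMsg, chat, limit = 50) {
  if (!quotedMsg.hasMedia) {
    return null;
  }
  // Load recent messages of the chat so that media from before the current
  // session can be found.
  const messages = await chat.fetchMessages({ limit });
  // Match the quoted message by its serialized ID.
  const match = messages.find(
    (m) => m.id._serialized === quotedMsg.id._serialized
  );
  // downloadMedia() resolves to a MessageMedia object (mimetype, data, filename).
  return match ? match.downloadMedia() : null;
}
```

A message older than the fetched window would still not be found, so the limit would need to be chosen with the chat's volume in mind.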

Additional Information

Version 1.0.0
Type Other source code
Update Time 2025-02-21
size 156.35KB
From Github
