rnnt speech recognition

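RNN-Transducer Speech Recognition

End-to-end speech recognition using RNN-Transducer in TensorFlow 2.0

Overview

This speech recognition model is based on Google's research paper "Streaming End-to-end Speech Recognition For Mobile Devices" and is implemented in Python 3 using TensorFlow 2.0.

Setting Up Your Environment

To set up your environment, run the following commands: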

git clone --recurse-submodules https://github.com/noahchalifour/rnnt-speech-recognition.git
cd rnnt-speech-recognition
pip install tensorflow==2.2.0 # or tensorflow-gpu==2.2.0 for GPU support
pip install -r requirements.txt
./scripts/build_rnnt.sh # builds the RNN-T loss
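
Since the commands above pin TensorFlow to 2.2.0, it can be useful to confirm which version actually ended up in the environment. The following is a minimal sketch, not part of the repository: the helper names `version_matches` and `installed_tensorflow_version` are hypothetical, and it only reads package metadata through the standard library's `importlib.metadata` (Python 3.8+).

```python
from importlib import metadata

REQUIRED_TF = "2.2.0"  # the version pinned by the pip command above


def version_matches(installed, required=REQUIRED_TF):
    """Return True when the installed version equals the pinned one.

    A local suffix such as "+cpu" is ignored before comparing.
    """
    return installed.split("+")[0] == required


def installed_tensorflow_version():
    """Return the installed tensorflow or tensorflow-gpu version, or None."""
    for dist in ("tensorflow", "tensorflow-gpu"):
        try:
            return metadata.version(dist)
        except metadata.PackageNotFoundError:
            continue
    return None


if __name__ == "__main__":
    found = installed_tensorflow_version()
    if found is None:
        print("TensorFlow is not installed")
    elif version_matches(found):
        print(f"TensorFlow {found} matches the pinned version")
    else:
        print(f"TensorFlow {found} found, but {REQUIRED_TF} is expected")
```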

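Common Voice

The Common Voice dataset is available for download from Mozilla's Common Voice project.

Converting All MP3s to WAVs

Before you can train a model on the Common Voice dataset, you must first convert all of its MP3 audio files to WAV. Do so by running the following commands:

Note: Make sure ffmpeg is installed on your computer, because the conversion script uses it to convert MP3 to WAV.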

./scripts/common_voice_convert.sh <data_dir> <num_threads>
python scripts/remove_missing_samples.py \
    --data_dir <data_dir> \
    --replace_old
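
To illustrate what a threaded MP3-to-WAV conversion with ffmpeg can look like, here is a minimal Python sketch. It is not the repository's `common_voice_convert.sh`; the function names are hypothetical, and the 16 kHz mono output format is an assumption rather than something stated above. Only the standard ffmpeg options `-y`, `-loglevel`, `-i`, `-ar`, and `-ac` are used.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def ffmpeg_command(mp3_path, sample_rate=16000):
    """Build the ffmpeg argument list that writes a WAV next to the MP3.

    The 16 kHz mono target is an assumption made for this sketch.
    """
    mp3_path = Path(mp3_path)
    wav_path = mp3_path.with_suffix(".wav")
    return [
        "ffmpeg", "-y", "-loglevel", "error",
        "-i", str(mp3_path),
        "-ar", str(sample_rate),  # output sample rate in Hz
        "-ac", "1",               # single (mono) channel
        str(wav_path),
    ]


def convert_all(data_dir, threads=4):
    """Convert every MP3 under data_dir; return how many conversions succeeded."""
    mp3_files = sorted(Path(data_dir).rglob("*.mp3"))

    def convert(path):
        # ffmpeg runs as a separate process, so threads are enough here.
        return subprocess.run(ffmpeg_command(path), check=False).returncode

    with ThreadPoolExecutor(max_workers=threads) as pool:
        codes = list(pool.map(convert, mp3_files))
    return sum(1 for code in codes if code == 0)
```

Calling `convert_all("<data_dir>", threads=8)` would then play the same role as the shell script's two arguments, under the assumptions stated above.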

Preprocessing the Dataset

After converting all the MP3s to WAVs, you need to preprocess the dataset. You can do so by running the following command:

python preprocess_common_voice.py \
    --data_dir <data_dir> \
    --output_dir <preprocessed_dir>
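
Preprocessing depends on the WAV files produced by the conversion step, so a quick check that every listed clip actually has a WAV file can save a failed run. The sketch below is a hypothetical helper, not the repository's own code. It assumes the usual Common Voice layout: a tab-separated file with a `path` column and a folder holding the clips.

```python
import csv
from pathlib import Path


def split_rows_by_audio(tsv_path, clips_dir):
    """Split the rows of a Common Voice TSV into (present, missing).

    A row counts as present when a WAV file with the same base name as
    its `path` entry exists inside clips_dir.
    """
    present, missing = [], []
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            wav = Path(clips_dir) / Path(row["path"]).with_suffix(".wav").name
            (present if wav.is_file() else missing).append(row)
    return present, missing
```

For example, `present, missing = split_rows_by_audio("<data_dir>/train.tsv", "<data_dir>/clips")` followed by `print(len(missing))` reports how many clips would be unavailable; the file and folder names here are assumptions about the dataset layout.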

Training a Model

To train a simple model, run the following command:

python run_rnnt.py \
    --mode train \
    --data_dir <path to data directory>
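
Training minimizes the RNN-T loss that the setup step builds. For intuition only, the following pure-Python sketch computes that loss with the standard forward recursion over the time/label lattice. It is not the compiled implementation the repository uses; the function names, the nested-list input layout, and the choice of index 0 for the blank symbol are assumptions made for this example.

```python
import math


def logsumexp(a, b):
    """Numerically stable log(exp(a) + exp(b)) that tolerates -inf inputs."""
    if a == -math.inf:
        return b
    if b == -math.inf:
        return a
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))


def rnnt_loss(log_probs, labels, blank=0):
    """Negative log-likelihood of `labels` under an RNN-T output lattice.

    log_probs[t][u][k] is the log-probability of emitting symbol k at
    frame t after u labels have been emitted (T frames, U + 1 label states).
    """
    T, U = len(log_probs), len(labels)
    # alpha[t][u]: log-probability of reaching frame t having emitted u labels.
    alpha = [[-math.inf] * (U + 1) for _ in range(T)]
    alpha[0][0] = 0.0
    for t in range(T):
        for u in range(U + 1):
            if t == 0 and u == 0:
                continue
            via_blank = -math.inf
            via_label = -math.inf
            if t > 0:
                # A blank at (t - 1, u) advances to the next frame.
                via_blank = alpha[t - 1][u] + log_probs[t - 1][u][blank]
            if u > 0:
                # Emitting label u - 1 at frame t advances the label index.
                via_label = alpha[t][u - 1] + log_probs[t][u - 1][labels[u - 1]]
            alpha[t][u] = logsumexp(via_blank, via_label)
    # Every alignment ends with a final blank from the last lattice node.
    return -(alpha[T - 1][U] + log_probs[T - 1][U][blank])
```

The recursion sums over every way of interleaving blanks and labels, which is why no frame-level alignment is needed for training. Production implementations compute the same quantity, together with its gradient, in optimized native code.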

Additional Information

Version: 1.0.0
Type: AI source code
Updated: 2025-01-28
Size: 30.82 KB
Source: GitHub
