vocalize

About

Vocalize is a pronunciation trainer made for language learners.

Vocalize is an application that provides pronunciation training for language learners. The user selects the language they would like to practice, either English or Spanish, and is then presented with practice words. The user can record their pronunciation and submit it for comparison against the average pronunciation of the word. The user's pronunciation is then graphed against the average pronunciation.

The average pronunciation of each word is created by feeding audio from YouTube videos into a custom audio processing pipeline. We first scrape audiobooks from YouTube and submit them to IBM Watson's Speech to Text API for transcription. We then use FFmpeg to create an audio file for each word in the audiobook. When a word appears multiple times, we average the word instances together using a custom Python module built on top of SciPy. We narrow the scope of our data by processing only the 1000 most popular words of each language. Once an average pronunciation has been created for a word, it is stored in Amazon S3.
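The per-word slicing step can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes the transcription step yields (word, start, end) timings in seconds, and the function name `build_clip_commands`, the `keep` set of frequent words, and the output naming scheme are all hypothetical. It only builds FFmpeg argument lists, using the standard `-i`, `-ss`, and `-to` options, and does not run them.

```python
from collections import Counter


def build_clip_commands(source, timings, keep, out_dir="words"):
    """Build one FFmpeg command per occurrence of a word we want to keep.

    source:  path to the full audiobook audio file
    timings: list of (word, start_seconds, end_seconds) tuples (assumed format)
    keep:    set of lowercase words to extract, e.g. a frequent-word list
    """
    seen = Counter()  # counts occurrences so each clip gets a unique name
    commands = []
    for word, start, end in timings:
        word = word.lower()
        if word not in keep or end <= start:
            continue  # skip rare words and empty or inverted intervals
        seen[word] += 1
        out_path = f"{out_dir}/{word}_{seen[word]}.wav"  # hypothetical naming
        commands.append([
            "ffmpeg", "-i", source,
            "-ss", f"{start:.3f}", "-to", f"{end:.3f}",
            out_path,
        ])
    return commands
```

Each returned list could then be passed to `subprocess.run(cmd, check=True)` on a machine where FFmpeg is installed.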

Front End: React.js, React Native, Redux, D3.js
Back End: Node.js, Express, MongoDB, Amazon S3
Audio Processing: Python, SciPy, IBM Watson, FFmpeg
Testing: Chai, Mocha, pytest
Build Tools: Gulp, Browserify, Webpack
Deployment: Digital Ocean

Team

Product Owner: Eugene Krayni
Scrum Master: Andrew Pedley
Development Team Members: Luke Powell, Aaron Phillips, Alex Zywiak

Dependencies

Data Scraping

youtube-dl: brew install youtube-dl

Processing

Processing requires this Python module (on GitHub).

Running

npm install
gulp build
node server.js

Data Scraping

In the data scraping directory you will find Node.js files that scrape YouTube videos (audiobooks) for WAV files of words.

npm install
node index.js scrape <youtube id> <language>

There is also a script, average.sh, that runs the Python scripts to average the words and writes the results to an 'averaged' folder.
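As a rough illustration of what averaging several recordings of one word can look like, the sketch below stretches each clip to a common length with linear interpolation and takes the sample-wise mean. It is a dependency-free stand-in, not the project's SciPy-based module, whose actual method is not described here. The names `resample_linear` and `average_clips` are hypothetical, and clips are assumed to be plain lists of mono samples.

```python
def resample_linear(samples, length):
    """Stretch or shrink a list of samples to `length` points by linear interpolation."""
    if length < 1 or not samples:
        raise ValueError("need at least one sample and a positive length")
    if len(samples) == 1 or length == 1:
        return [float(samples[0])] * length
    step = (len(samples) - 1) / (length - 1)
    out = []
    for i in range(length):
        pos = i * step
        lo = min(int(pos), len(samples) - 2)  # left neighbour, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[lo + 1] * frac)
    return out


def average_clips(clips):
    """Average several recordings of one word after aligning their lengths."""
    if not clips:
        raise ValueError("no clips to average")
    # Assumption: use the mean clip length as the common target length.
    length = round(sum(len(c) for c in clips) / len(clips))
    aligned = [resample_linear(c, length) for c in clips]
    return [sum(column) / len(aligned) for column in zip(*aligned)]
```

Simple length alignment like this ignores differences in timing within a word, so a real pipeline would likely need a more careful alignment step before averaging.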

Additional Information

Version
Type Other source code
Update Time 2025-01-31
size 9.04MB
From Github
