Some of my public work at https://hltcoe.jhu.edu/research/scale/scale-2017/
-
Updated
Jul 27, 2017 - Python
Some of my public work at https://hltcoe.jhu.edu/research/scale/scale-2017/
A selection of voice assistants that I created.
This script takes a manual voice from the microphone of the device used as input and transform it into a string datatype that will be printed on the screen.
Tool tự động gán nhãn dựa trên VAD, sử dụng cho bài tập thu âm và gán nhãn lớp học phần INT3411
A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIVITY DETECTION IN ADVERSE CONDITIONS"
ovos plugin for voice activity detection using webrtcvad
Conan.io package for libfvad project
Download and sync subtitles automatically using Voice Activity Detection
An implementation of SileroVAD as a recognizer for ELAN
A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.
Tr-VAD: An Efficient Transformer based Voice Activity Detection Model
🎙️ Enhanced Speaker Diarisation 📒 with OSD, SS, and Advanced VAD🗣️.
This Script is able to extract Frequency of the voice detected in an audio file (preferred in ".wav" filetype)
Farfield Voice Activity Detection - Academic project
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Demonstration of Hugging Face's (https://huggingface.co/) newly released Wav2Vec2 model for easy, reasonably coherent, Speech to Text!
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
A packaged convolutional voice activity detector for noisy environments.
A statistical model-based Voice Activity Detector
Lightweight CNN for Robust Voice Activity Detection
Add a description, image, and links to the voice-activity-detection topic page so that developers can more easily learn about it.
To associate your repository with the voice-activity-detection topic, visit your repo's landing page and select "manage topics."