libnyumaya_esp32

Experimental support for nyumaya audio recognition on ESP32

Currently only ESP32S3 is supported

Dependencies:

ESP-NN
ESP-DSP

Steps to use your custom model the model:

The model is provided as unsigned char array. You have to provide this array to the function addModelFromBuffer. In this example this array is just included via marvin_model.h and marvin_model.c as source files in the CMakeLists.txt

Limitations:

Currently there is only support for listening to one keyword
The library needs to allocate a 115kB memory block on the IRAM
You need a PSRAM

The wake words are compatible with the corresponding version from our other repository However to make them loadable without a file system they are converted using xxd. xxd -i model.premium > model.cc

VAD:

Voice activity detection is supported for commercial customers. VAD is here defined as follows: Activity shoud be detected if Speech is present in the last 1.2 seconds of audio. Music/Singing/Breathing/Coughing should not count as speech. The output is RAW probabilities and should be filtered in an appropiate way.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
include/esp32s3		include/esp32s3
lib/esp32s3		lib/esp32s3
CMakeLists.txt		CMakeLists.txt
README.md		README.md
component.mk		component.mk

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

libnyumaya_esp32

VAD:

About

Releases

Packages

Languages

nyumaya/libnyumaya_esp32

Folders and files

Latest commit

History

Repository files navigation

libnyumaya_esp32

VAD:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages