Stars
🔊 Text-Prompted Generative Audio Model
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
A self-organizing file system with llama 3
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Noise reduction in Python using spectral gating (speech, bioacoustics, audio, time-domain signals)
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
oyvindln / vhs-decode
Forked from happycube/ld-decode
Software defined VHS decoder - Fork (maybe temporary) of the ld-decode Laserdisc rf decoder
GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.
This sample demonstrates how to use GPT-4 Vision to extract structured JSON data from PDF documents, such as invoices, using the Azure OpenAI Service.