Skip to content

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.

Notifications You must be signed in to change notification settings

otonomee/mic2transcript

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

mic2transcript

mic2transcript is a CLI tool that will continuously transcribe audio from the device's built-in microphone to a text file using OpenAI Whisper.

CLI Inputs

  • Name of output text file
  • Transcription model
    • tiny: Fastest, lowest accuracy. Suitable for quick transcriptions or resource-constrained environments.
    • medium: Balanced performance. Good for general use, offering a trade-off between speed and accuracy.
    • large: Highest accuracy, slowest speed. Best for scenarios requiring maximum transcription quality.

Requirements

  • Python 3.6+
  • sounddevice
  • numpy
  • whisper

Installation

  1. Clone this repository: https://github.com/otonomee/continuous-mic-transcribe
  2. Install the required packages:
pip install -r requirements.txt

Usage

To start the transcription, run the tool from the terminal:

python main.py

You will be prompted to select a Whisper model and specify an output file name for the transcriptions.

The tool will run in the background, transcribing any detected audio into the specified text file. Press 'q' in the terminal to stop the transcription process.

About

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file. Runs in the background, providing an ongoing log of ambient audio as text.

Topics

Resources

Stars

Watchers

Forks

Packages

 
 
 

Languages