Tags · bnosac/audio.whisper

0.4.1

predict.whisper_transcription to predict channel based on VAD (#65)

* add predict.whisper_transcription which allows to assign a transcription segment to either a left/right channel based on a Voice Activity Detection

* Add remote

May 6, 2024
4b5c6a2
zip
tar.gz
Notes

0.4

Allow to transcribe only sections of the audio (#57)

* add argument sections to predict.whisper

* code cleanup

* adding example of sections

* add extra data check on subset.wav such that offsets/durations are not outside of audio range

* VAD on mono

* Align the timestamps based on the removed audio segments again

* play around with 1 millisecond

* more examples on multiple-offsets

* add NEWS item, bump version

* unit test for multiple sections return the part where it has a voice instead of the non-voice and put thses skipped parts at the top

* CI

* put alignment timestamps of transcriptions with skipped segments in separate function

* add utils

* docs

* CI

* docs

* dev code for token_timestamps

* CI

* docs

* CI

* use format instead of timezone

* parse the timestamps with milliseconds

* make sure to parse out the milliseconds + test with token_timestamp

* rename sentences to voiced for later readability

* bump dependency version of data.table to when data.table contains nafill

* there is no guarantee on short segments that whisper.cpp adheres to the requested timeframe - it provides the whole segment - disable that unit test

* docs

* CI

* add segment_offset to keep track of within which offset the segment was part of

* add offset/duration as arguments in predict.whisper, include new column called segment_offset in output by default, make sure examples on stereo are run in language es

* increase duration to make sure we have a segment

* logs

* more liberal unit test

* more liberal unit test

Mar 18, 2024
024c2cd
zip
tar.gz
Notes

0.3.3

bump release to 0.3.3

Mar 16, 2024
f30c8b3
zip
tar.gz
Notes

0.3.2

README bump to 0.3.2

Mar 4, 2024
8d57d02
zip
tar.gz
Notes

0.3.1

README

Feb 5, 2024
c302175
zip
tar.gz
Notes

0.3

Merge pull request #25 from bnosac/upgrade-v1.5.4

upgrade to whisper.cpp version v1.5.4

Jan 27, 2024
47f42e4
zip
tar.gz
Notes

0.2.2

NEWS

Jan 27, 2024
ef0074e
zip
tar.gz
Notes

0.2.1-1

#18 Change URL's to download from Huggingface, deprecate downloading …

…from ggerganov as some models are removed there

Jul 22, 2023
64a0eaf
zip
tar.gz
Notes

0.2.1

README

Mar 19, 2023
ee508e8
zip
tar.gz
Notes

0.2.0

Don't need recent Rcpp

Mar 17, 2023
162575a
zip
tar.gz
Notes

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.4.1

0.4

0.3.3

0.3.2

0.3.1

0.3

0.2.2

0.2.1-1

0.2.1

0.2.0

Tags: bnosac/audio.whisper