From c5515ace19066e938854b4b99e0c2e9bbc2eeb65 Mon Sep 17 00:00:00 2001 From: Konstantin Date: Sat, 22 Jul 2023 15:12:11 +0200 Subject: [PATCH] NuGet package readme and metadata --- WhisperNet/Readme.md | 4 ++-- WhisperNet/WhisperNet.nuspec | 4 ++-- 2 files changed, 4 insertions(+), 4 deletions(-) diff --git a/WhisperNet/Readme.md b/WhisperNet/Readme.md index 1545e23..aed10ce 100644 --- a/WhisperNet/Readme.md +++ b/WhisperNet/Readme.md @@ -5,8 +5,8 @@ The library requires a hardware GPU which supports Direct3D 11.0, a 64-bit Windo The main entry point of the llibrary is `Whisper.Library` static class. Call `loadModel` function from that class to load an ML model from a binary file. -These binary files are available for free download on [Hugging Face](https://huggingface.co/ggerganov/whisper.cpp/tree/main).
-I recommend `ggml-medium.bin` (1.42GB in size, but that web page says 1.53 GB), because I’ve mostly tested the software with that model.
+These binary files are available for free download on [Hugging Face](https://huggingface.co/ggerganov/whisper.cpp/tree/main). +I recommend `ggml-medium.bin` (1.42GB in size, but that web page says 1.53 GB), because I’ve mostly tested the software with that model. Compressed models in ZIP format with `mlmodelc` in the file name are not supported. Once the model is loaded, create a context by calling `createContext` extension method, diff --git a/WhisperNet/WhisperNet.nuspec b/WhisperNet/WhisperNet.nuspec index e7fe354..6d349a2 100644 --- a/WhisperNet/WhisperNet.nuspec +++ b/WhisperNet/WhisperNet.nuspec @@ -8,8 +8,8 @@ https://github.com/Const-me/Whisper High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model - When loading models, adapters can be selected with 0-based index, in addition to the name. - Added an API method to decode initial prompt into array of tokens. + Updated models source URL in the documentation. + Reliability enhancement, microphone capture less likely to transition to “Stalled” state and discard the audio. Copyright © const.me, 2022-2023 whisper, gpgpu, speech recognition