Releases: Sharrnah/whispering-ui
v1.3.6.1
v1.3.6.1 Release of the native UI.
Changelog:
- [FEATURE] Add option to reorder additional language translations
- [FEATURE] Add markdown parsing for Plugin List.
- [FEATURE] Add "Export Last Generation" to TTS Tab
- [TASK] Improve handling of deactivated VAD settings
- [TASK] Add language_detection_on_each_segment setting
- [TASK] Improve backend restart logic
- [TASK] Improve main window recognition for dialog parenting
- [TASK] Remove obsolete validation of Transcript/Translate textareas
- [TASK] Remove DirectSound API from list (Python backend seems incompatible with DirectSound anyway)
- [TASK] Reimplement websocket message read limit
- [TASK] Improve list seperator style
- [BUGFIX] Crash on switching TTS Type in Settings tab of main UI window.
- [BUGFIX] Only show notification of wrong setting combination when changing it after initial profile loading.
- [BUGFIX] TextValueSelect widget invalid defaultIndex error
Full Changelog: v1.3.5.1...v1.3.6.1
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.5.1
v1.3.5.1 Release of the native UI.
Changelog:
- [TASK] Add setting Only_no_speech_threshold_for_segments
- [BUGFIX] Fix typo in setting
- [BUGFIX] possible crash on turbo whisper, mms or wav2vec_bert models.
Full Changelog: v1.3.4.1...v1.3.5.1
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.4.1
v1.3.4.1 Release of the native UI.
Changelog:
- [FEATURE] Added Machine UI Translations (Spanish, Ukrainian, Russian)
- [FEATURE] Support for multi column plugin settings groups
- [TASK] Updated UI Translations
- [TASK] Update F5-TTS compute device selections
- [TASK] Add TTS Volume setting
- [TASK] Add Databinding to custom Entry widget
- [BUGFIX] Correctly display Compute Capability and Memory current/max if detected
- [BUGFIX] Update selected TTS Model Field widget
- [BUGFIX] UI rendering issues
Full Changelog: v1.3.3.1...v1.3.4.1
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.3.1
v1.3.3.1 Release of the native UI.
Changelog:
- [FEATURE] Added Machine UI Translations (Japanese, Chinese, Korean, Polish)
- [FEATURE] Add settings for multi target language translation
- [FEATURE] Add custom model option for user own model usage
- [FEATURE] Add Crisper Whisper model
- [TASK] Updated UI Translations (Czech, Dutch, French, Danish, German, English)
- [TASK] Update Go and modules
- [TASK] Add build pipeline
- [TASK] generalize audio API logic
- [TASK] Add Video Memory fallback reading from Registry
- [TASK] Add new test audio
- [TASK] Add options for F5/E2 TTS model
- [TASK] Add OSC Server options
- [TASK] Change preset names
- [BUGFIX] possible crash on accessing empty completion entry widget
- [BUGFIX] possible crash on empty TextValueSelect widget
- [BUGFIX] loading language options from text translation plugins
- [BUGFIX] Correct use of "auto" or "" keyword for language selections
- [BUGFIX] Linux audio fix
Full Changelog: v1.3.2.4...v1.3.3.1
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.2.4
v1.3.2.4 Release of the native UI.
Changelog:
- [FEATURE] Added UI Translations (English, German, French, Czech)
- [FEATURE] Add Profile Presets.
- [BUGFIX] Initialize Global Fields after translations loaded.
- [BUGFIX] Add missing M4T v2 model size estimate
- [TASK] Some improvements in audio device handling in profile settings
- [TASK] Set default vad_frames_per_buffer to 512
- [TASK] Update libraries
- [TASK] Show VRAM estimate even if available VRAM was not detected
- [TASK] Set fallback language to english
Full Changelog: v1.3.2.3...v1.3.2.4
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.2.3
v1.3.2.3 Bugfix Release of the native UI.
Changelog:
- [FEATURE] Direct-ML support. (Should allow to run many AI models on any DirectX 12 comatible GPU, including Intel and AMD)
- [FEATURE] Implement Plugins Update check on Startup
- [TASK] Add new Application Settings Tab
- [TASK] Small UI improvements on Plugin Listing
- [TASK] Show "Disabled" if Noise Filter is disabled in Settings.
- [TASK] Small theme changes
- [TASK] Update fyne to v2.5.0
Full Changelog: v1.3.2.2...v1.3.2.3
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.2.2
v1.3.2.2 Bugfix Release of the native UI.
Changelog:
- [TASK] Added more error logging to Profile Settings window.
- [BUGFIX] possible crash on switching Audio API's (Thanks to Vecken for reporting it ❤️)
Full Changelog: v1.3.2.1...v1.3.2.2
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.2.1
v1.3.2.1 Release of the native UI.
Changelog:
- [FEATURE] Ask for filepath before TTS generation and send together with request.
- [TASK] Enable 'thread_per_transcription' by default again.
- [TASK] Rename crash.log to error_ui.log
- [TASK] Change App ID to website domain
- [TASK] Add missing MMS model estimates
- [BUGFIX] read metadata from build toml again
- [BUGFIX] Open dialogs in correct plugin window.
Full Changelog: v1.3.1.3...v1.3.2.1
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.3.1.3
v1.3.1.3 Release of the native UI.
Changelog:
- [FEATURE] Add status bar
- [FEATURE] Add energy + pause message on push2talk setup.
- [FEATURE] Support Password textfield for Plugin Settings.
- [FEATURE] Add support for audio_device selection widget for Plugins
- [FEATURE] Add Clipboard usage options in Context menu for TTS Tab.
- [FEATURE] Add clipboard image button for OCR
- [FEATURE] Add Open in window buttons for plugin settings
- [TASK] Fetch plugin list from new plugins github repo https://github.com/Sharrnah/whispering-plugins
- [TASK] Add speaker change recognition experimental settings
- [TASK] Add Noise filter settings
- [TASK] Add Transcription List item sizing
- [TASK] Add "switch language" Icon
- [TASK] Remember "Only show enabled Plugins" Setting over restarts.
- [TASK] Add support of clipboard text content in OCR tab
- [TASK] Rename OCR buttons
- [TASK] use romaji setting for text-translation requests
- [TASK] Added plugin textfield widget
- [TASK] Rename "Text-to-Speech Enable" checkbox to "Integrated *.." to make it more clear
- [TASK] Websocket message handling rewrite
- [TASK] Reorder some Profile settings
- [TASK] Add some more info texts to settings
- [TASK] Add send to both (tts + OSC) option in context menu
- [TASK] Replace Transcription List with non Databinding variant
- [TASK] Remove embedded font
- [TASK] Change symbols for enabled/disabled Plugins (because of non-embedded font)
- [TASK] Add Remove + Find AdditionalMenuItem functions to custom entry widget
- [TASK] Add Website link and hide Repository Links behind Accordion in About Tab And Profile window
- [TASK] Add Loading dialogs by parsing log entries instead of websocket messages.
- [TASK] Ignore loading messages for recentLog and writing log.txt
- [TASK] Show "Enabled" / "Disabled" text for checkboxes.
- [TASK] Remove obsolete "Enabled" text for Integrated TTS option.
- [BUGFIX] removed zip renaming after download (hopefully fixes 'in use' errors)
- [BUGFIX] stt_type setting naming mistake
- [BUGFIX] Fill Log on display of Logs tab with latest Log entries.
- [BUGFIX] Clear Log text on Log open before filling with RecentLog
- [BUGFIX] Make the multi threaded message calls thread safe
- [BUGFIX] ignore percentages in transcripts for status bar
- [BUGFIX] possible error on loading message.
Full Changelog: v1.2.1.1...v1.3.1.3
List of available Plugins: https://github.com/Sharrnah/whispering-plugins/blob/main/README.md
v1.2.1.1
v1.2.1.1 Release of the native UI.
Changelog:
- [TASK] Add supporter link
- [TASK] correctly set location plugin file dialogues show on open
- [TASK] Add M4T Large V2 model option
- [TASK] Add Whisper Transformer option
- [TASK] Make language code mapping case insensitive
- [TASK] Improve source language combo widget initialization
- [TASK] Improve websocket code (use custom dialer)
- [TASK] Improve runBackend conditions
- [TASK] Update to Golang 1.21
- [TASK] Migrate settings to Profiles subfolder
- [TASK] Allow Plugin fetching from regular github urls
- [TASK] Allow downloading without hash-check
- [TASK] Seperate OCR translation fields from other text-translation fields
- [TASK] Add 4/8 bit quantization options
- [TASK] Add 8bit + 4bit estimates
- [TASK] hide speechTask for wav2vec models
- [TASK] Add setting to run transcriptions in seperate or single thread
- [TASK] Add memory estimates for wav2vec + NeMo Canary
- [TASK] Use target language field for NeMo Canary model
- [TASK] Use correct default frames per buffer
- [TASK] Add option to detect venv environment to run via UI
- [BUGFIX] possible issue on plugin settings line counting
- [BUGFIX] Update font version with Korean characters included
- [BUGFIX] possible freeze on preview loading
- [BUGFIX] possible race condition crash if loading state was displayed before main ui
Full Changelog: v1.2.0.8...v1.2.1.1
List of available Plugins: https://github.com/Sharrnah/whispering/blob/main/documentation/plugins.md