Skip to content

Releases: openly-jp/voiscribe

First Open Beta Release

02 Mar 17:19
72ae24a

Choose a tag to compare

Pre-release

Full Changelog: v0.1.0-beta...v0.2.1-beta

First Beta Release Candidate

01 Mar 17:34
9c427c1

Choose a tag to compare

Pre-release
First Beta Release Candidate (#253)

* recognition accuracy  improvement (#168)

* update whisper.cpp and activate non_speech_token suppression

* add prompt reset feature

* modify remaining audio concatination

* fix to non-block the recording on background (#183)

* fix to non-block the recording on background

* fix to distinguish inactive and background

* format

* set max width (#185)

* set max width

* apply formatter

* fixes based on comments

* modified to show ongoing recognition results (#160)

* modified to show ongoing recognition results

* changed to pop up alert when recognition is on going

* format

* add Package.resolved (#186)

* add Package.resolved

* add .gitignore itself

* add missing file (#189)

* Add info page (#178)

* add app info menu bar

* show webpages with in-app browser

* format

* add padding

* update terms of service url

* fixes based on commnets

* additional fixes

* specify fonts

* Activate swipe gesture for side menu closing (#181)

* add smooth sidemenu close feature

* fix to activate record button

* format

* prooperly added package.resolved to gitignore (#197)

* model download feature (#174)

* add pull request template (#77)

* add model path

* add whisper model definition

* add download function

* move url list

* add repository functions

* add missing self

* enable itunes file-sharing and file app integration

* initiate whisper model in menu view

* make WhisperRecognizer accept WhisperModel

* resolve compile errors

* return URL type

* fix error for guard statment

* fix error

* replace model name with WhisperModel

* uppercase struct names

* change @AppStorage objects

* make storage-key as String

* add initilizers

* fix type error

* fix assignment errors

* fix display name

* remove unused property

* fix key names

* add missing args

* fix wrong operator

* refactoring audio player (#95)

* fix typos

* remove `playerWrapper` and use raw `AVAudioPlayer`

* format

* remove  state

* add whisperModel to workspace

* add whisperModelRepository to repo

* make whisperModel initializer accept callback function so that recognizer's model loading occurs after the completion of download

* change model name

* update recognizer.whisperModel outside model initialization in order to avoid 'published property cannot be updated outside main thread' error

* load models when they already exist

* set whisperModel in initialization

* explicitly set whisperModel path in initialization

* add code for error handling

* Revert "enable itunes file-sharing and file app integration"

This reverts commit f5d7d2d9a44a8cce242659492d2c7aee2ad1936d.

* apply formatter

* extract downloader

* format

* free whisperContext before load

* format

* fix comment format

* undo change in buildConfiguration

* fix some appearance

* checkmarks for downloads work somehow

* swap wrong icons

* refactor

* show prompt for downloading

* separate model download and model loading

* to avoid annoying confirmation whenever I submit testflight builds

* change icon size

* beatify progress bar design

* change progress bar color

* progress bar working

* reorganize callback

* update value in main thread

* delete model gesture working

* delete model functino complete

* remove cloud icons for tiny models

* fix selected-recognizer-reseted-bug #125

* load models asynchronously

* remove unused suscess

* disable delete while loading

* add clearer var name

* remove unnecesarry Dispatch

* add ggml-tiny.multi.bin

* format

* add new formatting rule to limit max width to 120

* reformat

* update formtter version

* reformat

* change bundle identifier

* add tiny model

* remove inject from dependencies

* add medium model

* remove unnecesarry files

* remove wrongly enalbed UIApplicationSupportsMultipleScenes

* translate english comments into japanese ones

* disanle multiwindows

* Co-authored-by: Takehisa Oyama <[email protected]>

* update file info

* simplify logic

* use callback for initializing recognizer on app start

* remove unused needssubscription

* format

* remove optional flag

* some fixes

* refactoring

* fix bugs

* fix bug

---------

Co-authored-by: Takehisa Oyama <[email protected]>
Co-authored-by: ooyamatakehisa <[email protected]>

* Improvement/splash screen (#212)

* splash-screen improvement

* splash-screen

* rolled back entitlements file

* rollback HomeView

* rollback

* roll back unnecessary

* format

* audio-sharing feature (#194)

* audio-sharing

* formatted

* change button's position

* format

* レビューを反映しました。

* Update ToolBar.swift

* audio bar width restore

* fix view issues (#201)

* re-add resource to make preview work

* remove animation from stop button

* resolve moving red circle animation issue

* remove extra space above navigation bar

* manage preview codes

* set circle height

* fix too big waveform

* change app name

* set `maxAmp` to 0 when starting recording

---------

Co-authored-by: shibukazu <[email protected]>

* delete developer page (#204)

* delete developer page

* remove StreamingRecognition row in project.pbxproj

---------

Co-authored-by: shibukazu <[email protected]>

* Add english localization (#221)

* add english localization

* enable language switching

* add alert for potential data loss

* add message for macOS

* add missing translation

* hide switch item when there is only one display language installed on device

* fixes based on comments

* additional fixes

* remove language switch feature

* add missing translation

* change display name to VoiScribe

* update initial image (#233)

* update icons (#235)

* language and model select pane (#223)

* delete Package.resolved

* add partial view to set recognition model and language before start recording

* change to use presets and fix each box size

* enabled user to download model from sheet

* fix design and add model size information

* refactoring

* delete unused files

* fix to share model download state

* fix to share isDownloading state

* removed tiny models and changed default model

* add recommend tag

* fix to prohibit model change when recognition on progress

* fix folder name

* format

* fix for mac catalyst

* fix displayed model size

* add localization

* removed recognition frequency sec item in sidemenu

* remove model load feature from sidemenu

* format

* add license information

* fix design for english environment

* change default recognition language and model based on device setting

* format

* fix to show sheet in stead of partial sheet in iPad

* format

* add defer statement

* format

* fix to init isDownloading value as false when app relaunched

* add error handling

* remove white blank from splash-screen (#244)

* remove white blank from splash-screen

* temporarily change github actions

* fix swift-format version

* suppress hallucination and improved audio-transcript-time matching (#243)

* add missing translation (#245)

* change model to int4 quantization model (#246)

* update whisper library

* change small-en and medium-en to quantized model

* change model megabytes calculation

* format

* replace car with hare (#248)

* Update README.md

* change non_speech_token_suppression true to false for Japanese (#252)

---------

Co-authored-by: shibukazu <[email protected]>
Co-authored-by: Takehisa Oyama <[email protected]>
Co-authored-by: ooyamatakehisa <[email protected]>
Co-authored-by: Hwari2929 <[email protected]>
Co-authored-by: shibukazu <[email protected]>

Initial release

11 Feb 13:02
aa648c3

Choose a tag to compare

Initial release Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: https://github.com/openly-jp/whisper-ios/commits/v0.1.0-beta