Managed to make long dictations even >10mins appear in < 2seconds by pushing what is possible with current STT models.
All processing done locally with 0 network calls.
Managed to make long dictations even >10mins appear in < 2seconds by pushing what is possible with current STT models.
All processing done locally with 0 network calls.