Skip to content

Add timestamping to speech to text#742

Open
msluszniak wants to merge 33 commits intomainfrom
@ms/add-transcription-to-stt
Open

Add timestamping to speech to text#742
msluszniak wants to merge 33 commits intomainfrom
@ms/add-transcription-to-stt

Conversation

@msluszniak
Copy link
Member

@msluszniak msluszniak commented Jan 21, 2026

Description

Introduces a breaking change?

  • Yes
  • No
    This PR introduces breaking change as now the return type from transcribe and stream methods are based on TranscriptionResult type. Also now there is no commited / nonCommited properties of hook. stream now is async generator.

Type of change

  • Bug fix (change which fixes an issue)
  • New feature (change which adds functionality)
  • Documentation update (improves or adds clarity to existing documentation)
  • Other (chores, tests, code style improvements etc.)

Tested on

  • iOS
  • Android

Testing instructions

  • Run demo app in apps/speech and run transcription for both time stamping and regular mode (both from url and from real time audio to test both transcribe and stream methods).
  • Run voice chat in apps/llm to check if transcription appears. NOTE This example seems to be a bit buggy.
  • You need to run this on android device since this PR also fixes Speech to Text demo app in case of using physical android device. Earlier, required permissions for microphone weren't granted and the example effectively didn't work.
  • Check that documentation for modified sections is updated and that api reference is correct as well.

Screenshots

Related issues

Checklist

  • I have performed a self-review of my code
  • I have commented my code, particularly in hard-to-understand areas
  • I have updated the documentation accordingly
  • My changes generate no new warnings

Additional notes

@msluszniak msluszniak self-assigned this Jan 21, 2026
@msluszniak msluszniak added the feature PRs that implement a new feature label Jan 21, 2026
@msluszniak msluszniak marked this pull request as draft January 21, 2026 14:58
@msluszniak msluszniak linked an issue Jan 21, 2026 that may be closed by this pull request
@msluszniak msluszniak marked this pull request as ready for review January 21, 2026 18:43
@msluszniak msluszniak marked this pull request as draft January 29, 2026 10:25
@msluszniak msluszniak force-pushed the @ms/add-transcription-to-stt branch from 85fe485 to b4d17ad Compare February 2, 2026 11:15
@msluszniak msluszniak force-pushed the @ms/add-transcription-to-stt branch from 6c9e0a2 to cdc4f7b Compare February 6, 2026 16:19
@msluszniak msluszniak marked this pull request as ready for review February 6, 2026 19:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

feature PRs that implement a new feature

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add speech to text timestamping

2 participants