It can be solved with speaker segmentation/embedding models, although it is not ...

mijoharas · 2025-07-30T11:57:38 1753876658

Oh awesome, I was reading through to see whether it had speaker diarization (which is why I got rid of the whisper script I was using).

I'll look forward to the Linux version.

Is there any chance of a headless mode? (I.e. start, and write transcript to stdout with some light speaker diarization markup. e.g. "Speaker1: text")
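
To make the requested output shape concrete, here is a minimal sketch of the "Speaker1: text" markup, assuming a diarizer that yields (speaker, text) segments. `Segment` and `format_transcript` are hypothetical names for illustration only and are not part of Hyprnote or whisper.

```python
import sys
from typing import Iterable, List, NamedTuple, Optional


class Segment(NamedTuple):
    # Hypothetical diarizer output: a 1-based speaker index plus the spoken text.
    speaker: int
    text: str


def format_transcript(segments: Iterable[Segment]) -> List[str]:
    """Render segments as 'SpeakerN: text' lines, merging consecutive
    segments that come from the same speaker."""
    lines: List[str] = []
    last_speaker: Optional[int] = None
    for seg in segments:
        text = seg.text.strip()
        if not text:
            continue  # skip empty segments (e.g. silence)
        if seg.speaker == last_speaker:
            lines[-1] += " " + text
        else:
            lines.append(f"Speaker{seg.speaker}: {text}")
            last_speaker = seg.speaker
    return lines


if __name__ == "__main__":
    demo = [
        Segment(1, "Shall we start?"),
        Segment(1, "I have the agenda."),
        Segment(2, "Go ahead."),
    ]
    # A headless mode could stream lines like these to stdout.
    for line in format_transcript(demo):
        sys.stdout.write(line + "\n")
```

Merging consecutive same-speaker segments keeps the markup light, so a downstream script can split each line on the first ": ".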

yujonglee · 2025-07-30T15:37:04 1753889824

> Is there any chance of a headless mode?

Maybe. We might be able to add an extension system where each extension has access to that info and can do whatever it wants within the app.

> I'll look forward to the Linux version.

https://github.com/fastrepl/hyprnote/issues/67 We have an open issue. You might want to subscribe to it!

apwell23 · 2025-07-29T18:22:53 1753813373

Our conference rooms even have some sort of rotating camera contraption that automatically focuses on the person speaking.