pona-a 12 days ago

I've actually done that same concept a while back when whisper.cpp came out. A significant challenge is sane paragraph segmentation, as even humans don't often agree on the best place for a line break. I wonder what approach you've used.

  • ldenoue 12 days ago

    I've adopted a very simple approach: 80 words per "paragraph". I am now experimenting with computing the embeddings of each sentence and try to detect topic segments. But the simple approach yields pleasant segments AFAIK.

mredigonda 12 days ago

This is absolutely amazing, the fact that you can click on text and it takes you immediately to the part of the audio where that text is being said is great.

I wish the video was shown as well, but other than that, excellent work!

  • ldenoue 12 days ago

    The site embeds the YouTube video so we can hear the audio, so I could show the video as well, just not sure where :)

    Perhaps split in 2 columns on wide displays (video left, transcript right), and portrait (e.g. mobile phone), video on top and transcript below it?

pixelmoth 13 days ago

What am I meant to once I have highlighted text? The export buttons don't even use them

  • ldenoue 13 days ago

    The pdf should show them as bold. I’ll see how to add the highlights to the markdown and html. Hard for text (maybe use * around each?)

    You can share the link as it contains the list of timestamps corresponding to your highlights.

    So others or yourself can see them again.

    • ldenoue 12 days ago

      Just updated the app so export to PDF now shows your highlights colored.

kamikaz1k 13 days ago

Just seeing loading spinner…not obvious what I should be doing

  • ldenoue 13 days ago

    The first time the deep net needs to load. I should add a progress of some sort. Thanks