feat: option to pipe text-to-speech result to GPT for post processing #70

peritus · 2024-11-19T11:58:22Z

New feature:

Take the text result from whisper
feed it to GPT4-o with a user defined prompt
use that to create the final markdown document

so that one does not only get the verbatim text but a full fledged note that is usable for copy-pasting, integration in other notes or to keep as is in another place in the vault.

Example for a post processing prompt:

Please respond in english. The following text is a transcript of a voice message. Format it in markdown, giving it a clear structure while keeping it as accurate as possible. Add a summary section at the beginning. If you come across to-do lists, format them as Markdown task lists with "[ ]". Avoid more than one level of nesting wherever possible. Finally, render the transcript exactly as spoken, with one sentence per line.

Settings:

drkpxl · 2024-11-22T14:56:02Z

I would love this merged in. I'm new to Obsidian plugins so is there a way to add this PR as a plugin directly so I don't need to wait and hope this get merged into mainline?

heimoshuiyu · 2024-12-05T05:34:11Z

Love this PR

But I think it should allow users to customize the API URL and the model. These are the two most basic parameters.

In addition to that, allowing users to set the temperature and other parameters (I believe the temperature is quite important)
This would make it better 👍

ezuk · 2025-04-04T19:03:45Z

@nikdanilov hey Nik, just figured I'd follow up on this PR since it's been open for a little while — this is my "one crucial feature" for the plugin so I wanted to ask. :) Would you be able to review? Thank you for the plugin!

vlietz · 2025-05-22T20:56:06Z

@nikdanilov hey Nik, just figured I'd follow up on this PR since it's been open for a little while — this is my "one crucial feature" for the plugin so I wanted to ask. :) Would you be able to review? Thank you for the plugin!

Just wanted to echo this — I’d also really love to see this PR approved. It’s a key feature for me too. Appreciate all the work on the plugin!

ezuk · 2025-05-22T22:26:22Z

@nikdanilov hey Nik, just figured I'd follow up on this PR since it's been open for a little while — this is my "one crucial feature" for the plugin so I wanted to ask. :) Would you be able to review? Thank you for the plugin!

Just wanted to echo this — I’d also really love to see this PR approved. It’s a key feature for me too. Appreciate all the work on the plugin!

fwiw, what I ended up doing is going with VoiceInk. It's a one-time purchase, $20 (or you can build it from source since it's open source), works systemwide, and supports "AI enhancement" (ChatGPT post processing) with several different providers and models.

peritus · 2025-06-30T19:18:28Z

@vlietz @ezuk @heimoshuiyu @drkpxl @nikdanilov

I've just open sourced https://github.com/peritus/obsidian-content-pipeline — which builds on top of the idea that made me submit this pull request, but in a more configurable way, enabling more complex workflows.

It's still a little rough around the edges (haven't submitted this to the official plugin repository yet, but plan to) .. happy to get feedback whether this solves your use cases as well.

feat: option to pipe text-to-speech result to GPT for post processing

319039c

drkpxl approved these changes Nov 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: option to pipe text-to-speech result to GPT for post processing #70

feat: option to pipe text-to-speech result to GPT for post processing #70

Uh oh!

peritus commented Nov 19, 2024

Uh oh!

drkpxl commented Nov 22, 2024

Uh oh!

heimoshuiyu commented Dec 5, 2024

Uh oh!

ezuk commented Apr 4, 2025

Uh oh!

vlietz commented May 22, 2025

Uh oh!

ezuk commented May 22, 2025

Uh oh!

peritus commented Jun 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

feat: option to pipe text-to-speech result to GPT for post processing #70

Are you sure you want to change the base?

feat: option to pipe text-to-speech result to GPT for post processing #70

Uh oh!

Conversation

peritus commented Nov 19, 2024

Uh oh!

drkpxl commented Nov 22, 2024

Uh oh!

heimoshuiyu commented Dec 5, 2024

Uh oh!

ezuk commented Apr 4, 2025

Uh oh!

vlietz commented May 22, 2025

Uh oh!

ezuk commented May 22, 2025

Uh oh!

peritus commented Jun 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants