Wine voice search using Whisper, DuckDB, and Metaphone

Howdy Streamlit folks,

I prototyped a hands-free inventory counting system for sommeliers recently.

A thorny part of this problem is inaccurate transcriptions in voice search. How do you handle situations where “Chateau Champignon” is transcribed as “shadow champagne on”?!

I have a demo of the voice search on Streamlit cloud: https://voice-search-with-whisper-duckdb-and-metaphone.streamlit.app/.

The code is all on Github: GitHub - voberoi/voice-search-with-whisper-duckdb-and-metaphone: This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https://vikramoberoi.com/helping-sommeliers-inventory-wine-faster-with-whisper-duckdb-and-metaphone/

I also wrote a blog post to contextualize the problem and explain how this works here: Helping sommeliers inventory wines faster with Whisper, DuckDB, and Metaphone

Cheers,
Vikram

1 Like

This is wild! Great work

Thank you @samthedataman!