Howdy Streamlit folks,
I prototyped a hands-free inventory counting system for sommeliers recently.
A thorny part of this problem is inaccurate transcriptions in voice search. How do you handle situations where “Chateau Champignon” is transcribed as “shadow champagne on”?!
I have a demo of the voice search on Streamlit cloud: https://voice-search-with-whisper-duckdb-and-metaphone.streamlit.app/.
The code is all on Github: GitHub - voberoi/voice-search-with-whisper-duckdb-and-metaphone: This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https://vikramoberoi.com/helping-sommeliers-inventory-wine-faster-with-whisper-duckdb-and-metaphone/
I also wrote a blog post to contextualize the problem and explain how this works here: Helping sommeliers inventory wines faster with Whisper, DuckDB, and Metaphone