How to play an audio file automatically (generated using text-to-speech) in Streamlit?

ayanatherate · November 11, 2022, 6:01am

I’m working on an application that will have an automatic text-to-speech feature. I have a piece of text that I’ve converted to speech using Google’s text-to-speech library. I know that st.audio() can display a player for the generated audio file, but I want the audio to play automatically as soon as it is generated, without the user having to click anything.

Any ideas on how to implement this?
Thanks!

blackary · November 11, 2022, 2:59pm

Hi @ayanatherate,

You can accomplish this by base64-encoding the generated file and embedding it in an HTML audio tag, similar to the method you may have come across in this issue.

import base64

import streamlit as st


def autoplay_audio(file_path: str):
    # Read the audio file as raw bytes.
    with open(file_path, "rb") as f:
        data = f.read()
    # Embed the audio in the page as a base64 data URI.
    b64 = base64.b64encode(data).decode()
    md = f"""
        <audio controls autoplay="true">
        <source src="data:audio/mp3;base64,{b64}" type="audio/mp3">
        </audio>
        """
    # unsafe_allow_html is required so Streamlit renders the raw <audio> tag.
    st.markdown(md, unsafe_allow_html=True)


st.write("# Auto-playing Audio!")

autoplay_audio("local_audio.mp3")

ayanatherate · December 14, 2022, 8:12am

I’m a bit late but thanks a lot for the solution. It worked for me perfectly!

One problem I’m facing is that whenever the text to be converted to speech changes, the speech output doesn’t change automatically. I have to refresh the page somehow to play the audio file again. Is there a way to resolve that? That would be really helpful!

Thanks!

blackary · December 20, 2022, 3:09pm

Hi @ayanatherate, could you share a code snippet that shows this issue? Are you using st.experimental_memo to cache the autoplay_audio function, perhaps?

goldengrape · May 31, 2023, 8:57pm

I think what you probably need is a callback, which is called when something changes.

https://gist.github.com/goldengrape/84ce3624fd5be8bc14f9117c3e6ef81a

langchain_stream_in_streamlit

from langchain.callbacks.base import BaseCallbackHandler
import azure.cognitiveservices.speech as speechsdk
import os
import base64
import time
class StreamDisplayHandler(BaseCallbackHandler):
    def __init__(self, container, initial_text="", display_method='markdown'):
        self.container = container
        self.text = initial_text
        self.display_method = display_method

This is a callback demo I wrote using LangChain. When you get a streamed answer from ChatGPT, the callback is called once for each token. When the answer accumulates to one full sentence, I send it to Azure’s text-to-speech service to generate speech and play it.
I haven’t tried Google text-to-speech, but I think you can set on_change on the st.text_input that takes the user’s input, and have it call the callback.
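The "accumulate tokens until there is one sentence" step could look roughly like the sketch below. The gist above is truncated, so this is not its actual code: SentenceBuffer, feed and flush are hypothetical names, and the rule "a sentence ends at ., ! or ? followed by whitespace" is a simplifying assumption that will mis-split abbreviations such as "e.g. this".

```python
import re
from typing import List

# Assumed rule: a sentence ends at '.', '!' or '?' followed by whitespace.
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


class SentenceBuffer:
    """Hypothetical helper: collects streamed tokens and emits whole sentences."""

    def __init__(self) -> None:
        self._pending = ""

    def feed(self, token: str) -> List[str]:
        """Add one token and return any sentences that are now complete."""
        self._pending += token
        parts = _SENTENCE_END.split(self._pending)
        # The last piece may still be an unfinished sentence, so keep it.
        self._pending = parts.pop()
        return [p.strip() for p in parts if p.strip()]

    def flush(self) -> str:
        """Return whatever is left once the stream has ended."""
        rest, self._pending = self._pending.strip(), ""
        return rest
```

In a per-token callback, each sentence returned by feed() would be sent to the text-to-speech service, and flush() would be called once when the stream ends so that the last sentence is spoken too.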

Another trick here is to use HTML to embed the base64-encoded audio so that it can be played with st.markdown. Maybe this is what you need to autoplay audio:

# audio_stream: the raw audio bytes (e.g. from the text-to-speech call)
audio_base64 = base64.b64encode(audio_stream).decode('utf-8')
audio_tag = f'<audio autoplay="true" src="data:audio/wav;base64,{audio_base64}"></audio>'
st.markdown(audio_tag, unsafe_allow_html=True)
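For audio that is generated in memory rather than saved to disk, the same trick can be wrapped in a small helper. This is only a sketch: build_autoplay_tag and its parameters are hypothetical names, and "audio/mp3" is simply the type used earlier in this thread. It puts src on the audio element itself. The HTML specification notes that changing a source child after it has been inserted has no effect, so this form may behave better when the audio changes between reruns, though that is untested here.

```python
import base64
import html


def build_autoplay_tag(audio_bytes: bytes, mime_type: str = "audio/mp3",
                       controls: bool = False) -> str:
    """Return an <audio> tag that embeds audio_bytes as a base64 data URI."""
    if not audio_bytes:
        raise ValueError("audio_bytes is empty")
    b64 = base64.b64encode(audio_bytes).decode("ascii")
    controls_attr = " controls" if controls else ""
    # src is set on <audio> directly rather than on a <source> child.
    safe_mime = html.escape(mime_type, quote=True)
    return f'<audio autoplay{controls_attr} src="data:{safe_mime};base64,{b64}"></audio>'


# In a Streamlit script (mp3_bytes is whatever your TTS library returned):
#   import streamlit as st
#   st.markdown(build_autoplay_tag(mp3_bytes), unsafe_allow_html=True)
```

Browsers can still block autoplay until the user has interacted with the page, so the first playback may not start on a freshly opened tab.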

system · June 6, 2023, 8:02am

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.
