New Component: streamlit-webrtc, a new way to deal with real-time media streams

@Om_Surushe Hi, I think it is not possible with the current version; the issue linked below will cover it. Please be patient.

Thank you for your reply. I figured out a way to pass my source video frames using a lock, but the same approach did not work with audio:

import threading

import av
from streamlit_webrtc import WebRtcMode, webrtc_streamer

# lock guarding the state shared with the callback threads
lock = threading.Lock()
video = {
    'video_frame': None,
    'video_count': -1,
    'audio_frame': None,
    'audio_count': -1,
}


def video_frame_callback(frame: av.VideoFrame) -> av.VideoFrame:
    # with lock:
    #     frame_list = video['video_frame']
    #     count = video['video_count']
    
    # if count == -1:
    #     container = av.open('question_0.avi')
    #     frame_list = list(container.decode(video=0))
    #     count = 0
    #     with lock:
    #         video['video_frame'] = frame_list
    #         video['video_count'] = count
    # else:
    #     count += 1
    #     with lock:
    #         video['video_count'] = count

    # if count >= len(frame_list):
    #     count = 0
    #     with lock:
    #         video['video_count'] = count
    # print("video ",count)
    # return frame_list[count]
    return frame

def audio_frame_callback(frame: av.AudioFrame) -> av.AudioFrame:
    # with lock:
    #     frame_list = video['audio_frame']
    #     count = video['audio_count']
    
    # if count == -1:
    #     container = av.open('question_0.wav')
    #     frame_list = list(container.decode(audio=0))
    #     count = 0
    #     with lock:
    #         video['audio_frame'] = frame_list
    #         video['audio_count'] = count
    # else:
    #     count += 1
    #     with lock:
    #         video['audio_count'] = count

    # if count >= len(frame_list):
    #     count = 0
    #     with lock:
    #         video['audio_count'] = count
    # print("audio ",count)
    # print(type(frame_list[count]))
    # return frame_list[count]
    return frame

ctx = webrtc_streamer(
    key="omg",
    video_frame_callback=video_frame_callback,
    audio_frame_callback=audio_frame_callback,
    media_stream_constraints={
        "video": True,
        "audio": True,
    },
    rtc_configuration={
        "iceServers": [{"urls": ["stun:stun.l.google.com:19302"]}]
    },
    mode=WebRtcMode.SENDRECV,
)

Here, question_0.avi is my source file.


@Om_Surushe
I see.
If it is OK to upload video and audio from the client and then ignore them, the SENDRECV mode can be used just as you did.

In that case, the callback must return a frame object whose properties are the same as the input frame's, because its original purpose is to transform the input into the output. For audio, for example, those properties include the number of channels and the sampling rate.
I guess this is why your code didn't work.

As I am not an audio expert, I don’t know the best practices for manipulating such props, but I used pydub for it in an audio example linked below, FYI.
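
For illustration, here is a rough sketch (not the pydub example referenced above) of returning a replacement audio frame whose properties match the input. The packed "s16" format is an assumption about what the browser sends, so check frame.format.name in practice:

import av
import numpy as np

def audio_frame_callback(frame: av.AudioFrame) -> av.AudioFrame:
    # Replace the incoming audio with silence while keeping the frame
    # properties consistent with the negotiated track.
    channels = len(frame.layout.channels)
    # Packed 16-bit "s16" is assumed here; inspect frame.format.name first.
    silence = np.zeros((1, frame.samples * channels), dtype=np.int16)
    new_frame = av.AudioFrame.from_ndarray(silence, format="s16", layout=frame.layout.name)
    new_frame.sample_rate = frame.sample_rate
    new_frame.pts = frame.pts
    new_frame.time_base = frame.time_base
    return new_frame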

Hello @whitphx,
Is this supported now?
I checked the link below, but I am not sure about it.
Thank you.

No, it's rather tracked in this issue, as written in New Component: streamlit-webrtc, a new way to deal with real-time media streams - #128 by whitphx, and it's not available yet.


It seems like GitHub - whitphx/streamlit-stt-app: Real time web based Speech-to-Text app with Streamlit (https://whitphx-streamlit-stt-app-app-deepspeech-m6tt1k.streamlit.app/) is not working on Streamlit Cloud. It was working until yesterday, but something went wrong and now it does not work.

@Vishnu_Teja Thank you for the report.
It may be due to streamlit-webrtc is not working and it is not due to component · Issue #6330 · streamlit/streamlit · GitHub, which is under investigation and not yet fixed. Please track that issue.

WebRTC apps hosted on the Community Cloud had been broken as reported in Inconsistent issue with streamlit-webrtc in streamlit app · Issue #1213 · whitphx/streamlit-webrtc · GitHub, but they are working again after a fix.
Please let me know if anything is still broken.

@Vishnu_Teja The STT app should also work now. Please check it :slight_smile:

Hey all, I'm creating a web app that recognizes emotion from real-time video using the DeepFace library. I am able to get the webcam activated and run real-time analysis on my local computer. This works perfectly while the camera is running, but when I hit the Stop button, I receive an error about setting the detected dominant emotion on st.session_state["user_emotion"]. I am able to set the "user_emotion" session-state variable to the detected emotion until I hit the Stop button. My error is given below:

Traceback (most recent call last):
  File "/Users/v.esau.hutcherson/.local/share/virtualenvs/StreamLit-ohTsyygW/lib/python3.10/site-packages/streamlit/runtime/scriptrunner/script_runner.py", line 565, in _run_script
    exec(code, module.__dict__)
  File "/Users/v.esau.hutcherson/StreamLit/pages/listings.py", line 146, in <module>
    if st.session_state["user_emotion"] == "neutral" or "suprised" or "happy":
  File "/Users/v.esau.hutcherson/.local/share/virtualenvs/StreamLit-ohTsyygW/lib/python3.10/site-packages/streamlit/runtime/state/session_state_proxy.py", line 90, in __getitem__
    return get_session_state()[key]
  File "/Users/v.esau.hutcherson/.local/share/virtualenvs/StreamLit-ohTsyygW/lib/python3.10/site-packages/streamlit/runtime/state/safe_session_state.py", line 111, in __getitem__
    raise KeyError(key)
KeyError: 'user_emotion'

My code for the webrtc_streamer and for the deep face integration is implemented like this:

import threading

import cv2
import streamlit as st
from deepface import DeepFace
from streamlit_webrtc import webrtc_streamer

# lock guarding the image shared between the callback thread and the script
lock = threading.Lock()
img_container = {"img": None}

face_cascade = cv2.CascadeClassifier("haarcascade_frontalface_default.xml")

def video_frame_callback(frame):
    img = frame.to_ndarray(format="bgr24")
    with lock:
        img_container["img"] = img
    return frame

frame_rate = 1
ctx = webrtc_streamer(
    key="example",
    video_frame_callback=video_frame_callback,
    media_stream_constraints={
        "video": {"frameRate": {"ideal": frame_rate}},
    },
    video_html_attrs={
        "style": {"width": "50%", "margin": "0 auto", "border": "5px purple solid"},
        "controls": False,
        "autoPlay": True,
    },
)

if "emotion" not in st.session_state:
    st.session_state["emotion"] = ""

while ctx.state.playing:
    with lock:
        img = img_container["img"]
    if img is None:
        continue
    emotion_data = DeepFace.analyze(img_path=img, actions=['emotion'], enforce_detection=False)
    if emotion_data != []:
        st.session_state["emotion"] = emotion_data[0]["dominant_emotion"]

The data that I receive from the DeepFace.analyze method is given like this:

[
  {
    "emotion": {
      "angry": 0.0645486346911639,
      "disgust": 0.0000023556083306175424,
      "fear": 0.0018471573639544658,
      "happy": 95.05292773246765,
      "sad": 0.23144783917814493,
      "surprise": 0.10018055327236652,
      "neutral": 4.549040272831917
    },
    "dominant_emotion": "happy",
    "region": {
      "x": 206,
      "y": 103,
      "w": 241,
      "h": 241
    }
  }
]

I assumed I should always be able to access the analyzed dominant emotion via emotion_data[0]["dominant_emotion"] and assign it to the st.session_state["user_emotion"] variable. However, when the camera runs for around 30 seconds or more, I receive an error saying the st.session_state variable "user_emotion" does not exist. Does anyone know of a fix?
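
A hedged observation, not a confirmed fix: the snippet above initializes st.session_state["emotion"], while the traceback reads st.session_state["user_emotion"], so after the Stop button triggers a rerun that key may never have been set. A defensive sketch (the default value is hypothetical):

# Sketch only: default the key before reading it.
if "user_emotion" not in st.session_state:
    st.session_state["user_emotion"] = "neutral"  # hypothetical default

# Note: `x == "neutral" or "suprised" or "happy"` is always truthy in Python;
# a membership test is probably what the line in the traceback intends.
if st.session_state["user_emotion"] in ("neutral", "surprised", "happy"):
    ...  # hypothetical page logic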

Hello @whitphx ,

I have a small question: how can I use webrtc_streamer to access the image frame and also the frame number at the same time, as I need to pass both into a function? Also, I need to stop the stream at frame_number == 23.

Thank you ,

Hi,

So I have tried this, however I get this error message: AttributeError: 'list' object has no attribute 'render'

The entire code:

import numpy as np
import cv2
import av
from ultralytics import YOLO
from streamlit_webrtc import webrtc_streamer

model = YOLO('yolov8n-seg.pt')


def video_frame_callback(frame):
    image = frame.to_ndarray(format="bgr24")

    results = model(image)
    output_img = np.squeeze(results.render())
    #output_img = np.squeeze(results.render()[0])

    return av.VideoFrame.from_ndarray(output_img, format="bgr24")


webrtc_streamer(key="example",
                video_frame_callback=video_frame_callback,
                media_stream_constraints={"video": True, "audio": False})

Kindly assist.

Hi @whitphx, thank you for the great work you have done!
I noticed there are video_frame_callback and audio_frame_callback in the Callbacks. Is there a way to deal with both video and audio in a single callback? My intention is to process the input audio, and transform it into a streaming video, if there is no such callbacks, is there any work-around to deal with that?

Thanks very much !!!

@Weimeng_Luo I commented in Is there a way to process both video and audio in one callback? · Issue #1329 · whitphx/streamlit-webrtc · GitHub which is the same topic. Thanks

@AfroLogicInsect Looks like the error message tells the exact reason…? I'm not familiar with the ultralytics package. You should check the type of results.
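
For what it's worth, a hedged sketch of one possible fix: in the ultralytics package, model(image) returns a list of Results objects, which have no .render() method (that belongs to the older yolov5 hub API); Results.plot() draws the predictions and returns an annotated BGR ndarray:

def video_frame_callback(frame):
    image = frame.to_ndarray(format="bgr24")
    results = model(image)          # a list of ultralytics Results objects
    output_img = results[0].plot()  # .plot() returns the annotated BGR image
    return av.VideoFrame.from_ndarray(output_img, format="bgr24")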

@stanny370599 Hi, there is no built-in frame counter. You should implement it yourself by incrementing a counter variable in a callback.
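
A minimal sketch of that suggestion (all names are illustrative): count frames inside the callback and share the counter with the main script via a lock; the script can then poll the counter, e.g. to react once it reaches 23.

import threading

import av
from streamlit_webrtc import webrtc_streamer

lock = threading.Lock()
state = {"frame_number": 0}

def video_frame_callback(frame: av.VideoFrame) -> av.VideoFrame:
    with lock:
        state["frame_number"] += 1
        n = state["frame_number"]
    # my_function(frame, n)  # hypothetical function taking the frame + number
    return frame

ctx = webrtc_streamer(key="frame-counter", video_frame_callback=video_frame_callback)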

@Esau_Hutcherson I can’t find what’s wrong. What’s the code consuming st.session_state["user_emotion"]?

Hi, thanks for a great framework. I'm just getting started. I need some support, please: how can I launch the application without pressing the "Start" button?

@leggion Hi,
setting the desired_playing_state argument to True can do it.

↓This sample helps.
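
A minimal sketch of that option (the key name is arbitrary):

from streamlit_webrtc import webrtc_streamer

# desired_playing_state=True asks the component to start streaming
# immediately, without waiting for the user to press "Start".
webrtc_streamer(
    key="autostart",
    desired_playing_state=True,
)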


Is it possible to somehow set the resolution of the frame inside recv(frame)?
Maybe there is a way to set enableCpuOveruseDetection = false?

I am trying to take a snapshot of the frame, and the resolution is low quality on mobile.

Thanks
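
For reference, a hedged sketch of one common approach: rather than changing the resolution inside the callback, request a higher capture resolution from the browser via media_stream_constraints (standard getUserMedia constraints; the values are illustrative and the browser may not honor them):

from streamlit_webrtc import webrtc_streamer

webrtc_streamer(
    key="high-res",  # hypothetical key
    media_stream_constraints={
        "video": {
            "width": {"ideal": 1920},   # requested, not guaranteed
            "height": {"ideal": 1080},
        },
        "audio": False,
    },
)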