After a video conference is established, a video conferencing system receives a video stream from the participants of the video conference. One of the received video streams is transmitted to the conference participants (e.g. the video stream of the person currently speaking). The video conferencing system monitors a second one (or typically all) of the received video streams to determine if a designated video event (e.g. someone raising his hand) has occurred. If the designated video event has occurred, the video conferencing system switches or supplements the currently transmitted video stream to or with the second one of the received video streams that contains the designated video event. The second video stream is then transmitted to participants in the video conference. The switching of the video stream and the switching of the audio stream may be independent.
Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.
1. A system for changing transmission of a video stream comprising: a stream monitor configured to receive a plurality of video streams that comprise a video conference including a first and a second video stream, and monitor the second video stream to determine whether a designated video event has occurred in the second video stream; and a stream transmitter configured to transmit the first video stream, and to transmit the second video stream in response to a determination that the designated video event has occurred in the second video stream, wherein the second video stream is transmitted in place of or simultaneously with the first video stream; and wherein the video stream is dynamically selected for transmission, the plurality of video streams each further comprises an audio stream, and wherein the stream monitor is further configured to monitor each audio stream of the plurality of video streams for a designated audio event, and dynamically select the first video stream for transmission based on detection of the designated audio event.
A video conferencing system automatically switches the displayed video stream based on audio and video events. It receives multiple video streams (including audio) from participants. The system transmits one video stream to all participants, initially determined by audio: it monitors each audio stream for a designated sound event (e.g., someone speaking) and selects that person's video stream for broadcast. Simultaneously, the system monitors all video streams for designated video events (e.g., raising a hand). If a video event occurs in a different video stream, the system switches the broadcast to that video stream, either replacing or displaying it alongside the original video stream.
2. The system of claim 1 , wherein the stream monitor is further configured to determine if a dynamically selected video stream is the second video stream, monitor the second video stream to determine if the second video stream's designated video event is no longer important, and responsive to a determination that the dynamically selected video stream is not the second video stream and that the second video stream's designated video event is no longer important, select the first video stream for transmission.
Building upon the video conferencing system from the previous description: the system determines if the currently displayed video stream is showing someone who triggered an event. It monitors the video stream of that person. If the trigger is no longer important and the system is not currently displaying that person's video, the system switches back to the original video stream selected based on audio, dynamically selecting the first video stream for transmission based on detection of the designated audio event.
3. The system of claim 1 , wherein the stream monitor is further configured to monitor the second video stream to determine if the second video stream's designated video event is no longer important, and the stream transmitter transmits the first video stream in response to a determination that the second video stream's designated video event is no longer important.
In the video conferencing system described earlier: the system constantly monitors the video stream of the currently displayed person, watching for when their triggering action is no longer important (e.g., hand lowered). When that event is no longer detected in their video stream, the system switches the broadcast back to the original video stream, dynamically selected for transmission based on detection of the designated audio event.
4. The system of claim 3 , wherein the stream monitor is further configured to use hysteresis to determine if the second video stream's designated event is no longer important.
Using the switching system from the previous description: the system employs hysteresis when determining if the video event is no longer important. This means it requires a stronger or more sustained absence of the triggering event before switching back to the primary video stream. For example, instead of immediately switching back when a hand is lowered, it waits for a slightly longer period or requires the hand to be lowered further before reverting to the previous speaker's video.
5. The system of claim 1 wherein a communication device is configured to receive the transmission of the first video stream, receive the transmission of the second video stream, display the received first video stream, and display the received second video stream.
The video conferencing system includes a communication device (e.g., a computer, tablet, or phone) that receives and displays the selected video streams. The device receives both the main video stream (e.g., the current speaker) and the stream triggered by a video event (e.g., hand raise). The device then displays both of these video streams, allowing users to see both the primary speaker and the person who triggered the event.
6. The system of claim 1 , wherein at least one of the display of the first video stream and the display of the second video stream is a window within a screen displaying the video conference.
The video conferencing system described previously displays video streams in windows on a screen. The main video stream and the video stream triggered by a video event each occupy a window. These windows are displayed within the larger screen that shows the video conference, allowing multiple participants to view multiple active screens.
7. The system of claim 1 , wherein the video event is based on at least one gesture selected from the group comprising: a raising of a hand, a shaking of a head, a shaking motion, a smile, a frown, a sign in a sign language, and a nodding motion.
In the video conferencing system, the trigger for switching video streams is based on detecting specific gestures. These gestures include raising a hand, shaking a head, a general shaking motion, smiling, frowning, making a sign in sign language, and nodding. The system uses video processing to recognize these gestures in the incoming video streams and trigger the switch accordingly.
8. The system of claim 1 , wherein at least one of the plurality of video streams is an item selected from the group comprising: an analog video signal, a digital video signal, a packet stream, and a non-packet stream.
The video streams used in the video conferencing system can be of various types. These include analog video signals, digital video signals, packet streams (like RTP), and non-packet streams. The system is designed to handle and process these different types of video streams to monitor for the trigger events and switch between them.
9. The system of claim 1 , wherein the stream monitor is further configured to buffer the second video stream to capture the video event.
The video conferencing system includes a buffering mechanism for the video stream containing the designated video event. When the video event is detected (e.g., a hand raise), the system buffers a short segment of the video stream around the event, capturing the action and ensuring that it is displayed to the conference participants even if the event is brief.
10. The system of claim 1 , wherein a communication device is further configured to determine if the video event has occurred, send the video event in one of the plurality of video streams to the stream monitor in response to determining that the video event has occurred, and not send the one of the plurality of video streams in response to determining that the video event has not occurred.
In the video conferencing system, the remote devices determine if a video event occurred. If the device detects the video event, it sends a video stream to the video conferencing system. If the device doesn't detect the video event, then it does not send the video stream.
11. The system of claim 1 , wherein the stream monitor is further configured to switch the transmission of the first video stream to the transmission of the second stream in response to the determination that the dynamically selected video stream is not the second video stream.
The video conferencing system dynamically selects which video stream is being transmitted. If the system is not transmitting the video stream of the person who caused an event, the system immediately begins transmitting the video stream of the person who caused the event.
12. A method for changing transmission of a video stream comprising: receiving, at a stream monitor, a plurality of video streams that comprise a video conference; transmitting, from a stream transmitter, a first video stream of the plurality of video streams; monitoring a second video stream of the plurality of video streams to determine if a designated video event has occurred in the second video stream; and responsive to determining that the designated video event has occurred in the second video stream, transmitting from the stream transmitter, the second video stream, wherein the second video stream is transmitted in place of or simultaneously with the first video stream; and wherein the first video stream is dynamically selected, and wherein the plurality of video streams each further comprise an audio stream, and further comprising the steps of: monitoring each audio stream of the plurality of video streams for a designated audio event; and dynamically selecting the first video stream based on an occurrence of the designated audio event.
A method for dynamically switching the video stream during a video conference involves receiving multiple video streams (with audio) from participants at a monitor. The system initially transmits a first video stream, determined by audio: the system monitors each audio stream for a designated sound event (e.g., someone speaking) and selects that person's video stream for broadcast. Simultaneously, it monitors other video streams for a designated video event (e.g., raising a hand). If a video event is detected in another stream, it switches the transmitted stream to the stream with the video event, replacing or displaying it alongside the original.
13. The method of claim 12 , further comprising the steps of: monitoring the second video stream to determine if the second video stream's designated video event is no longer important; and responsive to the second video stream's designated video event no longer being important, selecting the first video stream for transmission.
The video stream switching method from the previous description also includes monitoring the video stream of the currently displayed person. It watches for when their triggering action is no longer important (e.g., hand lowered). When that event is no longer detected in their video stream, the system switches back to the original video stream, dynamically selecting the first video stream for transmission based on detection of the designated audio event.
14. The method of claim 13 , wherein determining if the second video stream's designated event is no longer important is based on hysteresis.
In the video stream switching method, determining when a video event is no longer important uses hysteresis. This means requiring a stronger or more sustained absence of the triggering event before switching back to the primary video stream, dynamically selecting the first video stream for transmission based on detection of the designated audio event.
15. The method of claim 12 , further comprising the steps of: receiving the transmission of the first video stream and the transmission of the second video stream; and displaying the received first video stream and the received second video stream.
The video stream switching method includes receiving both the main video stream (e.g., the current speaker) and the stream triggered by a video event (e.g., hand raise). The method displays both of these video streams, allowing users to see both the primary speaker and the person who triggered the event.
16. The method of claim 12 , wherein the display of the first video stream or the display of the second video stream is a window within a screen displaying the video conference.
In the video stream switching method, the display of the first video stream or the second video stream takes place within a window within the main conference screen.
17. The method of claim 12 , wherein the video event is based on at least one gesture selected from the group comprising: a raising of a hand, a shaking of a head, a shaking motion, a smile, a frown, a sign in a sign language, and a nodding motion.
In the video stream switching method, the video event that triggers switching the displayed video stream can be based on gestures. These gestures include raising a hand, shaking a head, a general shaking motion, smiling, frowning, making a sign in sign language, and nodding.
18. The method of claim 12 , wherein at least one of the plurality of video streams is an item selected from the group comprising: an analog video signal, a digital video signal, a packet stream, and a non-packet stream.
In the video stream switching method, the video streams can be one of a number of different items. These items include an analog video signal, a digital video signal, a packet stream, and a non-packet stream.
19. The method of claim 12 , further comprising the step of: buffering the second video stream to capture the video event.
The video stream switching method also includes buffering a section of video containing the event. This ensures the transmission includes the event.
20. The method of claim 12 , further comprising the steps of: determining in a communication device if the video event has occurred; responsive to determining that the video event has occurred, sending the video event in a video stream; and responsive to determining that the video event has not occurred, not sending the video stream.
In the video stream switching method, the local devices determine if a video event has occurred. If the device detects the video event, it sends a video stream to the video conferencing system. If the device doesn't detect the video event, then it does not send the video stream.
21. The method of claim 12 , further comprising the step of: switching the transmission of the first video stream to the transmission of the second stream in response to the determination that the dynamically selected video stream is not the second video stream.
The video stream switching method dynamically selects which video stream is being transmitted. If the system is not transmitting the video stream of the person who caused an event, the system immediately begins transmitting the video stream of the person who caused the event.
22. A means for changing transmission of a video stream comprising: means for receiving a plurality of video streams that comprise a video conference; means for transmitting a first video stream of the plurality of video streams; means for monitoring a second video stream of the plurality of video streams to determine if a designated video event has occurred in the second video stream; and means responsive to determining that the designated video event has occurred in the second video stream, for transmitting the second video stream, wherein the second video stream is transmitted in place of or simultaneously with the first video stream; and wherein the first video stream is dynamically selected, and wherein the plurality of video streams each further comprise an audio stream, and further comprising the steps of: monitoring each audio stream of the plurality of video streams for a designated audio event; and dynamically selecting the first video stream based on an occurrence of the designated audio event.
This describes a means for achieving the video stream switching method. It comprises means for receiving multiple video streams (with audio) from participants, means for initially transmitting a first video stream (determined by audio), means for monitoring for a designated video event in the incoming video streams, and means for switching the transmitted stream to the stream with the video event when triggered, replacing or displaying it alongside the original. The first video stream is dynamically selected and audio is used to determine which video stream should be the first to be displayed.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
February 27, 2009
July 2, 2013
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.