A multimedia processing resource (MPR) includes interactive voice response capabilities. A user may control MPR commands and functions over a telephone interface. For example, through a telephone connection a user may receive a listing of electronic programming guide data, may schedule a multimedia program for recording, or may receive a report related to a multimedia program that is viewed at the present time.
Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.
1. A set top box, comprising: a processor; a network interface configured to receive, via a residential gateway, multimedia content from a multimedia service provider; a video decoder configured to decode the multimedia content to generate a video stream suitable for displaying on a display device; a telephone interface comprising: a wireless telephone interface configured to receive calls to a dedicated phone number associated with the set top box; and a second telephone interface configured to receive calls routed from the residential gateway based on a telephone number extension associated with the set top box; wherein the telephone interface is configured to obtain caller identification information indicative of a telephone number associated with a caller of a particular call, and voice input provided during the particular call; and a computer readable medium including program instructions, executable by the processor, wherein the program instructions, when executed by the processor, cause the processor to perform operations comprising: retrieving voice recognition data specific to a caller setting associated with the caller identification information; processing the voice input based, in part, on the voice recognition data to recognize voice commands from the caller; recognizing, in the voice input, a presentation setting; sending the presentation setting to a secondary video decoder of a secondary set top box; recognizing content criteria indicated by the voice input; accessing, within the set top box, electronic programming guide data listing multimedia programs satisfying the content criteria; generating speech output indicative of the multimedia programs satisfying the content criteria; and receiving at least one of the multimedia programs satisfying the content criteria.
A set-top box allows users to control it using voice commands via a telephone. It has a processor, a network interface to receive multimedia content from a provider, and a video decoder to display the content. It includes a wireless telephone interface that receives calls to a dedicated number and a second telephone interface that receives calls routed from a residential gateway based on an extension. The set-top box identifies the caller's phone number and captures their voice input. It retrieves voice recognition data specific to the caller, and processes the voice input to recognize voice commands. The voice commands include presentation settings for a secondary set-top box. The user can also specify content criteria to filter the electronic programming guide, which is then read back to the user. Finally, the set-top box receives the selected multimedia content.
2. The set top box of claim 1 , wherein the telephone interface includes a microphone and is configured to receive local voice input from a user.
The set-top box described previously, which responds to voice commands received over a phone line, also includes a microphone. This microphone allows the set-top box to receive local voice input directly from a user in the same room, in addition to voice commands coming in via telephone.
3. The set top box of claim 1 , wherein the telephone interface is configured to recognize dual-tone multi-frequency commands.
The set-top box described previously, which responds to voice commands received over a phone line, is also able to recognize dual-tone multi-frequency (DTMF) commands, such as those generated by pressing the buttons on a telephone keypad. This allows users to interact with the set-top box using both voice and traditional touch-tone input.
4. The set top box of claim 1 , wherein the content criteria indicate a genre.
The set-top box described previously, which responds to voice commands and filters electronic programming guide (EPG) data based on content criteria specified by the user, allows the user to specify a genre as the content criteria. For example, the user could say "Find me comedies" and the set-top box would list comedies in the EPG.
5. The set top box of claim 1 , wherein the operations include: determining whether a received voice print matches a stored voice print.
The set-top box described previously, which responds to voice commands received over a phone line, can determine if a received voice print (a unique identifier of a voice) matches a stored voice print. This allows the set-top box to authenticate users based on their voice, providing personalized experiences or restricting access to certain features.
6. The set top box of claim 1 , wherein the network interface is an IP network interface, the multimedia content comprises a set of IP packets, and the particular call comprises a set of IP packets.
The set-top box described previously, which responds to voice commands received over a phone line, uses an IP network interface to receive multimedia content. The multimedia content is transmitted as IP packets. The phone call itself also uses IP packets, meaning it is a Voice over IP (VoIP) call.
7. The set top box of claim 1 , wherein the telephone interface is configured to support an IP-based communication session.
The set-top box described previously, which responds to voice commands received over a phone line, is configured to support an IP-based communication session for handling telephone calls.
8. The set top box of claim 7 , wherein the IP-based communication session includes a session initiation protocol session.
The IP-based communication session used by the set-top box described previously to receive voice commands is a Session Initiation Protocol (SIP) session. SIP is a standard protocol for initiating, maintaining, and terminating real-time sessions that include voice, video and messaging applications.
9. The set top box of claim 1 , wherein the operations include: scheduling a recording of a portion of the multimedia content; reviewing a scheduled recording; identifying a currently viewed multimedia program; changing a lock status of at least one of: the set top box and the secondary set top box; changing a channel; adjusting a video quality of a multimedia program presentation; and adjusting an audio volume of a multimedia program presentation.
The set-top box described previously, which responds to voice commands received over a phone line, can also perform the following operations via voice command: schedule a recording of multimedia content; review scheduled recordings; identify a currently viewed multimedia program; change a lock status of either the set-top box itself or a secondary set-top box; change the channel; adjust the video quality of a multimedia program; and adjust the audio volume of a multimedia program.
10. A method of providing multimedia content, the method comprising: responsive to receiving, by a wireless telephone interface of a set top box, a wireless telephone call to a dedicated telephone number associated with the set top box, performing particular operations; and responsive to receiving, by a second telephone interface of the set top box, a routed telephone call routed from a residential gateway based on a telephone number extension associated with the set top box, performing the particular operations; wherein the particular operations include: accessing caller identification information associated with a particular call; retrieving voice recognition data specific to a caller setting associated with the caller identification information; processing voice input, provided by a caller during the particular call, based, in part, on the voice recognition data to recognize voice commands from the caller; recognizing, in the voice input, a presentation setting; and sending the presentation setting to a video decoder of a secondary set top box; recognizing programming criteria indicated by the voice input; accessing, within the set top box, electronic programming guide data listing multimedia programs containing content satisfying the programming criteria; generating speech output indicative of the multimedia programs satisfying the programming criteria; and receiving at least one of the multimedia programs satisfying the programming criteria.
A method for providing multimedia content involves a set-top box with a wireless telephone interface that receives calls to a dedicated number and a second telephone interface that receives calls routed from a residential gateway based on an extension. Upon receiving a call on either interface, the method performs the following operations: accessing caller identification information, retrieving voice recognition data specific to the caller, processing voice input to recognize voice commands, recognizing a presentation setting, and sending the presentation setting to a secondary set-top box. It also recognizes programming criteria, accesses electronic programming guide data to list matching programs, generates speech output of these programs, and receives a selected program.
11. The method of claim 10 , wherein the programming criteria are indicated in the voice input.
In the method described previously for providing multimedia content via voice commands, the programming criteria (e.g., genre, actor) used to filter the electronic programming guide is specified directly in the user's voice input during the phone call. For example, the user might say "Find me action movies".
12. The method of claim 10 , wherein the programming criteria are indicated, at least in part, by dual tone multi-frequency signals.
In the method described previously for providing multimedia content via voice commands, the programming criteria (e.g., channel number) used to filter the electronic programming guide is indicated, at least in part, by dual-tone multi-frequency (DTMF) signals. This means the user can press buttons on the phone keypad to enter the criteria, in addition to or instead of speaking.
13. The method of claim 10 , wherein the programming criteria includes broadcast time criteria.
In the method described previously for providing multimedia content via voice commands, the programming criteria used to filter the electronic programming guide includes broadcast time criteria. This allows users to specify when they want to watch a program, such as "Find programs airing tonight."
14. The method of claim 10 , further comprising changing a lock status of at least one of the set top box and the secondary set top box in response to recognizing a lock status request.
The method described previously for providing multimedia content via voice commands includes the step of changing a lock status of either the set-top box itself or a secondary set-top box in response to recognizing a lock status request in the voice input. This allows parents to remotely lock or unlock channels.
15. The method of claim 10 , further comprising, responsive to recognizing a presentation setting request, adjusting a presentation setting selected from an audio volume setting, an image quality setting, and a current channel setting.
The method described previously for providing multimedia content via voice commands includes, in response to recognizing a presentation setting request in the voice input, adjusting a presentation setting. The adjustable settings include audio volume, image quality, and the current channel. This allows users to remotely control these settings using their voice.
16. A non-transitory computer readable medium including program instructions, executable by a processor, wherein the program instructions, when executed by the processor, cause the processor to perform operations comprising: responsive to receiving, by a wireless telephone interface of a set top box, a wireless telephone call to a dedicated telephone number associated with the set top box, performing particular operations; and responsive to receiving, by a second telephone interface of the set top box, a routed telephone call routed from a residential gateway based on a telephone number extension associated with the set top box, performing the particular operations; wherein the particular operations include: accessing caller identification information associated with a particular call; retrieving voice recognition data specific to a caller setting associated with the caller identification information; processing voice input, provided by a caller during the particular call, based, in part, on the voice recognition data to recognize voice commands from the caller; recognizing, in the voice input, a presentation setting; and sending the presentation setting to a video decoder of a secondary set top box; recognizing programming criteria indicated by the voice input; accessing, within the set top box, electronic programming guide data listing multimedia programs containing content satisfying the programming criteria; generating speech output indicative of the multimedia programs satisfying the programming criteria; and receiving at least one of the multimedia programs satisfying the programming criteria.
A computer-readable medium contains instructions that, when executed by a processor, enable a set-top box to respond to voice commands received over a phone. The set-top box has a wireless telephone interface that receives calls to a dedicated number and a second telephone interface that receives calls routed from a residential gateway based on an extension. The instructions perform the following operations: accessing caller identification information, retrieving voice recognition data specific to the caller, processing voice input to recognize voice commands, recognizing a presentation setting, and sending the presentation setting to a secondary set-top box. It also recognizes programming criteria, accesses electronic programming guide data to list matching programs, generates speech output of these programs, and receives a selected program.
17. The computer readable medium of claim 16 , wherein the presentation setting is selected from a settings group of an audio volume setting, an image quality setting, and a current channel setting.
The computer-readable medium described previously, containing instructions for voice-controlled set-top box functionality, allows the presentation setting (which is sent to a secondary set-top box) to be selected from a group of settings. This group includes audio volume, image quality, and the current channel setting.
18. The computer readable medium of claim 16 , wherein the programming criteria are indicated in the voice input.
The computer-readable medium described previously, containing instructions for voice-controlled set-top box functionality, takes the programming criteria used to filter the electronic program guide directly from the user's voice input.
19. The computer readable medium of claim 16 , wherein the programming criteria are indicated, at least in part, by dual tone multi-frequency signals.
The computer-readable medium described previously, containing instructions for voice-controlled set-top box functionality, allows the programming criteria to be indicated, at least in part, by dual-tone multi-frequency (DTMF) signals generated by pressing buttons on a telephone.
20. The computer readable medium of claim 16 , wherein the programming criteria includes broadcast time criteria.
The computer-readable medium described previously, containing instructions for voice-controlled set-top box functionality, uses broadcast time as one of the criteria for filtering the electronic program guide.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
November 9, 2015
April 11, 2017
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.