Patentable/Patents/US-11968410
US-11968410

Systems and methods to insert supplemental content into presentations of two-dimensional video content based on intrinsic and extrinsic parameters of a camera

PublishedApril 23, 2024
Assigneenot available in USPTO data we have
Inventorsnot available in USPTO data we have
Technical Abstract

Systems and methods for inserting supplemental content into presentations of two-dimensional video content based on the intrinsic and extrinsic parameters of a camera are disclosed. Exemplary implementations may: obtain two-dimensional video content depicting a three-dimensional space; obtain supplemental content; obtain values of intrinsic parameters for the camera; obtain values of extrinsic parameters for the camera; identify presentation locations in the three-dimensional space based on the two-dimensional video content; determine integration information for the supplemental content based on the values of intrinsic and extrinsic parameters; modify the two-dimensional video content to include the supplemental content at the identified presentation location in accordance with the integration information; and/or perform other operations.

Patent Claims
16 claims

Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.

Claim 2

Original Legal Text

2. The system of claim 1, wherein the three-dimensional space includes one or more visible physical features, and wherein the presentation location is a surface on a visible physical feature within the three-dimensional space or is a portion of airspace within the three-dimensional space.

Plain English Translation

This invention relates to a system for presenting digital content within a three-dimensional space, addressing the challenge of integrating virtual elements with real-world environments in a seamless and contextually relevant manner. The system enables the display of digital content at specific locations within a three-dimensional space, which may include visible physical features such as walls, objects, or other surfaces. Alternatively, the presentation location can be a portion of airspace within the space, allowing for mid-air projections or augmented reality overlays. The system dynamically determines the optimal placement of digital content based on the spatial characteristics of the environment, ensuring that the content is visually accessible and appropriately positioned relative to physical features. This approach enhances user interaction by anchoring digital content to real-world references, improving spatial awareness and immersion. The system may also adapt to changes in the environment, such as moving objects or shifting lighting conditions, to maintain the integrity of the presentation. By leveraging visible physical features or designated airspace, the system provides a flexible framework for integrating digital content into physical spaces, supporting applications in augmented reality, virtual reality, and interactive displays.

Claim 4

Original Legal Text

4. The system of claim 1, wherein the integration information includes the presence of one or more occluding physical features within the three-dimensional space that occludes or partially occludes the identified presentation location from the field of view of the camera.

Plain English Translation

This invention relates to a system for integrating virtual content into a real-world environment using a camera. The system addresses the challenge of accurately placing and rendering virtual objects in a physical space while accounting for real-world obstructions that may block the view of the virtual content from the camera's perspective. The system includes a camera that captures images of a three-dimensional space and identifies a presentation location for displaying virtual content. The system then determines integration information, which includes detecting the presence of physical features within the space that may occlude or partially occlude the identified presentation location from the camera's field of view. These occluding features can be objects, surfaces, or other elements that interfere with the visibility of the virtual content when viewed through the camera. By analyzing the captured images, the system identifies these occlusions and adjusts the rendering of the virtual content accordingly. This ensures that the virtual content appears correctly positioned in the real-world environment, even when partially or fully hidden by physical objects. The system may modify the visibility, transparency, or other visual properties of the virtual content to account for the occlusions, providing a more realistic and seamless integration of virtual and real elements. This improves the user experience in augmented reality applications by maintaining visual coherence and accuracy in mixed reality environments.

Claim 5

Original Legal Text

5. The system of claim 4, wherein, responsive to the identified presentation location being occluded or partially occluded, the modified two-dimensional video content includes occluding or partially occluding the supplemental content at the identified presentation locations from the field of view of the camera, such that the supplemental content appears to be occluded or partially occluded by the one or more occluding physical features defined by the integration information.

Plain English Translation

This invention relates to augmented reality (AR) systems that overlay supplemental digital content onto a real-world environment captured by a camera. The problem addressed is ensuring that the supplemental content appears naturally integrated with the physical environment, particularly when real-world objects occlude or partially occlude the intended presentation locations of the digital content. The system dynamically modifies the two-dimensional video content to match the occlusion state of the physical environment. When a presentation location for the supplemental content is determined to be occluded or partially occluded by physical features (e.g., objects, obstacles, or environmental elements), the system adjusts the digital content to appear similarly occluded or partially occluded in the video output. This adjustment is based on integration information that defines the physical features causing the occlusion. The result is a seamless AR experience where the digital content visually aligns with the real-world obstructions, enhancing realism and user immersion. The system may also track the camera's field of view and dynamically update the occlusion adjustments as the camera moves or the environment changes. This approach improves the coherence of AR displays by ensuring digital elements interact realistically with the physical world.

Claim 6

Original Legal Text

6. The system of claim 1, wherein the values of the intrinsic and extrinsic parameters of the camera change based on the progression of the two-dimensional video content.

Plain English Translation

This invention relates to a camera system with adaptive intrinsic and extrinsic parameter adjustments for two-dimensional video content. The system dynamically modifies camera parameters in response to the progression of the video, ensuring optimal capture conditions. Intrinsic parameters, such as focal length, aperture, and sensor sensitivity, are adjusted to maintain image quality as lighting or scene conditions change. Extrinsic parameters, including camera position, orientation, and movement, are altered to follow or frame subjects accurately. The system analyzes the video content in real-time to determine when parameter adjustments are necessary, ensuring seamless transitions without disrupting the viewing experience. This adaptive approach enhances visual consistency and reduces the need for manual adjustments, making it suitable for applications like surveillance, film production, and augmented reality. The invention improves upon static camera setups by continuously optimizing parameters to match evolving content, addressing challenges in maintaining high-quality video capture under varying conditions.

Claim 7

Original Legal Text

7. The system of claim 1, wherein the supplemental content includes image content, video content, and text content.

Plain English Translation

This invention relates to a system for delivering supplemental content to users, addressing the challenge of providing diverse and engaging additional material alongside primary content. The system dynamically generates and presents supplemental content in multiple formats, including images, videos, and text, to enhance user experience and engagement. The supplemental content is tailored to the user's context, preferences, or interactions, ensuring relevance and personalization. The system may analyze user behavior, content consumption patterns, or external data sources to determine the most appropriate supplemental content to display. By integrating various media types, the system ensures that users receive a rich and varied experience, whether they prefer visual, auditory, or textual information. The supplemental content can be displayed alongside or embedded within primary content, such as articles, videos, or applications, to provide additional context, entertainment, or utility. The system may also adapt the format or delivery of supplemental content based on user feedback or real-time interactions, optimizing engagement and satisfaction. This approach improves user retention and interaction by offering a more immersive and personalized content experience.

Claim 8

Original Legal Text

8. The system of claim 1, wherein the intrinsic parameters include at least one of focal length, focal point, aperture, and aspect ratio of the field of view.

Plain English Translation

The invention relates to a system for capturing and processing images, particularly focusing on the calibration and optimization of camera parameters. The system addresses the challenge of accurately determining and adjusting intrinsic camera parameters, which are essential for high-quality image capture and subsequent processing tasks such as object detection, tracking, and 3D reconstruction. Intrinsic parameters, including focal length, focal point, aperture, and aspect ratio of the field of view, define the geometric and optical properties of the camera. These parameters influence how light is projected onto the image sensor and affect the accuracy of downstream applications. The system includes a calibration module that measures and adjusts these intrinsic parameters to ensure optimal performance. By dynamically adjusting focal length, aperture, and aspect ratio, the system can adapt to varying environmental conditions and imaging requirements. The focal point is also calibrated to ensure sharpness and clarity in the captured images. The system may further include an image processing module that uses these calibrated parameters to enhance image quality, correct distortions, and improve the accuracy of computer vision algorithms. This approach ensures that the camera operates at peak efficiency, reducing errors in tasks such as object recognition and spatial mapping. The system is particularly useful in applications requiring precise imaging, such as autonomous vehicles, robotics, and augmented reality.

Claim 9

Original Legal Text

9. The system of claim 1, wherein the extrinsic parameters include at least of camera location within the three-dimensional space and camera orientation, wherein camera orientation includes the positional angle of the camera.

Plain English Translation

This invention relates to a system for determining extrinsic parameters of a camera within a three-dimensional space. The system addresses the challenge of accurately capturing and processing camera position and orientation data to enable precise spatial mapping and navigation. The extrinsic parameters include the camera's location within the three-dimensional space and its orientation, which encompasses the positional angle of the camera. The system likely integrates multiple sensors or computational techniques to derive these parameters, ensuring accurate alignment between the camera's perspective and the surrounding environment. By determining the camera's location and orientation, the system enables applications such as augmented reality, robotics, autonomous navigation, and 3D reconstruction, where precise spatial awareness is critical. The positional angle of the camera is a key component of the orientation data, allowing the system to account for the camera's tilt, pan, and roll, which are essential for accurate spatial mapping and interaction with the environment. The system may also incorporate calibration methods to refine the extrinsic parameters, ensuring long-term reliability and accuracy in dynamic environments.

Claim 10

Original Legal Text

10. The system of claim 1, wherein the integration information includes scaling information and three-dimensional rotational position for the supplemental content.

Plain English Translation

This invention relates to systems for integrating supplemental content into a primary content stream, particularly in augmented reality (AR) or virtual reality (VR) environments. The problem addressed is the accurate alignment and positioning of supplemental content, such as digital overlays, within a three-dimensional space to ensure proper visualization and interaction. The system includes a primary content stream, such as a live video feed or a virtual environment, and supplemental content that is overlaid onto the primary content. The integration information, which is used to position the supplemental content, includes scaling information to adjust the size of the supplemental content relative to the primary content and three-dimensional rotational position data to orient the supplemental content correctly within the space. This ensures that the supplemental content appears seamlessly integrated with the primary content, maintaining spatial coherence and realism. The system may also include a display device, such as an AR headset or VR display, to present the combined primary and supplemental content to a user. The integration information is dynamically adjusted based on user movement or changes in the primary content to maintain accurate positioning. This allows for real-time adjustments, ensuring the supplemental content remains properly aligned even as the user's perspective or the environment changes. The system may also include sensors or tracking mechanisms to monitor the user's position and orientation, providing input for the integration information.

Claim 12

Original Legal Text

12. The method of claim 11, wherein the three-dimensional space includes one or more visible physical features, and wherein the presentation location is a surface on a visible physical feature within the three-dimensional space or is a portion of airspace within the three-dimensional space.

Plain English Translation

This invention relates to methods for presenting digital content within a three-dimensional space, addressing the challenge of integrating virtual elements with real-world environments in a spatially accurate manner. The method involves determining a presentation location for digital content within a three-dimensional space, where the space may include visible physical features such as objects, surfaces, or structures. The presentation location can be either a surface on one of these physical features or a specific portion of the airspace within the three-dimensional environment. The method further includes generating a representation of the digital content at the determined presentation location, ensuring the content is spatially aligned with the physical features or airspace. This approach enables seamless blending of virtual and real-world elements, enhancing applications in augmented reality, spatial computing, and interactive displays. The method may also involve tracking the physical features or airspace to dynamically adjust the presentation location as the environment changes, ensuring persistent and accurate placement of the digital content. This technique improves user experience by providing contextually relevant and spatially coherent digital overlays in real-world settings.

Claim 14

Original Legal Text

14. The method of claim 11, wherein the integration information includes the presence of one or more occluding physical features within the three-dimensional space that occludes or partially occludes the identified presentation location from the field of view of the camera.

Plain English Translation

This invention relates to systems for integrating virtual content into a three-dimensional space captured by a camera, addressing the challenge of accurately placing and rendering virtual objects in real-world environments. The method involves determining a presentation location for virtual content within the three-dimensional space based on camera data and then integrating the virtual content at that location. A key aspect of the invention is the inclusion of integration information that accounts for physical features in the environment that may occlude or partially occlude the presentation location from the camera's field of view. This ensures that the virtual content is rendered realistically, taking into account real-world obstructions that could otherwise disrupt the seamless blending of virtual and physical elements. The system may use depth sensing, object detection, or other techniques to identify these occluding features and adjust the rendering of the virtual content accordingly. This approach improves the accuracy and visual coherence of augmented reality applications by dynamically adapting to the physical environment.

Claim 15

Original Legal Text

15. The method of claim 14, wherein, responsive to the identified presentation location being occluded or partially occluded, the modified two-dimensional video content includes occluding or partially occluding the supplemental content at the identified presentation locations from the field of view of the camera, such that the supplemental content appears to be occluded or partially occluded by the one or more occluding physical features defined by the integration information.

Plain English Translation

This invention relates to augmented reality (AR) systems that overlay supplemental digital content onto a real-world environment captured by a camera. The problem addressed is ensuring that the supplemental content appears naturally integrated with the physical environment, particularly when real-world objects occlude or partially occlude the intended presentation locations of the digital content. The solution involves dynamically modifying the two-dimensional video content to simulate occlusion effects. When the system detects that a presentation location for supplemental content is occluded or partially occluded by physical features (e.g., objects, walls, or other obstacles), the supplemental content is adjusted to appear similarly occluded or partially occluded in the video output. This is achieved by analyzing integration information that defines the physical features causing the occlusion and applying corresponding visual modifications to the supplemental content. The result is a more realistic AR experience where digital content interacts with the real world in a physically plausible manner, enhancing immersion and user engagement. The method ensures that the supplemental content does not appear to float or intersect with occluding objects, maintaining visual coherence.

Claim 16

Original Legal Text

16. The method of claim 11, wherein the values of the intrinsic and extrinsic parameters of the camera change based on the progression of the two-dimensional video content.

Plain English Translation

This invention relates to dynamic camera parameter adjustment in two-dimensional video content. The problem addressed is the static nature of camera settings in traditional video production, which limits visual engagement and adaptability to changing scenes. The solution involves a method where intrinsic and extrinsic camera parameters are dynamically adjusted based on the progression of the video content. Intrinsic parameters, such as focal length and aperture, and extrinsic parameters, such as position and orientation, are modified in real-time to enhance visual storytelling. The adjustments are synchronized with scene changes, narrative pacing, or other content progression cues. For example, the camera's field of view may widen during action sequences or narrow during intimate dialogue scenes. The extrinsic parameters may shift the camera's perspective to emphasize different elements as the story unfolds. The method ensures seamless transitions between parameter changes to maintain visual coherence. This dynamic adjustment allows for more immersive and adaptable video content without requiring manual intervention for each parameter change. The system may use pre-defined rules, machine learning models, or real-time analysis of the video content to determine the appropriate parameter adjustments. The result is a more engaging viewing experience that responds to the evolving narrative and visual demands of the content.

Claim 17

Original Legal Text

17. The method of claim 11, wherein the supplemental content includes image content, video content, and text content.

Plain English Translation

This invention relates to a method for delivering supplemental content to a user during a media playback session. The method addresses the problem of providing relevant and engaging additional content to users while they consume media, such as audio or video files, to enhance their experience without disrupting the primary content. The method involves analyzing the media being played to identify key features or metadata, such as genre, artist, or themes. Based on this analysis, the system retrieves supplemental content that is contextually relevant to the media. The supplemental content may include image content, such as album art or related visuals, video content, such as music videos or interviews, and text content, such as lyrics, artist biographies, or related articles. The supplemental content is then dynamically presented to the user in a non-intrusive manner, ensuring it does not interfere with the primary media playback. The system may also track user interactions with the supplemental content to refine future recommendations, improving personalization over time. Additionally, the method may support user customization, allowing users to select or deselect specific types of supplemental content they wish to receive. This ensures a tailored experience that aligns with individual preferences. The goal is to provide an enriched media consumption experience by seamlessly integrating relevant supplemental content.

Claim 18

Original Legal Text

18. The method of claim 1, wherein the intrinsic parameters include at least one of focal length, focal point, aperture, and aspect ratio of the field of view.

Plain English Translation

This invention relates to image processing systems that analyze intrinsic camera parameters to enhance image capture or analysis. The problem addressed is the need for accurate and efficient determination of key camera characteristics to improve image quality, calibration, or computational tasks like object detection or scene reconstruction. The method involves extracting and utilizing intrinsic camera parameters, which include focal length, focal point, aperture, and aspect ratio of the field of view. These parameters define the camera's optical properties and geometric constraints, which are critical for tasks such as image rectification, 3D reconstruction, or automated focus adjustment. By incorporating these parameters, the system can optimize image processing algorithms, reduce distortion, or improve the accuracy of computer vision applications. The method may also involve capturing images with a camera, processing the images to extract or refine the intrinsic parameters, and applying these parameters to subsequent image analysis or correction steps. This ensures that the camera's optical behavior is accurately modeled, leading to more reliable results in applications such as augmented reality, robotics, or medical imaging. The approach may further include dynamic adjustments based on real-time parameter changes, such as when the camera's zoom or focus settings are modified.

Claim 19

Original Legal Text

19. The method of claim 11, wherein the extrinsic parameters include at least of camera location within the three-dimensional space and camera orientation, wherein camera orientation includes the positional angle of the camera.

Plain English Translation

This invention relates to a method for determining extrinsic parameters of a camera within a three-dimensional space, particularly focusing on camera location and orientation. The method addresses the challenge of accurately capturing and processing spatial data by incorporating positional and angular information to enhance the precision of camera positioning in three-dimensional environments. The extrinsic parameters, which include the camera's location and orientation, are critical for applications such as augmented reality, robotics, and computer vision, where accurate spatial mapping is essential. The camera orientation is defined by the positional angle, which specifies the camera's direction relative to a reference frame. By integrating these parameters, the method ensures that the camera's position and angle are precisely determined, enabling more accurate reconstruction of the three-dimensional space. This approach improves the reliability of spatial data acquisition and processing, making it suitable for dynamic environments where real-time adjustments are necessary. The method may also involve additional steps, such as capturing images or sensor data, to refine the extrinsic parameters further. The overall system enhances the accuracy and efficiency of camera-based spatial mapping, supporting applications requiring precise three-dimensional modeling and interaction.

Claim 20

Original Legal Text

20. The method of claim 11, wherein the integration information includes scaling information and three-dimensional rotational position for the supplemental content.

Plain English Translation

This invention relates to integrating supplemental content, such as digital overlays, into a three-dimensional environment, addressing the challenge of accurately aligning and scaling additional visual elements with real-world or virtual scenes. The method involves determining integration information for the supplemental content, which includes scaling data to ensure proper size and proportion relative to the environment, as well as three-dimensional rotational positioning to orient the content correctly within the space. This allows the supplemental content to be seamlessly blended with the existing environment, enhancing applications in augmented reality, virtual reality, or other immersive technologies. The integration process may involve analyzing the environment to determine optimal placement and orientation, ensuring the supplemental content appears natural and contextually relevant. The method may also include adjusting the content dynamically based on changes in the environment or user perspective, maintaining alignment and coherence. This approach improves the realism and usability of augmented or virtual environments by providing precise control over the placement and appearance of additional digital elements.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

Patent Metadata

Filing Date

February 2, 2023

Publication Date

April 23, 2024

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Citation & reuse

Analysis on this page is generated by Patentable — an AI-powered patent intelligence platform. AI-generated summaries, explanations, FAQs, and analysis may be reused with attribution and a visible link back to the canonical URL below. Patent abstracts and claims are USPTO public domain.

Cite as: Patentable. “Systems and methods to insert supplemental content into presentations of two-dimensional video content based on intrinsic and extrinsic parameters of a camera” (US-11968410). https://patentable.app/patents/US-11968410

© 2026 Nomic Interactive Technology LLC. Machine-readable context available at /api/llm-context/US-11968410. See llms.txt for full attribution policy.