Systems and methods for linking time-based media and temporal metadata provide single command control during editing of media having associated temporal metadata. A single control enables an editor to switch between monitoring both source media and its corresponding temporal metadata and monitoring both a pre-recorded version of the media and its corresponding temporal metadata. Another single control enables an editor to start and stop the recording of a source media track and its corresponding source temporal metadata. In one application, the editing is performed for media tracks having time-based spatial metadata for playback in immersive environments with the spatial metadata defining an apparent location of sound objects within the immersive environment.
Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.
1. A media editing method comprising: on a first media editing system: storing a track of time-based media; storing time-based metadata that pertains to and is temporally synchronized with the track of time-based media; while playing back the stored time-based media track and the stored time-based metadata on the first media editing system: receiving media and time-based metadata from a second media editing system, wherein the received media and metadata includes at least one of an edited version of the time-based media track and an edited version of the time-based metadata; enabling a user of the first media editing system to: actuate a first single control of the first media editing system to toggle between: (i) monitoring playback of the stored time-based media track and the stored time-based metadata; and (ii) monitoring playback of the media and time-based metadata received from the second media editing system; and actuate a second single control of the first media editing system to start and stop recording the media and time-based metadata received from the second media editing system, wherein recording the media and time-based metadata received from the second media editing system overwrites a temporally corresponding portion of the media track and the time-based metadata stored on the first system.
A media editing system allows synchronized editing of time-based media (like audio or video) and its associated time-based metadata (like spatial positioning or effect settings). The system operates across two editing workstations. The first workstation stores a media track and corresponding metadata. While playing this back, it receives edited versions of the media and/or metadata from a second workstation. A single control on the first workstation lets the user switch between monitoring the original media/metadata and the received, edited media/metadata. Another single control starts/stops recording the received media/metadata, overwriting the corresponding section of the original media and metadata stored on the first workstation. This enables real-time collaborative editing.
2. The method of claim 1 , wherein the time-based media tracks of the first and second media editing systems are audio tracks; and the time-based metadata of the first media editing systems comprise panning automation data that define an apparent location of a source of audio data on the audio track stored on the first media editing system in a space within a venue for playing back the audio track stored on the first media editing system; and the time-based metadata of the second media editing systems comprise panning automation data that define an apparent spatial location of a source of audio data on the audio track received from the second media editing system within a venue for playing back the audio track received from the second media editing system.
The media editing method from the previous description is enhanced for spatial audio editing. The time-based media tracks on both editing systems are audio tracks. The time-based metadata consists of panning automation data. This data defines the apparent spatial location of a sound source within a virtual venue for audio playback. The first and second editing systems can adjust the panning automation data, allowing editors to change the perceived location of sounds within the immersive audio environment.
3. The method of claim 1 , wherein the time-based media tracks are audio tracks and the time-based metadata comprise reverberation settings for audio data on the audio tracks in a venue for playing back the audio tracks.
The media editing method from the first description focuses on audio editing with reverberation effects. The time-based media tracks are audio tracks. The time-based metadata contains reverberation settings for the audio. These settings control how the audio sounds within a simulated venue. This enables coordinated editing of audio tracks and their reverberation characteristics for creating immersive audio experiences.
4. A media editing system comprising: a first digital audio workstation comprising a first automation system and first audio storage, the first digital audio station in data communication with a second digital audio workstation, the second digital audio workstation comprising a second automation system and second audio storage, wherein the first digital audio workstation is configured to: receive from the second digital audio workstation an audio track and time-based metadata pertaining to the audio track; enabling a user of the first digital audio workstation to: actuate a first single control of the first digital audio workstation to select whether the first digital audio workstation (i) monitors playback of the audio track and the time-based metadata received from the second digital audio workstation; or (ii) monitors playback of a corresponding audio track and time-based metadata pertaining to the audio track stored on the first digital audio workstation; and actuate a second single control of the first digital audio workstation to cause the first digital audio workstation to start recording both the audio track and the time-based metadata pertaining to the audio track received from the second digital audio workstation, wherein recording the received audio and time-based metadata overwrites a temporally corresponding portion of the audio track and time-based metadata stored on the first digital audio workstation.
A media editing system uses two digital audio workstations (DAWs) for collaborative editing. The first DAW has its own audio storage and automation system and is connected to a second DAW, which also has audio storage and automation. The first DAW receives audio tracks and associated time-based metadata from the second DAW. A single control on the first DAW allows the user to switch between monitoring the audio/metadata received from the second DAW and monitoring the audio/metadata stored locally on the first DAW. Another single control on the first DAW starts/stops recording the audio/metadata received from the second DAW, overwriting the corresponding portions of the locally stored audio/metadata.
5. The media editing system of claim 4 , wherein the time-based metadata stored on the first digital audio workstation and the time-based metadata received from the second digital audio workstation are panning metadata that define an apparent spatial location of a source of audio data of their corresponding audio tracks in a venue for playing back the audio tracks.
The media editing system described previously uses panning metadata to define the spatial location of audio. The time-based metadata stored on both digital audio workstations (DAWs) consists of panning metadata. This panning metadata determines the apparent spatial location of the audio source within a virtual venue where the audio tracks will be played back. This allows for the coordinated editing and adjustment of spatial audio properties across the two DAWs.
6. The media editing system of claim 4 , wherein the first digital audio workstation outputs audio data and time-based metadata that is being monitored by the first digital audio workstation to a rendering system, wherein the rendering system generates signals for a plurality of speakers to generate audio that appears to originate from a source location determined by the time-based metadata being monitored by the first digital audio workstation.
In the media editing system described two descriptions ago, the first digital audio workstation (DAW) outputs the audio and time-based metadata that the user is currently monitoring to a rendering system. This rendering system then uses the time-based metadata to generate signals for multiple speakers. These signals create the illusion that the audio is originating from a specific location determined by the time-based metadata. This allows the system to create an immersive audio experience based on the edited metadata.
7. A computer program product comprising: a non-transitory computer-readable medium with computer program instructions encoded thereon, wherein the computer program instructions, when processed by a first media editing system instruct the first media editing system to perform a method for media editing comprising: on the first media editing system: storing a track of time-based media; storing time-based metadata that pertains to and is temporally synchronized with the track of time-based media; while playing back the stored time-based media track and the stored time-based metadata on the first media editing system: receiving media and metadata from a second media editing system, wherein the received media and metadata includes at least one of an edited version of the time-based media track and an edited version of the time-based metadata; enabling a user of the first media editing system to: actuate a first single control of the first media editing system to toggle between: (i) monitoring playback of the stored time-based media track and the stored time-based metadata; and (ii) monitoring playback of the media and metadata received from the second media editing system; and actuate a second single control of the first media editing system to start and stop recording the media and metadata received from the second media editing system, wherein recording the media and metadata received from the second media editing system overwrites a temporally corresponding portion of the media track and the time-based metadata stored on the first system.
A software program, stored on a non-transitory computer-readable medium, instructs a media editing system to perform a synchronized media and metadata editing method. The method involves the first editing system storing a time-based media track and associated time-based metadata. While playing this back, the system receives edited versions of the media and/or metadata from a second workstation. A single control lets the user switch between monitoring the original media/metadata and the received, edited media/metadata. Another single control starts/stops recording the received media/metadata, overwriting the corresponding section of the original media and metadata stored on the first workstation.
8. The computer program product of claim 7 , wherein: the first media editing system and the second media editing system are digital audio workstations; the tracks of time-based media are audio tracks; and the time-based metadata of the first and second media editing systems comprise panning automation data that define an apparent spatial location of a source of audio data on their corresponding audio tracks within a venue for playing back the audio tracks.
The software program described previously is designed for digital audio workstations (DAWs) and spatial audio editing. The first and second editing systems are DAWs. The tracks of time-based media are audio tracks. The time-based metadata is panning automation data. This data defines the apparent spatial location of a sound source within a virtual venue for audio playback on each DAW. This allows coordinated spatial audio editing between the two systems.
Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.
April 10, 2017
November 14, 2017
Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.