Dolby Laboratories Patent Applications

SYSTEM AND TOOLS FOR ENHANCED 3D AUDIO AUTHORING AND RENDERING

Granted: January 16, 2025
Application Number: 20250024222
Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular…

SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Granted: January 16, 2025
Application Number: 20250024086
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…

AUDIO ENCODING AND DECODING USING PRESENTATION TRANSFORM PARAMETERS

Granted: January 16, 2025
Application Number: 20250022475
A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second…

ACOUSTIC ZONING WITH DISTRIBUTED MICROPHONES

Granted: January 16, 2025
Application Number: 20250022465
A method for estimating a user's location in an environment may involve receiving output signals from each microphone of a plurality of microphones in the environment. At least two microphones of the plurality of microphones may be included in separate devices at separate locations in the environment and the output signals may correspond to a current utterance of a user. The method may involve determining multiple current acoustic features from the output signals of each microphone and…

LEARNABLE HEURISTICS TO OPTIMIZE A MULTI-HYPOTHESIS FILTERING SYSTEM

Granted: January 2, 2025
Application Number: 20250006170
Some disclosed methods involve receiving microphone signals from a microphone system, including signals corresponding to one or more sounds detected by the microphone system. Some methods may involve determining, via a trained neural network, a filtering scheme for the microphone signals, the filtering scheme including one or more filtering processes. The trained neural network may be configured to implement one or more subband-domain adaptive filter management modules. Some methods may…

AUDIO ENHANCEMENT FOR MOBILE CAPTURE

Granted: January 2, 2025
Application Number: 20250008284
A system for real-time monitoring of user-generated audio content for audio anomaly and a related method are disclosed. In some embodiments, the system is programmed to receive, in real time, audio data generated by a first mobile device, such as a smartphone. The system is programed to determine, in real time, whether an audio anomaly has occurred from the audio data. The system is programmed to cause, in real time, a presentation of an alert to a second mobile device, which could be…

ESTIMATION OF AUDIO DEVICE AND SOUND SOURCE LOCATIONS

Granted: January 2, 2025
Application Number: 20250008262
Some disclosed methods involve receiving, by a control system, location control data from a sound source as the sound source emits sound in a plurality of sound source locations within an audio environment. Some such methods involve receiving, by the control system, direction of arrival data from each audio device of a plurality of audio devices in the audio environment. In some examples, each audio device of the plurality of audio devices includes a microphone array and the direction of…

PROJECTION SYSTEM AND METHOD WITH BLENDED COLOR GAMUT

Granted: January 2, 2025
Application Number: 20250008080
A projection system and method therefore related to a first projection device; a second projection device; at least one spatial modulator; and an electronic processor configured to: receive a two-dimensional video data, generate a first plurality of intensity values of a first color gamut and a second plurality of intensity values of a second color gamut, subtract a luminance threshold from a plurality of pixel values of the second color gamut to yield a plurality of positive pixel…

SYSTEMS AND METHODS TO GENERATE COPIES OF DATA FOR TRANSMISSION OVER MULTIPLE COMMUNICATION CHANNELS

Granted: January 2, 2025
Application Number: 20250007645
Systems and methods to transmit data over multiple communication channels in parallel with forward error correction. Original packets are evenly distributed to the channels as the initial systematically channel-encoded packets. Subsequent channel-encoded packets are configured to be linearly independent of their base sets of channel-encoded packets, where a base set for a subsequent channel-encoded packet includes those scheduled to be transmitted before the subsequent packet in the same…

AUDIO CONTENT GENERATION AND CLASSIFICATION

Granted: January 2, 2025
Application Number: 20250006208
Some disclosed methods involve receiving audio data of at least a first audio data type and a second audio data type, including audio signals and associated spatial data indicating intended perceived spatial positions for the audio signals, determining at least a first feature type from the audio data and applying a positional encoding process to the audio data, to produce encoded audio data. The encoded audio data may include representations of at least the spatial data and the first…

DATA STRUCTURE FOR MULTIMEDIA APPLICATONS

Granted: January 2, 2025
Application Number: 20250005068
Embodiments described herein provide a unified container format for delivering different multimedia applications. One embodiment provides a data structure utilized for implementing a plurality of multimedia applications. The data structure includes a first metadata level including low-level metadata used to perform operations associated with media data in a bitstream. The data structure includes a second metadata level including mid-level metadata used to apply operation metadata to…

LIGHT PROJECTION SYSTEM USING WHITE LIGHT ILLUMINATION

Granted: January 2, 2025
Application Number: 20250004357
Light projection systems using white light illumination. One embodiment provides a projection system using white light illumination. The projection system includes an illumination assembly configured to receive a white light input. A prism is configured to separate the white light input into color light inputs, redirect the color light inputs to respective modulators, and combine modulated color light inputs from the respective modulators into a white light output. An optical filter is…

METHODS AND SYSTEMS FOR DESIGNING AND APPLYING NUMERICALLY OPTIMIZED BINAURAL ROOM IMPULSE RESPONSES

Granted: December 26, 2024
Application Number: 20240430637
Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other…

METHOD AND DEVICE FOR ENCODING AND DECODING IMAGE USING MOTION VECTOR RESOLUTION SCALING

Granted: December 26, 2024
Application Number: 20240430475
A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the…

LUMINANCE BASED CODING TOOLS FOR VIDEO COMPRESSION

Granted: December 26, 2024
Application Number: 20240430455
Sample data and metadata related to spatial regions in images may be received from a coded video signal. It is determined whether specific spatial regions in the images correspond to a specific region of luminance levels. In response to determining the specific spatial regions correspond to the specific region of luminance levels, signal processing and video compression operations are performed on sets of samples in the specific spatial regions. The signal processing and video…

TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS

Granted: December 26, 2024
Application Number: 20240428815
This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.

FACE REGION DETECTION AND LOCAL RESHAPING ENHANCEMENT

Granted: December 26, 2024
Application Number: 20240428612
Methods and corresponding systems to process face regions are disclosed. The described methods include providing face bounding boxes and confidence levels for the faces, generating a histogram of the pixels and the faces, generating a probability of face, and generating a face probability map. A face contrast adjustment and a face saturation adjustment can be applied to the face probability map.

TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS

Granted: December 19, 2024
Application Number: 20240420717
This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.

RENDERING BASED ON LOUDSPEAKER ORIENTATION

Granted: December 19, 2024
Application Number: 20240422503
An audio processing method may involve receiving audio signals and associated spatial data, listener position data, loudspeaker position data and loudspeaker orientation data, and rendering the audio data for reproduction, based, at least in part, on the spatial data, the listener position data, the loudspeaker position data and the loudspeaker orientation data, to produce rendered audio signals. The rendering may involve applying a loudspeaker orientation factor that tends to reduce a…

RESHAPER FOR LEARNING BASED IMAGE/VIDEO CODING

Granted: December 19, 2024
Application Number: 20240422345
An input image represented in an input domain is received from an input video signal. Forward reshaping is performed on the input image to generate a forward reshaped image represented in a reshaped image domain. Non-reshaping encoding operations are performed to encode the reshaped image into an encoded video signal. At least one of the non-reshaping encoding operations is implemented with an ML model that has been previously trained with training images in one or more training datasets…