Dolby Laboratories Patent Applications

NEURAL NETWORKS FOR HIGH DYNAMIC RANGE VIDEO SUPER-RESOLUTION

Granted: April 3, 2025
Application Number: 20250111475
Methods and systems for the super resolution of high dynamic range (HDR) video are described. Given a sequence of video frames, a current frame and two or more neighboring frames are processed by a neural-network (NN) feature extraction module, followed by a NN upscaling module, and a NN reconstruction module. In parallel, the current frame is upscaled using traditional up-sampling to generate an intermediate up-sampled frame. The output of the reconstruction module is added to the…

ENHANCING REMOTE VISUAL INTERACTION

Granted: April 3, 2025
Application Number: 20250111470
A communication client device operated by a first user in a communication session receives a viewing direction tracking data portion indicating a view direction of a second user in the communication session. It is determined that the view direction of the second user is towards a third user at a first time point in the communication session. The view direction of the second user is used to modify a pre-adapted visual depiction of the second user into an adapted visual depiction of the…

REVERBERATION GENERATION FOR HEADPHONE VIRTUALIZATION

Granted: March 27, 2025
Application Number: 20250106576
The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to…

BETA SCALE DYNAMIC DISPLAY MAPPING

Granted: March 27, 2025
Application Number: 20250106410
An input image to be coded into a video signal and a target image are received. The input image and the target image depict the same visual content. One or more beta scaling method indicators and one or more sets of one or more beta scale parameters are generated. The one or more beta scaling method indicators indicate one or more beta scaling methods that use the one or more sets of beta scale parameters to perform beta scaling operations on the input image to generate a reconstructed image…

POST-PROCESSING GAINS FOR SIGNAL ENHANCEMENT

Granted: March 27, 2025
Application Number: 20250104728
A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The…
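
The delta gain smoothing described above can be illustrated with a minimal Python sketch. The abstract states only that the smoothing factor depends on the gain delta; the first-order recursive filter, the linear mapping from delta to smoothing factor, and the names `base_alpha`, `scale`, and `initial_gain` are assumptions made for illustration and are not taken from the application.

```python
def delta_gain_smooth(raw_gains, base_alpha=0.2, scale=1.0, initial_gain=1.0):
    """Smooth per-frame raw gains with a delta-dependent smoothing factor.

    Assumed (hypothetical) rule: alpha = min(1, base_alpha + scale * delta),
    so large gain changes are tracked quickly and small ones are smoothed.
    """
    smoothed = []
    prev = initial_gain  # post-processed gain of the previous frame
    for raw in raw_gains:
        # Gain delta: |raw gain (current frame) - post-processed gain (previous frame)|
        delta = abs(raw - prev)
        alpha = min(1.0, base_alpha + scale * delta)
        # First-order smoothing step toward the raw gain
        prev = prev + alpha * (raw - prev)
        smoothed.append(prev)
    return smoothed


if __name__ == "__main__":
    print(delta_gain_smooth([1.0, 0.9, 0.2, 0.25]))
```

With this assumed mapping, a small delta leaves most of the previous gain in place, while a delta large enough to push the factor to 1 makes the output follow the raw gain immediately.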

ACOUSTIC ENVIRONMENT SIMULATION

Granted: March 27, 2025
Application Number: 20250104720
Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (β²) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a…

METHOD AND APPARATUS FOR ENCODING AND DECODING AN HOA REPRESENTATION

Granted: March 20, 2025
Application Number: 20250095661
The present invention relates to methods and apparatus for encoding an HOA signal representation (c(t)) of a sound field having an order of N and a number O=(N+1)² of coefficient sequences to a mezzanine HOA signal representation (w_MEZZ(t)). The present invention further relates to methods and apparatus for decoding a reconstructed HOA signal representation from the mezzanine HOA signal representation.
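
As a small worked example of the relation between the HOA order N and the number of coefficient sequences, O = (N+1)², the following Python sketch tabulates the count for a few orders. The function name `hoa_num_coefficients` is hypothetical and chosen only for this illustration.

```python
def hoa_num_coefficients(order):
    """Return the number of HOA coefficient sequences, O = (N + 1)^2, for order N."""
    if order < 0:
        raise ValueError("HOA order must be non-negative")
    return (order + 1) ** 2


if __name__ == "__main__":
    for n in range(5):
        print(f"order N={n}: O={hoa_num_coefficients(n)} coefficient sequences")
```

First-order Ambisonics (N=1) therefore has 4 coefficient sequences, and any representation with more than four corresponds to order 2 or higher.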

SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Granted: March 20, 2025
Application Number: 20250097480
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…

DOWNLOAD CONTROL IN MULTI-SERVER COMMUNICATION SYSTEM

Granted: March 20, 2025
Application Number: 20250097287
Apparatuses and methods for data traffic management in multi-source content delivery are described. The apparatus includes a downloader and a controller. The downloader is coupled to servers via communication links. The controller is configured to determine initial download requests for the servers based on predetermined information about a quality of the links. The controller is also configured to send the initial download requests to the servers with the downloader. The controller is…

LOW LATENCY AUDIO FILTERBANK WITH IMPROVED FREQUENCY RESOLUTION

Granted: March 20, 2025
Application Number: 20250096778
A filterbank, suitable for modifying audio signals with dynamic gains in each band, is constructed so that the perceived latency is small, while a larger group delay is applied at low frequencies to enable higher frequency resolution in the lower frequency bands. The higher group delay at low frequencies is achieved by inserting an all-pass filter into the reconstructed filter response.

SPATIAL CODING OF HIGHER ORDER AMBISONICS FOR A LOW LATENCY IMMERSIVE AUDIO CODEC

Granted: March 20, 2025
Application Number: 20250095660
Described herein is a method of encoding Higher Order Ambisonics, HOA, audio, the method including: receiving an input HOA audio signal having more than four Ambisonics channels; encoding the HOA audio signal using a SPAR coding framework and a core audio encoder; and providing the encoded HOA audio signal to a downstream device, the encoded HOA audio signal including core encoded SPAR downmix channels and encoded SPAR metadata. Further described are a method of decoding Higher Order…

NEURAL NETWORKS FOR DYNAMIC RANGE CONVERSION AND DISPLAY MANAGEMENT OF IMAGES

Granted: March 20, 2025
Application Number: 20250095125
Methods and systems for dynamic range conversion and display mapping of standard dynamic range (SDR) images onto high dynamic range (HDR) displays are described. Given an SDR input image, a processor generates an intensity (luminance) image and optionally a base layer image and a detail layer image. A first neural network uses the intensity image to predict statistics of the SDR image in a higher dynamic range. These predicted statistics together with the original image statistics of the…

AUDIO PROCESSING IN IMMERSIVE AUDIO SERVICES

Granted: March 13, 2025
Application Number: 20250088816
The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to…

METHOD FOR ENCODING AND DECODING IMAGE USING ADAPTIVE DEBLOCKING FILTERING, AND APPARATUS THEREFOR

Granted: March 13, 2025
Application Number: 20250088674
Disclosed is an encoding/decoding method and apparatus related to adaptive deblocking filtering. There is provided an image decoding method performing adaptive filtering in inter-prediction, the method including: reconstructing, from a bitstream, an image signal including a reference block on which block matching is performed in inter-prediction of a current block to be encoded; obtaining, from the bitstream, a flag indicating whether the reference block exists within a current picture…

SCALABLE SYSTEMS FOR CONTROLLING COLOR MANAGEMENT COMPRISING VARYING LEVELS OF METADATA

Granted: March 13, 2025
Application Number: 20250088647
Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.

AUDIO ENCODER AND DECODER WITH DYNAMIC RANGE COMPRESSION METADATA

Granted: March 13, 2025
Application Number: 20250087224
An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC…

ENGAGEMENT MEASUREMENT AND LEARNING AS A SERVICE

Granted: March 13, 2025
Application Number: 20250086674
An apparatus may include an interface system and a first local control system. The first local control system may be configured to: receive first sensor data from a first preview environment while a content stream is being presented in the first preview environment; generate, based at least in part on the first sensor data, first user engagement data corresponding to one or more people in the first preview environment, the first user engagement data indicating estimated engagement with…

SYSTEM AND METHOD FOR NON-DESTRUCTIVELY NORMALIZING LOUDNESS OF AUDIO SIGNALS WITHIN PORTABLE DEVICES

Granted: March 6, 2025
Application Number: 20250078849
Many portable playback devices cannot decode and play back encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A…

PREDICTIVE MOTION VECTOR CODING

Granted: March 6, 2025
Application Number: 20250080749
Overlapped block disparity estimation and compensation is described. Compensating for images with overlapped block disparity compensation (OBDC) involves determining if OBDC is enabled in a video bit stream, and determining if OBDC is enabled for one or more macroblocks that neighbor a first macroblock within the video bit stream. The neighboring macroblocks may be transform coded. If OBDC is enabled in the video bit stream and for the one or more neighboring macroblocks, predictions may…

DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS

Granted: March 6, 2025
Application Number: 20250078858
Disclosed herein are methods, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech…