Dolby Laboratories Patent Applications

METHOD AND DEVICE FOR ENCODING AND DECODING IMAGE USING MOTION VECTOR RESOLUTION SCALING

Granted: December 26, 2024
Application Number: 20240430475
A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the…

LUMINANCE BASED CODING TOOLS FOR VIDEO COMPRESSION

Granted: December 26, 2024
Application Number: 20240430455
Sample data and metadata related to spatial regions in images may be received from a coded video signal. It is determined whether specific spatial regions in the images correspond to a specific region of luminance levels. In response to determining the specific spatial regions correspond to the specific region of luminance levels, signal processing and video compression operations are performed on sets of samples in the specific spatial regions. The signal processing and video…

TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS

Granted: December 26, 2024
Application Number: 20240428815
This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.

FACE REGION DETECTION AND LOCAL RESHAPING ENHANCEMENT

Granted: December 26, 2024
Application Number: 20240428612
Methods and corresponding systems to process face regions are disclosed. The described methods include providing face bounding boxes and confidence levels for the faces, generating a histogram of the pixels and the faces, generating a probability of face, and generating a face probability map. A face contrast adjustment and a face saturation adjustment can be applied to the face probability map.

RENDERING BASED ON LOUDSPEAKER ORIENTATION

Granted: December 19, 2024
Application Number: 20240422503
An audio processing method may involve receiving audio signals and associated spatial data, listener position data, loudspeaker position data and loudspeaker orientation data, and rendering the audio data for reproduction, based, at least in part, on the spatial data, the listener position data, the loudspeaker position data and the loudspeaker orientation data, to produce rendered audio signals. The rendering may involve applying a loudspeaker orientation factor that tends to reduce a…

RESHAPER FOR LEARNING BASED IMAGE/VIDEO CODING

Granted: December 19, 2024
Application Number: 20240422345
An input image represented in an input domain is received from an input video signal. Forward reshaping is performed on the input image to generate a forward reshaped image represented in a reshaped image domain. Non-reshaping encoding operations are performed to encode the reshaped image into an encoded video signal. At least one of the non-reshaping encoding operations is implemented with an ML model that has been previously trained with training images in one or more training datasets…

TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS

Granted: December 19, 2024
Application Number: 20240420717
This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.

APPLYING MINIMUM AND AVERAGE DISTANCE CONSTRAINT IN VIDEO STREAMING

Granted: December 5, 2024
Application Number: 20240406461
Input images are received as input to a multi-node system. The input images are divided into segments assigned to respective nodes of the multi-node system. Primary and secondary scenes are identified in the segments to ensure compliance with minimum and average distance constraints. Scene-level forward reshaping mappings are generated for the scenes by a respective node for an assigned segment. Forward reshaped images in the segment are generated by the node using the forward reshaping…

BINAURAL DIALOGUE ENHANCEMENT

Granted: December 5, 2024
Application Number: 20240406650
Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the…

AUDIO SPEAKER COORDINATION SYSTEM

Granted: December 5, 2024
Application Number: 20240406629
Novel methods and systems for coordinating sound to both internal and external speakers for a device is disclosed. The audio signal is distributed among the internal and external speakers and aligned so that signals going to the internal speakers are aligned with signals going to the external speakers.

METHOD FOR DATA RATE AND BUFFER ESTIMATION FOR MULTI-SOURCE DELIVERY

Granted: December 5, 2024
Application Number: 20240406243
The present disclosure relates to a method and variable quality playback system for selecting a quality of media content. The method comprising receiving (S4001) media content of a data segment (1010) over at least one network path (1031a, 1031b, 1031c), the media content being encoded with network or application-layer code and storing (S4002) the media content in a network or application-layer decoder (1050). The network or application-layer decoder (1050) is configured to decode the…

CONTEXT-DEPENDENT COLOR-MAPPING OF IMAGE AND VIDEO DATA

Granted: December 5, 2024
Application Number: 20240404030
Systems and methods for performing color mapping operations. One system includes a processor to perform post-production editing of image data. The processor is configured to identify a first region of an image and identify a second region of the image. The first region includes a first white point having a first tone, and the second region includes a second white point having a second tone. The processor is further configured to determine a color mapping function based on the first tone,…

DYNAMIC SPATIAL METADATA FOR IMAGE AND VIDEO PROCESSING

Granted: November 28, 2024
Application Number: 20240397088
Methods and systems for generating and using dynamic spatial metadata in image and video processing are described. In an encoder, in addition to global metadata, local, spatial metadata for two or more image regions or image objects are generated, smoothed, and embedded as spatial metadata values. In a decoder, the decoder can reconstruct the spatial metadata and use interpolation techniques to generate metadata for specific regions of interest. Examples of generating spatial metadata…

METHOD AND DEVICE FOR ENCODING AND DECODING INTRA-FRAME PREDICTION

Granted: November 28, 2024
Application Number: 20240397039
A method and a device for encoding and decoding intra prediction are disclosed. An image decoding method for performing intra prediction comprises the steps of: receiving a bitstream including data on prediction modes of a current block and a block adjacent to the current block; extracting the data from the received bitstream so as to confirm the prediction mode of the adjacent block; determining whether a boundary pixel within the adjacent block can be used as a reference pixel for the…

SYSTEM AND METHOD FOR ENHANCEMENT OF A DEGRADED AUDIO SIGNAL

Granted: November 28, 2024
Application Number: 20240395267
The present disclosure relates to the field of audio enhancement, and in particular to methods, devices and software for supervised training of a machine learning model, MLM, the MLM trained to enhance a degraded audio signal by calculating gains to be applied to frequency bands of the degraded audio signal. The present disclosure further relates to methods, devices and software for use of such a trained MLM.

SELFIE VOLUMETRIC VIDEO

Granted: November 28, 2024
Application Number: 20240394987
Camera tracking data with respect to a camera operating in a 3D physical space is received. An image portion depicting one or more visual objects not physically present in the 3D physical space is generated using a camera perspective derived from the camera tracking data. The one or more visual objects is caused to be visually combined with the camera perspective into a personal image taken by the camera.

SOURCE COLOR VOLUME INFORMATION MESSAGING

Granted: November 21, 2024
Application Number: 20240388738
Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and…

METHODS AND DEVICES FOR CONTROLLING AUDIO PARAMETERS

Granted: November 14, 2024
Application Number: 20240378014
A method of controlling headphones having external microphone signal pass-through functionality may involve controlling a display to present a geometric shape on the display and receiving an indication of digit motion from a sensor system associated with the display. The sensor system may include a touch sensor system or a gesture sensor system. The indication may be an indication of a direction of digit motion relative to the display. The method may involve controlling the display to…

PERVASIVE ACOUSTIC MAPPING

Granted: November 14, 2024
Application Number: 20240381046
Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first calibration signals, generating first modified audio playback signals by inserting the first calibration signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may…

SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Granted: November 14, 2024
Application Number: 20240380928
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…