RENDERING BASED ON LOUDSPEAKER ORIENTATION
Granted: December 19, 2024
Application Number:
20240422503
An audio processing method may involve receiving audio signals and associated spatial data, listener position data, loudspeaker position data and loudspeaker orientation data, and rendering the audio data for reproduction, based, at least in part, on the spatial data, the listener position data, the loudspeaker position data and the loudspeaker orientation data, to produce rendered audio signals. The rendering may involve applying a loudspeaker orientation factor that tends to reduce a…
RESHAPER FOR LEARNING BASED IMAGE/VIDEO CODING
Granted: December 19, 2024
Application Number:
20240422345
An input image represented in an input domain is received from an input video signal. Forward reshaping is performed on the input image to generate a forward reshaped image represented in a reshaped image domain. Non-reshaping encoding operations are performed to encode the reshaped image into an encoded video signal. At least one of the non-reshaping encoding operations is implemented with an ML model that has been previously trained with training images in one or more training datasets…
TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS
Granted: December 19, 2024
Application Number:
20240420717
This disclosure falls into the field of audio coding; in particular, it relates to providing a framework for loudness consistency among differing audio output signals. More specifically, the disclosure relates to methods, computer program products and apparatus for encoding and decoding audio data bitstreams in order to attain a desired loudness level of an output audio signal.
BINAURAL DIALOGUE ENHANCEMENT
Granted: December 5, 2024
Application Number:
20240406650
Methods for dialogue enhancement of audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the…
AUDIO SPEAKER COORDINATION SYSTEM
Granted: December 5, 2024
Application Number:
20240406629
Novel methods and systems for coordinating sound to both internal and external speakers for a device are disclosed. The audio signal is distributed among the internal and external speakers and aligned so that signals going to the internal speakers are aligned with signals going to the external speakers.
APPLYING MINIMUM AND AVERAGE DISTANCE CONSTRAINT IN VIDEO STREAMING
Granted: December 5, 2024
Application Number:
20240406461
Input images are received as input to a multi-node system. The input images are divided into segments assigned to respective nodes of the multi-node system. Primary and secondary scenes are identified in the segments to ensure compliance with minimum and average distance constraints. Scene-level forward reshaping mappings are generated for the scenes by a respective node for an assigned segment. Forward reshaped images in the segment are generated by the node using the forward reshaping…
METHOD FOR DATA RATE AND BUFFER ESTIMATION FOR MULTI-SOURCE DELIVERY
Granted: December 5, 2024
Application Number:
20240406243
The present disclosure relates to a method and variable quality playback system for selecting a quality of media content. The method comprises receiving (S4001) media content of a data segment (1010) over at least one network path (1031a, 1031b, 1031c), the media content being encoded with network or application-layer code, and storing (S4002) the media content in a network or application-layer decoder (1050). The network or application-layer decoder (1050) is configured to decode the…
CONTEXT-DEPENDENT COLOR-MAPPING OF IMAGE AND VIDEO DATA
Granted: December 5, 2024
Application Number:
20240404030
Systems and methods for performing color mapping operations. One system includes a processor to perform post-production editing of image data. The processor is configured to identify a first region of an image and identify a second region of the image. The first region includes a first white point having a first tone, and the second region includes a second white point having a second tone. The processor is further configured to determine a color mapping function based on the first tone,…
METHOD AND DEVICE FOR ENCODING AND DECODING INTRA-FRAME PREDICTION
Granted: November 28, 2024
Application Number:
20240397039
A method and a device for encoding and decoding intra prediction are disclosed. An image decoding method for performing intra prediction comprises the steps of: receiving a bitstream including data on prediction modes of a current block and a block adjacent to the current block; extracting the data from the received bitstream so as to confirm the prediction mode of the adjacent block; determining whether a boundary pixel within the adjacent block can be used as a reference pixel for the…
DYNAMIC SPATIAL METADATA FOR IMAGE AND VIDEO PROCESSING
Granted: November 28, 2024
Application Number:
20240397088
Methods and systems for generating and using dynamic spatial metadata in image and video processing are described. In an encoder, in addition to global metadata, local, spatial metadata for two or more image regions or image objects are generated, smoothed, and embedded as spatial metadata values. In a decoder, the decoder can reconstruct the spatial metadata and use interpolation techniques to generate metadata for specific regions of interest. Examples of generating spatial metadata…
SYSTEM AND METHOD FOR ENHANCEMENT OF A DEGRADED AUDIO SIGNAL
Granted: November 28, 2024
Application Number:
20240395267
The present disclosure relates to the field of audio enhancement, and in particular to methods, devices and software for supervised training of a machine learning model, MLM, the MLM trained to enhance a degraded audio signal by calculating gains to be applied to frequency bands of the degraded audio signal. The present disclosure further relates to methods, devices and software for use of such a trained MLM.
SELFIE VOLUMETRIC VIDEO
Granted: November 28, 2024
Application Number:
20240394987
Camera tracking data with respect to a camera operating in a 3D physical space is received. An image portion depicting one or more visual objects not physically present in the 3D physical space is generated using a camera perspective derived from the camera tracking data. The one or more visual objects are caused to be visually combined with the camera perspective into a personal image taken by the camera.
SOURCE COLOR VOLUME INFORMATION MESSAGING
Granted: November 21, 2024
Application Number:
20240388738
Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and…
PERVASIVE ACOUSTIC MAPPING
Granted: November 14, 2024
Application Number:
20240381046
Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first calibration signals, generating first modified audio playback signals by inserting the first calibration signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may…
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS
Granted: November 14, 2024
Application Number:
20240380928
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…
METHODS AND DEVICES FOR CONTROLLING AUDIO PARAMETERS
Granted: November 14, 2024
Application Number:
20240378014
A method of controlling headphones having external microphone signal pass-through functionality may involve controlling a display to present a geometric shape on the display and receiving an indication of digit motion from a sensor system associated with the display. The sensor system may include a touch sensor system or a gesture sensor system. The indication may be an indication of a direction of digit motion relative to the display. The method may involve controlling the display to…
A METHOD OF PROCESSING AUDIO FOR PLAYBACK OF IMMERSIVE AUDIO
Granted: November 7, 2024
Application Number:
20240373184
A method (200) of processing audio in an immersive audio format comprising at least one height audio channel, comprising: obtaining (250) two height audio signals from at least a portion of the at least one height audio channel; modifying (270) a relative phase between the two height audio signals in frequency bands in which phase differences are predominantly out of phase to obtain two phase modified height audio signals; and playing back (290) the processed audio comprising the two…
SYSTEM AND METHOD OF DIGITAL WATERMARKING
Granted: November 7, 2024
Application Number:
20240370964
A digital watermarking method including receiving, by an electronic processor, an original image signal containing a series of original visual images, where the original image signal is encoded using a perceptual quantizer (PQ) luminance level encoding transfer function resulting in PQ luminance steps within the original image signal, and where the PQ luminance steps have varying sizes across a luminance range. The method further includes receiving, by the electronic processor, a watermark…
SPEECH ENHANCEMENT
Granted: October 31, 2024
Application Number:
20240363131
A method for dereverberating audio signals is provided. In some implementations, the method involves obtaining a real acoustic impulse response (AIR); identifying a first portion of the real AIR corresponding to early reflections of a direct sound and a second portion of the real AIR corresponding to late reflections of the direct sound; generating one or more synthesized AIRs by modifying the first portion of the real AIR and/or the second portion of the real AIR; and using the…
METHOD AND SYSTEM FOR PICTURE SEGMENTATION USING COLUMNS
Granted: October 31, 2024
Application Number:
20240364894
Described is picture segmentation through columns and slices in video encoding and decoding. A video picture is divided into a plurality of columns, each column covering only a part of the video picture in a horizontal dimension. All coded tree blocks (“CTBs”) belonging to a slice may belong to one or more columns. The columns may be used to break the same or different prediction or in-loop filtering mechanisms of the video coding, and the CTB scan order used for encoding and/or…