Adaptive loudness normalization for audio object clustering
Granted: March 12, 2024
Patent Number:
11930347
A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the…
Electro-acoustic transducer
Granted: March 12, 2024
Patent Number:
11930342
An electro-acoustic transducer, comprising a supporting frame, a magnet assembly with an annular yoke surrounding a magnet a diaphragm attached to the front edge of the supporting frame, a voice coil suspended by the diaphragm in a gap formed between the magnet and the annular yoke, the voice coil being axially movable with respect to the magnet, and an annular damper arranged to stabilize the diaphragm. The transducer further comprises a damper holder having a substantially flat annular…
Method and apparatus for controlling enhancement of low-bitrate coded audio
Granted: March 12, 2024
Patent Number:
11929085
Described herein is a method of low-bitrate coding of audio data and generating enhancement metadata for controlling audio enhancement of the low-bitrate coded audio data at a decoder side, including the steps of: (a) core encoding original audio data at a low bitrate to obtain encoded audio data; (b) generating enhancement metadata to be used for controlling a type and/or amount of audio enhancement at the decoder side after core decoding the encoded audio data; and (c) outputting the…
Systems and methods for adapting human speaker embeddings in speech synthesis
Granted: March 12, 2024
Patent Number:
11929058
Novel methods and systems for adapting a voice cloning synthesizer for a new speaker using real speech data are disclosed. Utterances from one or more target speakers are parameterized and are used to initialize an embedding vector for use with a voice synthesizer, by means of clustering the utterance data and determining the centroid of the data, using a speaker identification neural network, and/or by finding the closest stored embedded vector to the utterance data.
Rendering binaural audio over multiple near field transducers
Granted: March 5, 2024
Patent Number:
11924619
An apparatus and method of rendering audio. A binaural signal is split on an amplitude weighting basis into a front binaural signal and a rear binaural signal, based on perceived position information of the audio. In this manner, the front-back differentiation of the binaural signal is improved.
Signal reshaping for high dynamic range signals
Granted: March 5, 2024
Patent Number:
11924477
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…
HDR image generation from single-shot HDR color image sensors
Granted: March 5, 2024
Patent Number:
11922639
A method for generating an high-dynamic-range (HDR) color image from a dual-exposure-time single-shot HDR color image sensor includes obtaining pixel values generated by a local region of sensor pixels of the image sensor, determining a motion parameter for the local region from pixel values associated with a first color, and demosaicing the pixel values of the local region to determine, for each of three colors, an output value of the images pixel, wherein relative contributions of…
Estimating user location in a system including smart audio devices
Granted: February 27, 2024
Patent Number:
11917386
Methods and systems for performing at least one audio activity (e.g., conducting a phone call or playing music or other audio content) in an environment including by determining an estimated location of a user in the environment in response to sound uttered by the user (e.g., a voice command), and controlling the audio activity in response to determining the estimated user location. The environment may have zones which are indicated by a zone map and estimation of the user location may…
Scalable systems for controlling color management comprising varying levels of metadata
Granted: February 27, 2024
Patent Number:
11917171
Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.
Signal reshaping for high dynamic range signals
Granted: February 20, 2024
Patent Number:
11910025
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…
Orientation-aware surround sound playback
Granted: February 13, 2024
Patent Number:
11902762
Example embodiments disclosed herein relate to orientation-aware surround sound playback. A method for processing audio on an electronic device that includes a plurality of loudspeakers is disclosed, the loudspeakers arranged in more than one dimension of the electronic device. The method includes, responsive to receipt of a plurality of received audio streams, generating a rendering component associated with the plurality of received audio streams, determining an orientation dependent…
Orientation-aware surround sound playback
Granted: February 13, 2024
Patent Number:
11902762
Example embodiments disclosed herein relate to orientation-aware surround sound playback. A method for processing audio on an electronic device that includes a plurality of loudspeakers is disclosed, the loudspeakers arranged in more than one dimension of the electronic device. The method includes, responsive to receipt of a plurality of received audio streams, generating a rendering component associated with the plurality of received audio streams, determining an orientation dependent…
Layered augmented entertainment experiences
Granted: February 6, 2024
Patent Number:
11893700
Spatial information that describes spatial locations of visual objects as in a three-dimensional (3D) image space as represented in one or more multi-view unlayered images is accessed. Based on the spatial information, a cinema image layer and one or more device image layers are generated from the one or more multi-view unlayered images. A multi-layer multi-view video signal comprising the cinema image layer and the device image layers is sent to downstream devices for rendering.
Method and apparatus for screen related adaptation of a Higher-Order Ambisonics audio signal
Granted: February 6, 2024
Patent Number:
11895482
A method for generating loudspeaker signals associated with a target screen size is disclosed. The method includes receiving a bit stream containing encoded higher order ambisonics signals, the encoded higher order ambisonics signals describing a sound field associated with a production screen size. The method further includes decoding the encoded higher order ambisonics signals to obtain a first set of decoded higher order ambisonics signals representing dominant components of the sound…
Steering of binauralization of audio
Granted: February 6, 2024
Patent Number:
11895479
A method for steering binauralization of audio is provided. The method comprises steps of: receiving (410) an audio input signal, calculating (430) a confidence value indicating a likelihood that a current audio frame of the audio input signal comprises binauralized audio; determining (450) a state signal based on the confidence value; determining (460) a steering signal, based on the first confidence value, the state signal and an energy value of the audio frame; and generating (470) an…
Methods and apparatus for compressing and decompressing a higher order ambisonics representation
Granted: February 6, 2024
Patent Number:
11895477
Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore, compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional…
Electro-optical transfer function conversion and signal legalization
Granted: February 6, 2024
Patent Number:
11895416
A device includes an electronic processor configured to define a first set of sample pixels from a set of sample pixels determined from received video data according to a first electro-optical transfer function (EOTF) in a first color representation of a first color space; convert the first set of sample pixels to a second EOTF via a mapping function, producing a second set of sample pixels according to the second EOTF; convert the first and second set of sample pixels from the first…
Media-aware navigation metadata
Granted: February 6, 2024
Patent Number:
11895369
The present disclosure relates to methods and apparatus for processing media content having video content and associated audio content. A method of processing media content having video content and associated audio content comprises the method includes receiving the video content and the associated audio content, analyzing the associated audio content, determining one or more navigation points for enabling navigation of the media content based on the analysis, wherein the one or more…
Compressor target curve to avoid boosting noise
Granted: February 6, 2024
Patent Number:
11894006
The processing of audio signals during playback is provided, so that audio signals that fall below a specified threshold loudness level are processed to avoid making unwanted background noise audible. N-channel audio is received from a playback volume controller/leveler (101). The level of the audio is compared with a threshold level. If the level is greater than the threshold level, the audio is processed with a first amount of gain in accordance with a first dynamic range control (DRC)…
Enabling sampling rate diversity in a voice communication system
Granted: February 6, 2024
Patent Number:
11894005
An audio communication endpoint receives a bitstream containing spectral components representing spectral content of an audio signal, wherein the spectral components relate to a first range extending up to a first break frequency, above which any spectral components are unassigned. The endpoint adapts the received bitstream in accordance with a second range extending up to a second break frequency by removing spectral components or adding neutral-valued spectral components relating to a…