Metadata-preserved audio object clustering
Granted: March 19, 2024
Patent Number:
11937064
Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least one category based on rendering mode information in its metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. A corresponding system and computer program product are also disclosed.
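As a rough illustration only, the classification and cluster-assignment steps in this abstract might be sketched as follows. The field name `rendering_mode`, the mode labels, and the `CLUSTER_BUDGET` table are hypothetical and do not come from the patent; the rendering step itself is not shown.

```python
from collections import defaultdict

# Hypothetical per-category cluster budget (the "predetermined number of clusters").
CLUSTER_BUDGET = {"mode_a": 2, "mode_b": 1}


def classify_by_rendering_mode(objects):
    """Group audio object names by the rendering mode stored in their metadata."""
    categories = defaultdict(list)
    for obj in objects:
        categories[obj["rendering_mode"]].append(obj["name"])
    return dict(categories)


# Hypothetical audio objects, each carrying rendering mode metadata.
objects = [
    {"name": "dialog", "rendering_mode": "mode_a"},
    {"name": "music", "rendering_mode": "mode_b"},
    {"name": "effects", "rendering_mode": "mode_a"},
]

categories = classify_by_rendering_mode(objects)

# Each category receives its predetermined number of clusters (default 1 is an assumption).
allocation = {mode: CLUSTER_BUDGET.get(mode, 1) for mode in categories}
```

Because objects are only clustered with others in the same category, every cluster keeps a single, unambiguous rendering mode.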
Headphones and headphone systems
Granted: March 19, 2024
Patent Number:
11937042
Some headphone systems include two ear cups, a headband assembly, an interface system and a control system. Each ear cup may include an ear cup enclosure, an ear pad assembly, a speaker system and a hinge assembly. The hinge assembly may be disposed within the ear cup enclosure such that it is not visible from outside the ear cup. The headband assembly may connect with each of the ear cups via the hinge assembly. The interface system may include at least one interface and a plurality of…
System and method for displaying high quality images in a dual modulation projection system
Granted: March 19, 2024
Patent Number:
11937023
A novel high efficiency image projection system includes a beam-steering modulator, an amplitude modulator, and a controller. In a particular embodiment the controller generates beam-steering drive values from image data and uses the beam-steering drive values to drive the beam-steering modulator. Additionally, the controller utilizes the beam-steering drive values to generate a lightfield simulation of a lightfield projected onto the amplitude modulator by the beam-steering modulator.…
Frame-rate scalable video coding
Granted: March 19, 2024
Patent Number:
11936888
Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific…
Image-dependent contrast and brightness control for HDR displays
Granted: March 19, 2024
Patent Number:
11935492
Methods and systems to adjust brightness and contrast differently for dark and bright pictures on a display are provided. Given a tone-mapping curve mapping an input dynamic range to a display comprising a minimum and maximum display luminance value, the maximum display luminance value is lowered to an adjusted luminance value according to user-defined parameters. The input dynamic range is tone-mapped to the display dynamic range using the adjusted luminance value. For brightness…
Systems and methods for adapting human speaker embeddings in speech synthesis
Granted: March 12, 2024
Patent Number:
11929058
Novel methods and systems for adapting a voice cloning synthesizer for a new speaker using real speech data are disclosed. Utterances from one or more target speakers are parameterized and are used to initialize an embedding vector for use with a voice synthesizer, by means of clustering the utterance data and determining the centroid of the data, using a speaker identification neural network, and/or by finding the closest stored embedded vector to the utterance data.
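Two of the initialization options named in this abstract, taking the centroid of the utterance data and finding the closest stored embedding, can be sketched in a few lines. This is a minimal sketch under stated assumptions: the utterances are already parameterized as fixed-length vectors, all of them are treated as a single cluster, and Euclidean distance is used. None of these choices, nor the function and speaker names, are taken from the patent.

```python
import math


def centroid(vectors):
    """Element-wise mean of equally sized vectors (single-cluster assumption)."""
    n = len(vectors)
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / n for i in range(dim)]


def nearest_stored(query, stored):
    """Name of the stored embedding closest to `query` (Euclidean distance, an assumption)."""
    return min(stored, key=lambda name: math.dist(query, stored[name]))


# Hypothetical parameterized utterances from one target speaker.
utterances = [[1.0, 0.0], [3.0, 2.0], [2.0, 1.0]]
init_embedding = centroid(utterances)

# Hypothetical table of previously stored speaker embeddings.
stored = {"speaker_a": [0.0, 0.0], "speaker_b": [2.0, 1.5]}
closest = nearest_stored(init_embedding, stored)
```

Either result could serve as the starting embedding vector before any further adaptation on real speech data.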
Adaptive loudness normalization for audio object clustering
Granted: March 12, 2024
Patent Number:
11930347
A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the…
Electro-acoustic transducer
Granted: March 12, 2024
Patent Number:
11930342
An electro-acoustic transducer, comprising a supporting frame, a magnet assembly with an annular yoke surrounding a magnet, a diaphragm attached to the front edge of the supporting frame, a voice coil suspended by the diaphragm in a gap formed between the magnet and the annular yoke, the voice coil being axially movable with respect to the magnet, and an annular damper arranged to stabilize the diaphragm. The transducer further comprises a damper holder having a substantially flat annular…
Blind detection of binauralized stereo content
Granted: March 12, 2024
Patent Number:
11929091
An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
Method and apparatus for controlling enhancement of low-bitrate coded audio
Granted: March 12, 2024
Patent Number:
11929085
Described herein is a method of low-bitrate coding of audio data and generating enhancement metadata for controlling audio enhancement of the low-bitrate coded audio data at a decoder side, including the steps of: (a) core encoding original audio data at a low bitrate to obtain encoded audio data; (b) generating enhancement metadata to be used for controlling a type and/or amount of audio enhancement at the decoder side after core decoding the encoded audio data; and (c) outputting the…
Rendering binaural audio over multiple near field transducers
Granted: March 5, 2024
Patent Number:
11924619
An apparatus and method of rendering audio. A binaural signal is split on an amplitude weighting basis into a front binaural signal and a rear binaural signal, based on perceived position information of the audio. In this manner, the front-back differentiation of the binaural signal is improved.
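The amplitude-weighted split described here can be illustrated with a small sketch. The raised-cosine weighting law and the use of a single azimuth angle as the "perceived position information" are assumptions made for illustration; the abstract does not specify how the weights are derived.

```python
import math


def split_front_rear(left, right, azimuth_deg):
    """Split a binaural (left, right) signal into front and rear binaural signals.

    azimuth_deg is the perceived source azimuth: 0 is straight ahead, 180 is behind.
    The raised-cosine weights are an assumed panning law, not the patented one.
    """
    w_front = 0.5 * (1.0 + math.cos(math.radians(azimuth_deg)))
    w_rear = 1.0 - w_front  # weights sum to 1, so front + rear reproduces the input
    front = ([w_front * s for s in left], [w_front * s for s in right])
    rear = ([w_rear * s for s in left], [w_rear * s for s in right])
    return front, rear


# A source straight ahead is routed entirely to the front signal.
front, rear = split_front_rear([1.0, -0.5], [0.5, 0.25], azimuth_deg=0.0)

# A source at 60 degrees is shared between front and rear.
front_60, rear_60 = split_front_rear([1.0], [1.0], azimuth_deg=60.0)
```

With this choice of weights, a source to the side (90 degrees) is split about equally, and a source directly behind is routed entirely to the rear signal.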
Signal reshaping for high dynamic range signals
Granted: March 5, 2024
Patent Number:
11924477
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…
HDR image generation from single-shot HDR color image sensors
Granted: March 5, 2024
Patent Number:
11922639
A method for generating a high-dynamic-range (HDR) color image from a dual-exposure-time single-shot HDR color image sensor includes obtaining pixel values generated by a local region of sensor pixels of the image sensor, determining a motion parameter for the local region from pixel values associated with a first color, and demosaicing the pixel values of the local region to determine, for each of three colors, an output value of the image pixel, wherein relative contributions of…
Estimating user location in a system including smart audio devices
Granted: February 27, 2024
Patent Number:
11917386
Methods and systems for performing at least one audio activity (e.g., conducting a phone call or playing music or other audio content) in an environment, including by determining an estimated location of a user in the environment in response to sound uttered by the user (e.g., a voice command), and controlling the audio activity in response to determining the estimated user location. The environment may have zones which are indicated by a zone map and estimation of the user location may…
Scalable systems for controlling color management comprising varying levels of metadata
Granted: February 27, 2024
Patent Number:
11917171
Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.
Signal reshaping for high dynamic range signals
Granted: February 20, 2024
Patent Number:
11910025
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…
Orientation-aware surround sound playback
Granted: February 13, 2024
Patent Number:
11902762
Example embodiments disclosed herein relate to orientation-aware surround sound playback. A method for processing audio on an electronic device that includes a plurality of loudspeakers is disclosed, the loudspeakers arranged in more than one dimension of the electronic device. The method includes, responsive to receipt of a plurality of received audio streams, generating a rendering component associated with the plurality of received audio streams, determining an orientation dependent…
Layered augmented entertainment experiences
Granted: February 6, 2024
Patent Number:
11893700
Spatial information that describes spatial locations of visual objects as in a three-dimensional (3D) image space as represented in one or more multi-view unlayered images is accessed. Based on the spatial information, a cinema image layer and one or more device image layers are generated from the one or more multi-view unlayered images. A multi-layer multi-view video signal comprising the cinema image layer and the device image layers is sent to downstream devices for rendering.
Media-aware navigation metadata
Granted: February 6, 2024
Patent Number:
11895369
The present disclosure relates to methods and apparatus for processing media content having video content and associated audio content. A method of processing media content having video content and associated audio content comprises receiving the video content and the associated audio content, analyzing the associated audio content, determining one or more navigation points for enabling navigation of the media content based on the analysis, wherein the one or more…