Dolby Laboratories Patent Applications

SPATIAL AUDIO SIGNAL MANIPULATION

Granted: January 14, 2021
Application Number: 20210014628
Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is…

Combined Near-Field and Far-Field Audio Rendering and Playback

Granted: January 14, 2021
Application Number: 20210014615
Some disclosed methods may involve receiving audio reproduction data and determining, based on the audio reproduction data, a sound source location at which a sound is to be rendered. A near-field gain and a far-field gain may be based, at least in part, on a sound source distance between the sound source location and a reproduction environment location. Room speaker feed signals may be based, at least in part, on room speaker positions, the sound source location and the far-field gain.…
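
As an illustration only, the distance-dependent split between a near-field gain and a far-field gain could be sketched as an equal-power crossfade. The abstract does not state the gain law, so the function name `near_far_gains`, the `near_radius` and `far_radius` parameters, and the sine/cosine law are all assumptions made for this sketch.

```python
import math
from typing import Sequence, Tuple


def near_far_gains(
    source_pos: Sequence[float],
    listener_pos: Sequence[float],
    near_radius: float = 0.5,   # assumed: at or below this distance, near-field only
    far_radius: float = 2.0,    # assumed: at or beyond this distance, far-field only
) -> Tuple[float, float]:
    """Return (near_gain, far_gain) for a sound source at source_pos.

    Hypothetical equal-power crossfade driven by the source distance;
    not the method claimed in the application.
    """
    if far_radius <= near_radius:
        raise ValueError("far_radius must be greater than near_radius")
    distance = math.dist(source_pos, listener_pos)
    # Normalise the distance to [0, 1] across the crossfade region.
    t = (distance - near_radius) / (far_radius - near_radius)
    t = min(1.0, max(0.0, t))
    # Equal-power law: near_gain**2 + far_gain**2 == 1 for every t.
    near_gain = math.cos(t * math.pi / 2)
    far_gain = math.sin(t * math.pi / 2)
    return near_gain, far_gain
```

In this sketch the far-field gain would scale the room speaker feed signals, while the near-field gain would scale whatever near-field feed the system uses.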

BACKWARD COMPATIBLE DISPLAY MANAGEMENT METADATA COMPRESSION

Granted: December 31, 2020
Application Number: 20200413099
Sequence-level parameters are generated for an image frame sequence including sequence-level indicators for indicating metadata types present for each image frame in the sequence of image frames. Frame-present parameters are generated for a specific image frame in the sequence including frame-present indicators corresponding to the metadata types as indicated in the sequence-level parameters. The frame-present indicators identify first metadata types for which metadata parameter values…
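
A minimal sketch of the two-level signalling idea, assuming one frame-present flag per metadata type listed at sequence level. The function name `frame_present_flags` and the metadata type names are hypothetical. The abstract is truncated before it says how unflagged types are handled, so the sketch simply omits their values.

```python
from typing import Dict, List, Tuple


def frame_present_flags(
    sequence_types: List[str],
    frame_metadata: Dict[str, float],
) -> Tuple[List[int], List[float]]:
    """Build per-frame presence flags and the values that accompany them.

    sequence_types: metadata types declared once for the whole sequence.
    frame_metadata: values actually carried by this frame, keyed by type.
    Returns (flags, values): one flag per sequence-level type, in order,
    and the values for the flagged types in the same order.
    """
    unknown = set(frame_metadata) - set(sequence_types)
    if unknown:
        raise ValueError(f"types not declared at sequence level: {sorted(unknown)}")
    flags = [1 if t in frame_metadata else 0 for t in sequence_types]
    values = [frame_metadata[t] for t in sequence_types if t in frame_metadata]
    return flags, values


# Hypothetical usage: three declared types, two present in this frame.
flags, values = frame_present_flags(
    ["type_a", "type_b", "type_c"],
    {"type_c": 0.5, "type_a": 1.0},
)
```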

SPEECH STYLE TRANSFER

Granted: December 31, 2020
Application Number: 20200410976
Computer-implemented methods for speech synthesis are provided. A speech synthesizer may be trained to generate synthesized audio data that corresponds to words uttered by a source speaker according to speech characteristics of a target speaker. The speech synthesizer may be trained by time-stamped phoneme sequences, pitch contour data and speaker identification data. The speech synthesizer may include a voice modeling neural network and a conditioning neural network.

Source Color Volume Information Messaging

Granted: December 24, 2020
Application Number: 20200404336
Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and…

VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Granted: December 24, 2020
Application Number: 20200403593
A volume leveler controller and controlling method are disclosed. In one embodiment, a volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may be configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and…

METHODS AND APPARATUS FOR DECODING A COMPRESSED HOA SIGNAL

Granted: December 24, 2020
Application Number: 20200402518
Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding…

CONFIGURABLE MODAL AMPLIFIER SYSTEM

Granted: December 17, 2020
Application Number: 20200395908
Configurable amplifier systems are described in which the power supply rail of a linear amplifier, e.g., a class A amplifier, is modulated by a switching amplifier, e.g., a class D amplifier, that may also be configured to operate independently of the linear amplifier. Techniques are also described by which the standing current of the output stage of a linear amplifier is modulated based on the input signal to the linear amplifier or based on modulation of the power supply rail of the…

AUDIO SPEAKERS HAVING UPWARD FIRING DRIVERS FOR REFLECTED SOUND RENDERING

Granted: December 17, 2020
Application Number: 20200396559
Embodiments are directed to upward-firing speakers that reflect sound off a ceiling to a listening location at a distance from a speaker. The reflected sound provides height cues to reproduce audio objects that have overhead audio components. A virtual height filter based on a directional hearing model is applied to the upward-firing driver signal to improve the perception of height for audio signals transmitted by the virtual height speaker to provide optimum reproduction of the…

METHOD OF RENDERING ONE OR MORE CAPTURED AUDIO SOUNDFIELDS TO A LISTENER

Granted: December 17, 2020
Application Number: 20200396555
A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location.…

COMPANDING SYSTEM AND METHOD TO REDUCE QUANTIZATION NOISE USING ADVANCED SPECTRAL EXTENSION

Granted: December 17, 2020
Application Number: 20200395031
Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal by dividing the initial audio signal into a plurality of segments using a defined window shape, calculating a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applying individual gain values to…

Rapid Estimation of Effective Illuminance Patterns for Projected Light Fields

Granted: December 17, 2020
Application Number: 20200394974
Apparatus and methods are provided that employ one or more of a variety of techniques for reducing the time required to display high resolution images on a high dynamic range display having a light source layer and a display layer. In one technique, the image resolution is reduced, an effective luminance pattern is determined for the reduced resolution image, and the resolution of the effective luminance pattern is then increased to the resolution of the display layer. In another…

SOURCE SEPARATION FOR REVERBERANT ENVIRONMENT

Granted: December 10, 2020
Application Number: 20200389749
Embodiments of source separation for a reverberant environment are disclosed. According to a method, first microphone signals for each individual one of at least one source are captured respectively by at least two microphones for a period during which only the individual one produces sounds. Mixing parameters for modeling acoustic paths between the at least one source and the at least two microphones are learned by a processor based on the first microphone signals. Second microphone…

Annoyance Noise Suppression

Granted: December 10, 2020
Application Number: 20200389718
Personal audio systems and methods are disclosed. A personal audio system includes a voice activity detector to determine whether or not an ambient audio stream contains voice activity, a pitch estimator to determine a frequency of a fundamental component of an annoyance noise contained in the ambient audio stream, and a filter bank to attenuate the fundamental component and at least one harmonic component of the annoyance noise to generate a personal audio stream. The filter bank…
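
One way to picture a filter bank that attenuates a fundamental and its harmonics is a set of second-order notch filters centred on multiples of the estimated pitch. This is a sketch under that assumption: the abstract does not say which filter type is used, and the function names, the `q` parameter, and the standard biquad notch design below are illustrative choices rather than the application's method.

```python
import cmath
import math
from typing import List, Tuple

Coeffs = Tuple[Tuple[float, float, float], Tuple[float, float, float]]


def notch_coefficients(freq_hz: float, sample_rate_hz: float, q: float = 30.0) -> Coeffs:
    """Normalised (b, a) coefficients of a standard biquad notch at freq_hz."""
    w0 = 2 * math.pi * freq_hz / sample_rate_hz
    alpha = math.sin(w0) / (2 * q)
    a0 = 1 + alpha
    b = (1 / a0, -2 * math.cos(w0) / a0, 1 / a0)
    a = (1.0, -2 * math.cos(w0) / a0, (1 - alpha) / a0)
    return b, a


def harmonic_notch_bank(
    fundamental_hz: float,
    sample_rate_hz: float,
    num_harmonics: int = 3,
    q: float = 30.0,
) -> List[Tuple[float, Coeffs]]:
    """Notches at the fundamental and up to num_harmonics harmonics below Nyquist."""
    nyquist = sample_rate_hz / 2
    bank = []
    for k in range(1, num_harmonics + 2):  # k = 1 is the fundamental
        freq = k * fundamental_hz
        if freq >= nyquist:
            break
        bank.append((freq, notch_coefficients(freq, sample_rate_hz, q)))
    return bank


def magnitude(coeffs: Coeffs, freq_hz: float, sample_rate_hz: float) -> float:
    """Magnitude response of one biquad at freq_hz."""
    b, a = coeffs
    z1 = cmath.exp(-1j * 2 * math.pi * freq_hz / sample_rate_hz)
    num = b[0] + b[1] * z1 + b[2] * z1 * z1
    den = a[0] + a[1] * z1 + a[2] * z1 * z1
    return abs(num / den)
```

Each notch passes DC at unity gain and has a null at its centre frequency. In a real system, the pitch estimator's output would supply `fundamental_hz`.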

IN-LOOP RESHAPING WITH LOCAL ILLUMINATION COMPENSATION IN IMAGE CODING

Granted: December 10, 2020
Application Number: 20200389648
Methods, processes, and systems are presented for combining signal reshaping (also referred to as luma mapping chroma residuals scaling) with local illumination compensation (LIC) in video coding. Examples and trade-offs when the LIC model parameters are computed in the original domain, the reshaped domain, or a mixed domain, are presented.

LAYERED AUGMENTED ENTERTAINMENT EXPERIENCES

Granted: December 10, 2020
Application Number: 20200388077
Spatial information that describes spatial locations of visual objects in a three-dimensional (3D) image space, as represented in one or more multi-view unlayered images, is accessed. Based on the spatial information, a cinema image layer and one or more device image layers are generated from the one or more multi-view unlayered images. A multi-layer multi-view video signal comprising the cinema image layer and the device image layers is sent to downstream devices for rendering.

CONTEXT AWARE HEARING OPTIMIZATION ENGINE

Granted: December 3, 2020
Application Number: 20200380979
One or more context aware processing parameters and an ambient audio stream are received. One or more sound characteristics associated with the ambient audio stream are identified using a machine learning model. One or more actions to perform are determined using the machine learning model and based on the one or more context aware processing parameters and the identified one or more sound characteristics. The one or more actions are performed.

METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS

Granted: December 3, 2020
Application Number: 20200382889
Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor g = 1/√L. The first matrix is…
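
To make the weighting factor concrete, the sketch below scales a matrix by 1/√L, where L is the number of loudspeakers. The abstract says "at least an element" is weighted and is truncated before it defines the first matrix, so weighting every element, the function name `weight_matrix`, and the example values are assumptions.

```python
import math
from typing import List


def weight_matrix(first_matrix: List[List[float]], num_loudspeakers: int) -> List[List[float]]:
    """Scale every element of first_matrix by g = 1 / sqrt(L).

    Hypothetical helper: the application may weight only some elements.
    """
    if num_loudspeakers <= 0:
        raise ValueError("num_loudspeakers must be positive")
    g = 1.0 / math.sqrt(num_loudspeakers)
    return [[g * x for x in row] for row in first_matrix]


# With L = 4 loudspeakers the factor is g = 0.5.
weighted = weight_matrix([[2.0, 4.0], [6.0, 8.0]], 4)
```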

FRAME-RATE SCALABLE VIDEO CODING

Granted: December 3, 2020
Application Number: 20200382802
Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific…

AUDIO OBJECT CLASSIFICATION BASED ON LOCATION METADATA

Granted: December 3, 2020
Application Number: 20200381003
Methods (700, 800, 900), systems (200, 300, 400, 500, 600) and computer program products are provided. Location metadata (620) associated with an audio object is received (801). The location metadata defines a position of the audio object in an audio scene. It is estimated (630, 802), based on the location metadata, whether the audio object includes dialog. A value representative of a result of the estimation is assigned (803) to an object type parameter (231). In some example…