Dolby Laboratories Patent Applications

POST-PROCESSING GAINS FOR SIGNAL ENHANCEMENT

Granted: December 28, 2023
Application Number: 20230419983
A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The…

METHODS AND APPARATUS FOR DECODING A COMPRESSED HOA SIGNAL

Granted: December 28, 2023
Application Number: 20230419975
Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding…

METHODS AND SYSTEMS FOR INTERACTIVE RENDERING OF OBJECT BASED AUDIO

Granted: December 28, 2023
Application Number: 20230419973
Methods for generating an object based audio program which is renderable in a personalizable manner, e.g., to provide an immersive, perception of audio content of the program. Other embodiments include steps of delivering (e.g., broadcasting), decoding, and/or rendering such a program. Rendering of audio objects indicated by the program may provide an immersive experience. The audio content of the program may be indicative of multiple object channels (e.g., object channels indicative of…

MACHINE LEARNING ASSISTED SPATIAL NOISE ESTIMATION AND SUPPRESSION

Granted: December 21, 2023
Application Number: 20230410829
In an embodiment, a method comprises: receiving bands of power spectra of an input audio signal and a microphone covariance, and for each band: estimating, using a classifier, respective probabilities of speech and noise; estimating, using a directionality model, a set of means for speech and noise, or a set of means and covariances for speech and noise, based on the microphone covariance for the band and the probabilities; estimating, using a level model, a mean and covariance of noise…

HYBRID CLOCKING SCHEME FOR TRANSMITTING PACKETIZED AUDIO AND POWER OVER A COMMON CONDUCTOR

Granted: December 14, 2023
Application Number: 20230403091
A distributed amplification and packetized audio transmission system for clock synchronization and alignment between an audio/power source and endpoints with dedicated amplifiers and speakers. An Ethernet audio signal is combined with a Power-Line Communications (PLC) signal for transmission from the source to the endpoints over a common conductor. A single master clock in the source synchronizes the Ethernet audio transmitter with the PLC transmitter. Each end-point has a PLC receiver…

METHOD AND APPARATUS FOR AUDIO PROCESSING USING A CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE

Granted: December 14, 2023
Application Number: 20230401429
Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. A first CNN architecture may comprise a contracting path of a U-net, a multi-scale CNN, and an expansive path of a U-net. The contracting path may comprise a first encoding layer and may be configured to generate an output representation of the contracting path. The multi-scale CNN may be configured to generate, based on the output representation of the…

METHOD AND APPARATUS FOR PROCESSING OF AUDIO USING A NEURAL NETWORK

Granted: December 7, 2023
Application Number: 20230395086
Described herein is a method of processing an audio signal using a neural network or using a first and a second neural network. Described is further a method of training said neural network or of jointly training a set of said first and said second neural network. Moreover, described is a method of obtaining and transmitting a latent feature space representation of a perceptual domain audio signal using a neural network and a method of obtaining an audio signal from a latent feature…

GENERAL MEDIA NEURAL NETWORK PREDICTOR AND A GENERATIVE MODEL INCLUDING SUCH A PREDICTOR

Granted: December 7, 2023
Application Number: 20230394287
A neural network system for predicting frequency coefficients of a media signal, the neural network system comprising a time predicting portion including at least one neural network trained to predict a first set of output variables representing a specific frequency band of a current time frame given coefficients of one or several previous time frames, and a frequency predicting portion including a at least one neural network trained to predict a second set of output variables…

PROJECTION SYSTEM AND METHOD WITH FOLD MIRROR AND INTEGRATING ROD ADJUSTMENT

Granted: December 7, 2023
Application Number: 20230393452
A projection system and calibration method therefore relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a fold mirror and an integrating rod, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as…

SYSTEM AND TOOLS FOR ENHANCED 3D AUDIO AUTHORING AND RENDERING

Granted: November 30, 2023
Application Number: 20230388738
Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular…

ASYMMETRICAL HIGH-FREQUENCY WAVEGUIDE, 3-AXIS RIGGING, AND SPHERICAL ENCLOSURE FOR SURROUND SPEAKERS

Granted: November 30, 2023
Application Number: 20230388702
Embodiments are described for a high-frequency waveguide that improves the performance of large-scale surround sound and immersive audio environments. A horn waveguide is configured to be asymmetric about one of a vertical axis and horizontal axis of the waveguide to form an asymmetric horn waveguide. A spherical enclosure surrounds the asymmetric horn waveguide to form a horn speaker, and a three-axis mounting system is configured to fix the horn speaker to one of a wall or ceiling…

TRIM-PASS CORRECTION FOR CLOUD-BASED CODING OF HDR VIDEO

Granted: November 30, 2023
Application Number: 20230388555
In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment. When a parent scene of a secondary scene is processed by two or more neighboring nodes, initial forward reshaping functions and trim-pass correction parameters are…

METHOD AND APPARTUS FOR AUDIO PROCESSING USING A NESTED CONVOLUTIONAL NEURAL NETWORK ARCHITECHTURE

Granted: November 30, 2023
Application Number: 20230386500
Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. The CNN architecture may comprise a multi-scale input block and a multi-scale nested block. The multi-scale input block may be configured to receive input data and to generate a first downsampled input data set by downsampling the input data. The multi-scale nested block may comprise a first encoding layer configured to generate a first encoded data set by…

ADAPTIVE BLOCK SWITCHING WITH DEEP NEURAL NETWORKS

Granted: November 30, 2023
Application Number: 20230386486
The present invention relates to a method for predicting transform coefficients representing frequency content of an adaptive block length media signal, by receiving a frame and receiving block length information indicating a number of quantized transform coefficients for each block in the frame, the number of quantized transform coefficients being one of a first or second number, wherein the first number is greater than the second number, determining a first block has the second number…

BINAURAL RENDERING FOR HEADPHONES USING METADATA PROCESSING

Granted: November 30, 2023
Application Number: 20230385013
Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

PROJECTION SYSTEM AND METHOD WITH ADJUSTABLE ANGLE ILLUMINATION USING LENS DECENTRATION

Granted: November 30, 2023
Application Number: 20230384656
A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a first lens group and a second lens group, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as…

ADAPTIVE LOCAL RESHAPING FOR SDR-TO-HDR UP-CONVERSION

Granted: November 16, 2023
Application Number: 20230370646
A global index value is generated for selecting a global reshaping function for an input image of a relatively low dynamic range using luma codewords in the input image. Image filtering is applied to the input image to generate a filtered image. The filtered values of the filtered image provide a measure of local brightness levels in the input image. Local index values are generated for selecting specific local reshaping functions for the input image using the global index value and the…

DEEP-LEARNING BASED SPEECH ENHANCEMENT

Granted: November 16, 2023
Application Number: 20230368807
A system for suppressing noise and enhancing speech and a related method are disclosed. The system trains a neural network model that takes banded energies corresponding to an original noisy waveform and produces a speech value indicating the amount of speech present in each band at each frame. The neural model comprises a feature extraction block that implements some lookahead. The feature extraction block is followed by an encoder with steady down-sampling along the frequency domain…

COLOR TRANSFORMATION FOR HDR VIDEO WITH A CODING-EFFICIENCY CONSTRAINT

Granted: November 16, 2023
Application Number: 20230368344
Using a standard-based RGB to YCbCr color transform a new RGB to YCC 3×3 transformation matrix and a 3×1 offset vector are derived under a set of coding-efficiency constraints. The new RGB to YCC 3×3 transform comprises a luminance scaling factor and a 2×2 chroma sub-matrix that preserves the energy of the standard-based RGB to YCbCr transform while maintaining or improving coding efficiency. It also adds support for an authorization or watermarking mechanism in streaming video…

AUDIO DECODER AND DECODING METHOD

Granted: November 9, 2023
Application Number: 20230360659
A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency…