Dolby Laboratories Patent Applications

IMPROVING BASS RESPONSE FOR A SPEAKER IN A PORTABLE COMPUTING DEVICE

Granted: December 28, 2023
Application Number: 20230421953
Methods and systems of improving bass response for a speaker in a portable computing device are described. One portable computing device includes first and second cover parts that are joined together to form a casing of the portable computing device, wherein a speaker volume is formed between portions of the first and second cover parts; a speaker arranged within the speaker volume; and one or more elastic spacers arranged between the first and second cover parts. The one or more elastic…

SUBBAND DOMAIN ACOUSTIC ECHO CANCELLER BASED ACOUSTIC STATE ESTIMATOR

Granted: December 28, 2023
Application Number: 20230421952
Some implementations involve receiving, from a first subband domain acoustic echo canceller (AEC) of a first audio device in an audio environment, first adaptive filter management data from each of a plurality of first adaptive filter management modules, each first adaptive filter management module corresponding to a subband of the first subband domain AEC, each first adaptive filter management module being configured to control a first plurality of adaptive filters. The first plurality…

DIRECTED INTERPOLATION AND DATA POST-PROCESSING

Granted: December 28, 2023
Application Number: 20230421813
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on…

DIRECTED INTERPOLATION AND DATA POST-PROCESSING

Granted: December 28, 2023
Application Number: 20230421811
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on…

MACHINE LEARNING ASSISTED SPATIAL NOISE ESTIMATION AND SUPPRESSION

Granted: December 21, 2023
Application Number: 20230410829
In an embodiment, a method comprises: receiving bands of power spectra of an input audio signal and a microphone covariance, and for each band: estimating, using a classifier, respective probabilities of speech and noise; estimating, using a directionality model, a set of means for speech and noise, or a set of means and covariances for speech and noise, based on the microphone covariance for the band and the probabilities; estimating, using a level model, a mean and covariance of noise…

HYBRID CLOCKING SCHEME FOR TRANSMITTING PACKETIZED AUDIO AND POWER OVER A COMMON CONDUCTOR

Granted: December 14, 2023
Application Number: 20230403091
A distributed amplification and packetized audio transmission system for clock synchronization and alignment between an audio/power source and endpoints with dedicated amplifiers and speakers. An Ethernet audio signal is combined with a Power-Line Communications (PLC) signal for transmission from the source to the endpoints over a common conductor. A single master clock in the source synchronizes the Ethernet audio transmitter with the PLC transmitter. Each end-point has a PLC receiver…

METHOD AND APPARATUS FOR AUDIO PROCESSING USING A CONVOLUTIONAL NEURAL NETWORK ARCHITECTURE

Granted: December 14, 2023
Application Number: 20230401429
Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. A first CNN architecture may comprise a contracting path of a U-net, a multi-scale CNN, and an expansive path of a U-net. The contracting path may comprise a first encoding layer and may be configured to generate an output representation of the contracting path. The multi-scale CNN may be configured to generate, based on the output representation of the…

PROJECTION SYSTEM AND METHOD WITH FOLD MIRROR AND INTEGRATING ROD ADJUSTMENT

Granted: December 7, 2023
Application Number: 20230393452
A projection system and calibration method therefore relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a fold mirror and an integrating rod, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as…

METHOD AND APPARATUS FOR PROCESSING OF AUDIO USING A NEURAL NETWORK

Granted: December 7, 2023
Application Number: 20230395086
Described herein is a method of processing an audio signal using a neural network or using a first and a second neural network. Described is further a method of training said neural network or of jointly training a set of said first and said second neural network. Moreover, described is a method of obtaining and transmitting a latent feature space representation of a perceptual domain audio signal using a neural network and a method of obtaining an audio signal from a latent feature…

GENERAL MEDIA NEURAL NETWORK PREDICTOR AND A GENERATIVE MODEL INCLUDING SUCH A PREDICTOR

Granted: December 7, 2023
Application Number: 20230394287
A neural network system for predicting frequency coefficients of a media signal, the neural network system comprising a time predicting portion including at least one neural network trained to predict a first set of output variables representing a specific frequency band of a current time frame given coefficients of one or several previous time frames, and a frequency predicting portion including a at least one neural network trained to predict a second set of output variables…

SYSTEM AND TOOLS FOR ENHANCED 3D AUDIO AUTHORING AND RENDERING

Granted: November 30, 2023
Application Number: 20230388738
Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular…

ASYMMETRICAL HIGH-FREQUENCY WAVEGUIDE, 3-AXIS RIGGING, AND SPHERICAL ENCLOSURE FOR SURROUND SPEAKERS

Granted: November 30, 2023
Application Number: 20230388702
Embodiments are described for a high-frequency waveguide that improves the performance of large-scale surround sound and immersive audio environments. A horn waveguide is configured to be asymmetric about one of a vertical axis and horizontal axis of the waveguide to form an asymmetric horn waveguide. A spherical enclosure surrounds the asymmetric horn waveguide to form a horn speaker, and a three-axis mounting system is configured to fix the horn speaker to one of a wall or ceiling…

TRIM-PASS CORRECTION FOR CLOUD-BASED CODING OF HDR VIDEO

Granted: November 30, 2023
Application Number: 20230388555
In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment. When a parent scene of a secondary scene is processed by two or more neighboring nodes, initial forward reshaping functions and trim-pass correction parameters are…

METHOD AND APPARTUS FOR AUDIO PROCESSING USING A NESTED CONVOLUTIONAL NEURAL NETWORK ARCHITECHTURE

Granted: November 30, 2023
Application Number: 20230386500
Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. The CNN architecture may comprise a multi-scale input block and a multi-scale nested block. The multi-scale input block may be configured to receive input data and to generate a first downsampled input data set by downsampling the input data. The multi-scale nested block may comprise a first encoding layer configured to generate a first encoded data set by…

ADAPTIVE BLOCK SWITCHING WITH DEEP NEURAL NETWORKS

Granted: November 30, 2023
Application Number: 20230386486
The present invention relates to a method for predicting transform coefficients representing frequency content of an adaptive block length media signal, by receiving a frame and receiving block length information indicating a number of quantized transform coefficients for each block in the frame, the number of quantized transform coefficients being one of a first or second number, wherein the first number is greater than the second number, determining a first block has the second number…

BINAURAL RENDERING FOR HEADPHONES USING METADATA PROCESSING

Granted: November 30, 2023
Application Number: 20230385013
Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

PROJECTION SYSTEM AND METHOD WITH ADJUSTABLE ANGLE ILLUMINATION USING LENS DECENTRATION

Granted: November 30, 2023
Application Number: 20230384656
A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a first lens group and a second lens group, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as…

COLOR TRANSFORMATION FOR HDR VIDEO WITH A CODING-EFFICIENCY CONSTRAINT

Granted: November 16, 2023
Application Number: 20230368344
Using a standard-based RGB to YCbCr color transform a new RGB to YCC 3×3 transformation matrix and a 3×1 offset vector are derived under a set of coding-efficiency constraints. The new RGB to YCC 3×3 transform comprises a luminance scaling factor and a 2×2 chroma sub-matrix that preserves the energy of the standard-based RGB to YCbCr transform while maintaining or improving coding efficiency. It also adds support for an authorization or watermarking mechanism in streaming video…

ADAPTIVE LOCAL RESHAPING FOR SDR-TO-HDR UP-CONVERSION

Granted: November 16, 2023
Application Number: 20230370646
A global index value is generated for selecting a global reshaping function for an input image of a relatively low dynamic range using luma codewords in the input image. Image filtering is applied to the input image to generate a filtered image. The filtered values of the filtered image provide a measure of local brightness levels in the input image. Local index values are generated for selecting specific local reshaping functions for the input image using the global index value and the…

DEEP-LEARNING BASED SPEECH ENHANCEMENT

Granted: November 16, 2023
Application Number: 20230368807
A system for suppressing noise and enhancing speech and a related method are disclosed. The system trains a neural network model that takes banded energies corresponding to an original noisy waveform and produces a speech value indicating the amount of speech present in each band at each frame. The neural model comprises a feature extraction block that implements some lookahead. The feature extraction block is followed by an encoder with steady down-sampling along the frequency domain…