Dolby Laboratories Patent Applications

SOURCE SEPARATION AND REMIXING IN SIGNAL PROCESSING

Granted: February 6, 2025
Application Number: 20250046328
The present disclosure relates to a method and audio processing system (1) for performing source separation. The method comprises obtaining (S1) an audio signal (Sin) including a mixture of speech content and noise content, determining (S2a, S2b, S2c), from the audio signal, speech content (formula A), stationary noise content (formula C) and non-speech content (formula B). The stationary noise content (formula C) is a true subset of the non-speech content (formula B) and the method…

METHOD FOR NEURAL NETWORK TRAINING WITH MULTIPLE SUPERVISORS

Granted: February 6, 2025
Application Number: 20250045585
The present disclosure relates to a method for designing a processor (20) and a computer implemented neural network. The method comprises obtaining input data and corresponding ground truth target data and providing the input data to a processor (20) for outputting a first prediction of target data given the input data. The method further comprises providing the latent variables output by a processor module (21: 1, 21: 2, . . . 21: n?1) to a supervisor module (22: 1, 22: 2, 22: 3, . . .…

BINAURAL RENDERING FOR HEADPHONES USING METADATA PROCESSING

Granted: February 6, 2025
Application Number: 20250045010
Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

DETERMINATION OF MOVEMENT DIRECTION

Granted: February 6, 2025
Application Number: 20250044100
Methods, systems, and media for determining user movement direction are provided. In some embodiments, a method involves obtaining, using a control system, user acceleration data associated with a user. The method involves determining, using the control system, a movement period associated with a movement activity of the user using the user acceleration data, wherein the movement period indicates a duration between two sequential movements by the user. The method involves determining,…

CONTENT-AWARE AUDIO LEVEL MANAGEMENT

Granted: January 30, 2025
Application Number: 20250038726
Described herein is a method of performing content-aware audio processing for an audio signal comprising a plurality of audio components of different types. The method includes source separating the audio signal into at least a voice-related audio component and a residual audio component. The method further includes determining a dynamic audio gain based on the voice-related audio component and the residual audio component. The method also includes performing audio level adjustment for…

CONTROL OF SPEECH PRESERVATION IN SPEECH ENHANCEMENT

Granted: January 30, 2025
Application Number: 20250037729
A method for performing denoising on audio signals is provided. In some implementations, the method involves determining an aggressiveness control parameter value that modulates a degree of speech preservation to be applied. In some implementations, the method involves obtaining a training set of training samples, a training sample having a noisy audio signal and a target denoising mask. In some implementations, the method involves training a machine learning model, wherein the trained…

SCREEN INTERACTION USING EOG COORDINATES

Granted: January 30, 2025
Application Number: 20250036195
A method comprising acquiring a set of voltage signals from a set of electrodes arranged in proximity to the ears of a user, based on the set of voltage signals, determining an EOG gaze vector in ego-centric coordinates, determining a head pose of the user in display coordinates, using a sensor device worn by the user, combining the EOG gaze vector and head pose to obtain a gaze vector in display coordinates, and determining a gaze point by calculating an intersection of the gaze vector…

AUDIO ENCODING AND DECODING USING PRESENTATION TRANSFORM PARAMETERS

Granted: January 16, 2025
Application Number: 20250022475
A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second…

SYSTEM AND TOOLS FOR ENHANCED 3D AUDIO AUTHORING AND RENDERING

Granted: January 16, 2025
Application Number: 20250024222
Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular…

SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Granted: January 16, 2025
Application Number: 20250024086
In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated…

ACOUSTIC ZONING WITH DISTRIBUTED MICROPHONES

Granted: January 16, 2025
Application Number: 20250022465
A method for estimating a user's location in an environment may involve receiving output signals from each microphone of a plurality of microphones in the environment. At least two microphones of the plurality of microphones may be included in separate devices at separate locations in the environment and the output signals may correspond to a current utterance of a user. The method may involve determining multiple current acoustic features from the output signals of each microphone and…

AUDIO ENHANCEMENT FOR MOBILE CAPTURE

Granted: January 2, 2025
Application Number: 20250008284
A system for real-time monitoring of user-generated audio content for audio anomaly and a related method are disclosed. In some embodiments, the system is programmed to receive, in real time, audio data generated by a first mobile device, such as a smartphone. The system is programed to determine, in real time, whether an audio anomaly has occurred from the audio data. The system is programmed to cause, in real time, a presentation of an alert to a second mobile device, which could be…

ESTIMATION OF AUDIO DEVICE AND SOUND SOURCE LOCATIONS

Granted: January 2, 2025
Application Number: 20250008262
Some disclosed methods involve receiving, by a control system, location control data from a sound source as the sound source emits sound in a plurality of sound source locations within an audio environment. Some such methods involve receiving, by the control system, direction of arrival data from each audio device of a plurality of audio devices in the audio environment. In some examples, each audio device of the plurality of audio devices includes a microphone array and the direction of…

PROJECTION SYSTEM AND METHOD WITH BLENDED COLOR GAMUT

Granted: January 2, 2025
Application Number: 20250008080
A projection system and method therefore related to a first projection device; a second projection device; at least one spatial modulator; and an electronic processor configured to: receive a two-dimensional video data, generate a first plurality of intensity values of a first color gamut and a second plurality of intensity values of a second color gamut, subtract a luminance threshold from a plurality of pixel values of the second color gamut to yield a plurality of positive pixel…

SYSTEMS AND METHODS TO GENERATE COPIES OF DATA FOR TRANSMISSION OVER MULTIPLE COMMUNICATION CHANNELS

Granted: January 2, 2025
Application Number: 20250007645
Systems and methods to transmit data over multiple communication channels in parallel with forward error correction. Original packets are evenly distributed to the channels as the initial systematically channel-encoded packets. Subsequent channel-encoded packets are configured to be linearly independent of their base sets of channel-encoded packets, where a base set for a subsequent channel-encoded packet includes those scheduled to be transmitted before the subsequent packet in the same…

AUDIO CONTENT GENERATION AND CLASSIFICATION

Granted: January 2, 2025
Application Number: 20250006208
Some disclosed methods involve receiving audio data of at least a first audio data type and a second audio data type, including audio signals and associated spatial data indicating intended perceived spatial positions for the audio signals, determining at least a first feature type from the audio data and applying a positional encoding process to the audio data, to produce encoded audio data. The encoded audio data may include representations of at least the spatial data and the first…

LEARNABLE HEURISTICS TO OPTIMIZE A MULTI-HYPOTHESIS FILTERING SYSTEM

Granted: January 2, 2025
Application Number: 20250006170
Some disclosed methods involve receiving microphone signals from a microphone system, including signals corresponding to one or more sounds detected by the microphone system. Some methods may involve determining, via a trained neural network, a filtering scheme for the microphone signals, the filtering scheme including one or more filtering processes. The trained neural network may be configured to implement one or more subband-domain adaptive filter management modules. Some methods may…

DATA STRUCTURE FOR MULTIMEDIA APPLICATONS

Granted: January 2, 2025
Application Number: 20250005068
Embodiments described herein provide a unified container format for delivering different multimedia applications. One embodiment provides a data structure utilized for implementing a plurality of multimedia applications. The data structure includes a first metadata level including low-level metadata used to perform operations associated with media data in a bitstream. The data structure includes a second metadata level including mid-level metadata used to apply operation metadata to…

LIGHT PROJECTION SYSTEM USING WHITE LIGHT ILLUMINATION

Granted: January 2, 2025
Application Number: 20250004357
Light projection systems using white light illumination. One embodiment provides a projection system using white light illumination. The projection system includes an illumination assembly configured to receive a white light input. A prism is configured to separate the white light input into color light inputs, redirect the color light inputs to respective modulators, and combine modulated color light inputs from the respective modulators into a white light output. An optical filter is…

FACE REGION DETECTION AND LOCAL RESHAPING ENHANCEMENT

Granted: December 26, 2024
Application Number: 20240428612
Methods and corresponding systems to process face regions are disclosed. The described methods include providing face bounding boxes and confidence levels for the faces, generating a histogram of the pixels and the faces, generating a probability of face, and generating a face probability map. A face contrast adjustment and a face saturation adjustment can be applied to the face probability map.