Pre-conditioning audio for echo cancellation in machine perception
Granted: September 3, 2024
Patent Number:
12080317
An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.
Transmission-agnostic presentation-based program loudness
Granted: September 3, 2024
Patent Number:
12080308
This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.
Source color volume information messaging
Granted: August 27, 2024
Patent Number:
12075098
Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and…
Projection system and method with blended color gamut
Granted: August 27, 2024
Patent Number:
12075021
A projection system and method therefore related to a first projection device; a second projection device; at least one spatial modulator; and an electronic processor configured to: receive a two-dimensional video data, generate a first plurality of intensity values of a first color gamut and a second plurality of intensity values of a second color gamut, subtract a luminance threshold from a plurality of pixel values of the second color gamut to yield a plurality of positive pixel…
Download control in multi-server communication system
Granted: August 27, 2024
Patent Number:
12074939
Apparatuses and methods for data traffic management in multi-source content delivery are described. The apparatus includes a downloader and a controller. The downloader is coupled to servers via communication links. The controller is configured to determine initial download requests for the servers based on predetermined information about a quality of the links. The controller is also configured to send the initial download requests to the servers with the downloader. The controller is…
Method and apparatus for speech source separation based on a convolutional neural network
Granted: August 27, 2024
Patent Number:
12073828
Described herein is a method for Convolutional Neural Network (CNN) based speech source separation, wherein the method includes the steps of: (a) providing multiple frames of a time-frequency transform of an original noisy speech signal; (b) inputting the time-frequency transform of said multiple frames into an aggregated multi-scale CNN having a plurality of parallel convolution paths; (c) extracting and outputting, by each parallel convolution path, features from the input…
Projection system and method with adjustable angle illumination
Granted: August 27, 2024
Patent Number:
12072614
A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a first mirror and a second mirror, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a filter as on-state light or to reflect the steered light as off-state light to a light…
Methods, apparatus and systems for decompressing a Higher Order Ambisonics (HOA) signal
Granted: August 20, 2024
Patent Number:
12069465
A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (XPS(k?1)) and a frame of an ambient HOA component ({tilde over (C)}AMB(k?1)). The ambient HOA component ({tilde over (C)}AMB(k?1)) comprises, in a layered mode,…
Presentation independent mastering of audio content
Granted: August 20, 2024
Patent Number:
12069464
A method for generating mastered audio content, the method comprising obtaining an input audio content comprising a number, M1, of audio signals, obtaining rendered presentation of the input audio content, the rendered presentation comprising a number, M2, of audio signals, obtaining a mastered presentation generated by mastering the rendered presentation, comparing the mastered presentation with the rendered presentation to determine one or more indications of differences between the…
Controlling a jitter buffer
Granted: August 13, 2024
Patent Number:
12063162
Apparatus and methods for controlling a jitter buffer are described. In one embodiment, the apparatus for controlling a jitter buffer includes an inter-talkspurt delay jitter estimator for estimating an offset value of the delay of a first frame in the current talkspurt with respect to the delay of a latest anchor frame in a previous talkspurt, and a jitter buffer controller for adjusting a length of the jitter buffer based on a long term length of the jitter buffer for each frame and…
Binaural rendering for headphones using metadata processing
Granted: August 13, 2024
Patent Number:
12061835
Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.
Directed interpolation and data post-processing
Granted: August 6, 2024
Patent Number:
12058372
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on…
Directed interpolation and data post-processing
Granted: August 6, 2024
Patent Number:
12058371
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on…
Projector controller and associated method
Granted: July 30, 2024
Patent Number:
12052531
A projector controller includes an object detector and control electronics, and is configured to protect audience members from intense light imposing an exclusion zone in front of a projector. The object detector is configured to optically sense a presence of an object in a detection region beneath the exclusion zone and above the audience members. The control electronics is configured to control the projector when the object detector indicates the presence of the object in the detection…
Audio de-esser independent of absolute signal level
Granted: July 30, 2024
Patent Number:
12051435
Methods, systems, and computer program products of automatic de-essing are disclosed. An automatic de-esser can be used without manually setting parameters and can perform reliable sibilance detection and reduction regardless of absolute signal level, singer gender and other extraneous factors. An audio processing device divides input audio signals into buffers each containing a number of samples, the buffers overlapping one another. The audio processing device transforms each buffer…
Content-aware PQ range analyzer and tone mapping in live feeds
Granted: July 30, 2024
Patent Number:
12050830
In image processing system comprises an input configured to receive an image signal, the image signal including a plurality of frames of image data; and a processor configured to automatically determine an image classification based on at least one frame of the plurality of frames, and dynamically generate a mapping metadata based on the image classification. The processor includes determination circuitry configured to determine a content type for the image signal; segmentation circuitry…
Methods and devices for controlling audio parameters
Granted: July 23, 2024
Patent Number:
12045539
A method of controlling headphones having external microphone signal pass-through functionality may involve controlling a display to present a geometric shape on the display and receiving an indication of digit motion from a sensor system associated with the display. The sensor system may include a touch sensor system or a gesture sensor system. The indication may be an indication of a direction of digit motion relative to the display. The method may involve controlling the display to…
Adaptive image data linearization for HDR image sensors
Granted: July 23, 2024
Patent Number:
12047686
A high-dynamic-range (HDR) camera module with adaptive image data linearization includes (i) an HDR image sensor configured to generate tone-compressed HDR images as respective frames that include active pixel data and metadata, (ii) a processor outside the HDR image sensor, and (iii) a memory outside the HDR image sensor and storing machine-readable instructions that, when executed by the processor, control the processor to: (a) extract, from a frame of a first tone-compressed HDR…
Intra prediction mode mapping method and device using the method
Granted: July 23, 2024
Patent Number:
12047564
The present invention relates to an intra prediction mode mapping method and a device using the method. The intra prediction mode includes: decoding flag information providing information regarding whether an intra prediction mode of a plurality of candidate intra prediction modes for the current block is the same as the intra prediction mode for the current block, and decoding a syntax component including information regarding the intra prediction mode for the current block in order to…
Selective forward error correction for spatial audio codecs
Granted: July 23, 2024
Patent Number:
12046247
Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may…