Source color volume information messaging
Granted: October 1, 2024
Patent Number:
12108086
Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and…
Frame-rate scalable video coding
Granted: October 1, 2024
Patent Number:
12108061
Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific…
Frame-rate scalable video coding
Granted: October 1, 2024
Patent Number:
12108060
Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific…
Bass enhancement for loudspeakers
Granted: September 24, 2024
Patent Number:
12101613
A method of audio processing includes generating harmonics in a hybrid complex quadrature mirror filter domain. Generating the harmonics may include multiplication, using a feedback delay loop, and dynamic compression. The harmonics may be generated based on one or more hybrid sub-bands of the complex transform domain signal.
Supporting view direction based random access of bitstream
Granted: September 24, 2024
Patent Number:
12101489
A non-random-access video stream is received. A first image block is encoded after second image blocks according to a non-random-access processing order. View direction data is received to indicate a viewer's view direction coinciding with a location covered by the first image block. The first image block is encoded into the random-access video stream before the second image blocks in a random-access processing order. The random-access video stream is delivered to a recipient decoding…
Video content creation tool and method
Granted: September 24, 2024
Patent Number:
12100426
A content-creation tool includes a processor and a memory. The processor is configured to receive a first video clip and a second video clip, a respective first and second metadata-item thereof being set to a respective first and second metadata-value. The memory stores video-editing software that includes a timeline interface and instructions that, when executed by the processor, control the processor to: add the first video clip to the timeline interface as a first timeline-track that…
Speaker
Granted: September 24, 2024
Patent Number:
D1043610
Coding and decoding of interleaved image data
Granted: September 17, 2024
Patent Number:
12096029
Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the…
Personalized HRTFs via optical capture
Granted: September 17, 2024
Patent Number:
12096200
An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their…
Source color volume information messaging
Granted: September 17, 2024
Patent Number:
12096038
Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and…
Systems, methods and apparatus for conversion from channel-based audio to object-based audio
Granted: September 17, 2024
Patent Number:
12094476
Embodiments are disclosed for channel-based audio (CBA) (e.g., 22.2-ch audio) to object-based audio (OBA) conversion. The conversion includes converting CBA metadata to object audio metadata (OAMD) and reordering the CBA channels based on channel shuffle information derived in accordance with channel ordering constraints of the OAMD. The OBA with reordered channels is rendered in a playback device using the OAMD or in a source device, such as a set-top box or audio/video recorder. In an…
Generating binaural audio in response to multi-channel audio using at least one feedback delay network
Granted: September 10, 2024
Patent Number:
12089033
In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a…
Processing of microphone signals for spatial playback
Granted: September 10, 2024
Patent Number:
12089015
Disclosed are methods and systems which convert a multi-microphone input signal to a multichannel output signal making use of a time- and frequency-varying matrix. For each time and frequency tile, the matrix is derived as a function of a dominant direction of arrival and a steering strength parameter. Likewise, the dominant direction and steering strength parameter are derived from characteristics of the multi-microphone signals, where those characteristics include values representative…
Method and device for encoding and decoding intra-frame prediction
Granted: September 10, 2024
Patent Number:
12088788
A method and a device for encoding and decoding infra prediction are disclosed. An image decoding method for performing intra prediction comprises the steps of: receiving a bitstream including data on prediction modes of a current block and a block adjacent to the current block; extracting the data from the received bitstream so as to confirm the prediction mode of the adjacent block; determining whether a boundary pixel within the adjacent block can be used as a reference pixel for the…
Method and apparatus for encoding and decoding an HOA representation
Granted: September 10, 2024
Patent Number:
12087311
The present invention relates to methods and apparatus for encoding an HOA signal representation (c(t)) of a sound field having an order of N and a number O=(N+1)2 of coefficient sequences to a mezzanine HOA signal representation (wMEZZ(t)). The present invention further relates to methods and apparatus for decoding a reconstructed HOA signal representation from the mezzanine HOA signal representation.
Machine learning based dynamic composing in enhanced standard dynamic range video (SDR+)
Granted: September 10, 2024
Patent Number:
12086969
Training image pairs comprising training SDR image and corresponding training HDR images are received. Each training image pair in the training image pairs comprises a training SDR image and a corresponding training HDR image. The training SDR image and the corresponding training HDR image in the training image pair depict same visual content but with different luminance dynamic ranges. Training image feature vectors are extracted from training SDR images in the training image pairs. The…
Transmission-agnostic presentation-based program loudness
Granted: September 3, 2024
Patent Number:
12080308
This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.
Coding and decoding of interleaved image data
Granted: September 3, 2024
Patent Number:
12081797
Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the…
Scalable systems for controlling color management comprising varying levels of metadata
Granted: September 3, 2024
Patent Number:
12081778
Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.
Pre-conditioning audio for echo cancellation in machine perception
Granted: September 3, 2024
Patent Number:
12080317
An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.