ADJUSTING AUDIO AND NON-AUDIO FEATURES BASED ON NOISE METRICS AND SPEECH INTELLIGIBILITY METRICS
Granted: January 12, 2023
Application Number:
20230010466
Some implementations involve determining a noise metric and/or a speech intelligibility metric and determining a compensation process corresponding to the noise metric and/or the speech intelligibility metric. The compensation process may involve altering a processing of audio data and/or applying a non-audio-based compensation method. In some examples, altering the processing of the audio data does not involve applying a broadband gain increase to the audio signals. Some examples…
ADJUSTING AUDIO AND NON-AUDIO FEATURES BASED ON NOISE METRICS AND SPEECH INTELLIGIBILITY METRICS
Granted: January 12, 2023
Application Number:
20230009878
Some implementations involve determining a noise metric and/or a speech intelligibility metric and determining a compensation process corresponding to the noise metric and/or the speech intelligibility metric. The compensation process may involve altering a processing of audio data and/or applying a non-audio-based compensation method. In some examples, altering the processing of the audio data does not involve applying a broadband gain increase to the audio signals. Some examples…
PROJECTION SYSTEM AND METHOD WITH MODULAR PROJECTION LENS
Granted: January 12, 2023
Application Number:
20230008842
A projection lens system and method therefor relate to a Fourier lens assembly including a first attachment section, the Fourier lens assembly configured to form a Fourier transform of an object at an exit pupil of the Fourier lens assembly; an aperture configured to block a portion of incident light, the aperture located approximately at a plane of the Fourier transform; and a zoom lens assembly including a second attachment section configured to be removably attached to the first…
AUDIO CHANNEL SPATIAL TRANSLATION
Granted: January 5, 2023
Application Number:
20230007419
The present invention is directed to methods and apparatus for translating a first plurality of audio input channels to a second plurality of audio output channels. This includes determining that there is pair-wise coding among any of the first plurality of audio input channels, determining an input/output-mapping matrix for mapping at least a first set of the first plurality of audio input channels to at least a second set of the second plurality of audio output channels; and deriving…
ELECTRO-ACOUSTIC TRANSDUCER
Granted: January 5, 2023
Application Number:
20230007402
An electro-acoustic transducer, comprising a supporting frame, a magnet assembly with an annular yoke surrounding a magnet, a diaphragm attached to the front edge of the supporting frame, a voice coil suspended by the diaphragm in a gap formed between the magnet and the annular yoke, the voice coil being axially movable with respect to the magnet, and an annular damper arranged to stabilize the diaphragm. The transducer further comprises a damper holder having a substantially flat annular…
DIRECTED INTERPOLATION AND DATA POST-PROCESSING
Granted: January 5, 2023
Application Number:
20230007302
An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on…
CASCADE PREDICTION
Granted: January 5, 2023
Application Number:
20230007294
A first predictor is applied to an input image to generate first-stage predicted codewords approximating prediction target codewords of a prediction target image. Second-stage prediction target values are created by performing an inverse cascade operation on the prediction target codewords and the first-stage predicted codewords. A second predictor is applied to the input image to generate second-stage predicted values approximating the second-stage prediction target values. Multiple…
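The two-stage idea in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption, since the abstract does not specify it: the first predictor is taken to be a least-squares line, the second a per-bin mean lookup, the "inverse cascade operation" is taken to be subtraction (so the second-stage targets are residuals), and the stages are recombined by addition. The function names are hypothetical.

```python
def fit_linear(xs, ys):
    """First predictor (assumed): least-squares line through (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    intercept = mean_y - slope * mean_x
    return lambda x: slope * x + intercept


def fit_bin_means(xs, ys, num_bins, x_max):
    """Second predictor (assumed): mean of ys within equal-width bins of x."""
    def bin_of(x):
        return min(int(x * num_bins / (x_max + 1)), num_bins - 1)

    sums = [0.0] * num_bins
    counts = [0] * num_bins
    for x, y in zip(xs, ys):
        sums[bin_of(x)] += y
        counts[bin_of(x)] += 1
    means = [s / c if c else 0.0 for s, c in zip(sums, counts)]
    return lambda x: means[bin_of(x)]


def fit_cascade(xs, targets, num_bins=4, x_max=255):
    """Fit both stages and return the combined predictor."""
    first = fit_linear(xs, targets)
    stage1 = [first(x) for x in xs]
    # Inverse cascade operation, assumed here to be subtraction:
    # the second stage is trained on what the first stage missed.
    stage2_targets = [t - p for t, p in zip(targets, stage1)]
    second = fit_bin_means(xs, stage2_targets, num_bins, x_max)
    # Cascade operation, assumed here to be addition.
    return lambda x: first(x) + second(x)


def sum_squared_error(predict, xs, targets):
    return sum((predict(x) - t) ** 2 for x, t in zip(xs, targets))
```

With a curved target such as `x * x / 255` over 8-bit codewords, the combined predictor has a lower squared error on the training data than the line alone, because each bin mean removes the average residual in that bin.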
USER-GUIDED IMAGE SEGMENTATION METHODS AND PRODUCTS
Granted: January 5, 2023
Application Number:
20230005243
A method for image segmentation includes (a) clustering, based upon k-means clustering, pixels of an image into first clusters, (b) outputting a cluster map of the first clusters, (c) re-clustering the pixels into a new plurality of non-disjoint pixel-clusters, and (d) classifying the non-disjoint pixel-clusters in categories, according to a user-indicated classification. Another method for image segmentation includes (a) forming a graph with each node of the graph corresponding to a…
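Steps (a) and (b) of the first method can be sketched as follows. This is only an illustration under stated assumptions: pixels are single grayscale intensities, the initial centroids are spread evenly over the intensity range, and the function name `kmeans_cluster_map` is hypothetical. The re-clustering and user-guided classification of steps (c) and (d) are not covered.

```python
def kmeans_cluster_map(image, k=2, iterations=10):
    """Cluster grayscale pixels with 1-D k-means and return (cluster_map, centroids).

    image is a list of rows, each row a list of numeric intensities.
    """
    pixels = [value for row in image for value in row]
    lo, hi = min(pixels), max(pixels)
    # Deterministic initialization (an assumption): evenly spaced centroids.
    centroids = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]

    def nearest(value):
        # Index of the centroid closest to this intensity.
        return min(range(k), key=lambda c: abs(value - centroids[c]))

    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for value in pixels:
            groups[nearest(value)].append(value)
        # Move each centroid to the mean of its members; keep it if empty.
        centroids = [
            sum(group) / len(group) if group else centroids[i]
            for i, group in enumerate(groups)
        ]

    # Step (b): the cluster map holds one cluster label per pixel.
    cluster_map = [[nearest(value) for value in row] for row in image]
    return cluster_map, centroids
```

For a small image with dark and bright pixels, such as `[[10, 12, 200], [11, 210, 205]]`, the map separates the two intensity groups into labels 0 and 1.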
METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS
Granted: December 29, 2022
Application Number:
20220417690
Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and is determined based on weighting at least an element of a first matrix with a weighting factor g = 1/√L. The first matrix is determined…
A PSYCHOACOUSTIC MODEL FOR AUDIO PROCESSING
Granted: December 29, 2022
Application Number:
20220415334
The present disclosure relates to the field of audio coding, in particular, it relates to a method for encoding audio signals through a masking model based on a hearing threshold of frequency intervals of the audio signal and a measured energy of the audio signal for the corresponding frequency intervals. The disclosure further relates to an encoder that is capable of carrying out the audio encoding method.
PERCEPTUAL LUMINANCE NONLINEARITY-BASED IMAGE DATA EXCHANGE ACROSS DIFFERENT DISPLAY CAPABILITIES
Granted: December 29, 2022
Application Number:
20220415283
A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is…
METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS
Granted: December 22, 2022
Application Number:
20220408209
Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and is determined based on weighting at least an element of a first matrix with a weighting factor g = 1/√L. The first matrix is determined…
TENSOR-PRODUCT B-SPLINE PREDICTOR
Granted: December 22, 2022
Application Number:
20220408081
A set of tensor-product B-Spline (TPB) basis functions is determined. A set of selected TPB prediction parameters to be used with the set of TPB basis functions for generating predicted image data in mapped images from source image data in source images of a source color grade is generated. The set of selected TPB prediction parameters is generated by minimizing differences between the predicted image data in the mapped images and reference image data in reference images of a reference…
CONTENT AND ENVIRONMENTALLY AWARE ENVIRONMENTAL NOISE COMPENSATION
Granted: December 22, 2022
Application Number:
20220406326
Some implementations involve receiving a content stream that includes audio data, determining a content type corresponding to the content stream, and determining, based at least in part on the content type, a noise compensation method. Some examples involve performing the noise compensation method on the audio data to produce noise-compensated audio data, rendering the noise-compensated…
DEEP SOURCE SEPARATION ARCHITECTURE
Granted: December 22, 2022
Application Number:
20220406323
A speech separation server comprises a deep-learning encoder with nonlinear activation. The encoder is programmed to take a mixture audio waveform in the time domain, learn generalized patterns from the mixture audio waveform, and generate an encoded representation that effectively characterizes the mixture audio waveform for speech separation.
BITRATE DISTRIBUTION IN IMMERSIVE VOICE AND AUDIO SERVICES
Granted: December 22, 2022
Application Number:
20220406318
Embodiments are disclosed for bitrate distribution in immersive voice and audio services. In an embodiment, a method of encoding an IVAS bitstream comprises: receiving an input audio signal; downmixing the input audio signal into one or more downmix channels and spatial metadata; reading a set of one or more bitrates for the downmix channels and a set of quantization levels for the spatial metadata from a bitrate distribution control table; determining a combination of the one or more…
IMAGE QUALITY METRIC FOR HDR IMAGES AND VIDEO
Granted: December 15, 2022
Application Number:
20220398710
Methods and systems for generating an image quality metric are described. A reference and a test image are first converted to the ITP color space. After calculating difference images ΔI, ΔT, and ΔP, using the color channels of the two images, the difference images are convolved with low pass filters, one for the I channel and one for the chroma channels (T or P). The image quality metric is computed as a function of the sum of squares of filtered ΔI, ΔT, and ΔP values. The chroma…
ACOUSTIC TRANSDUCER HAVING DROP RING CONNECTED AT RESONANT NODE
Granted: December 15, 2022
Application Number:
20220400347
An acoustic transducer includes a housing, a diaphragm, a spider, a motor, and a drop ring. The motor includes a backplate, a frontplate, a magnet, and a voice coil. The drop ring connects the diaphragm to the spider at a circumference of the spider. The drop ring extends parallel with respect to a central axis of the housing. The circumference of the spider is spaced away from the motor and connects to the diaphragm at a resonant node of the diaphragm.
AUDIO DECODER AND DECODING METHOD
Granted: December 15, 2022
Application Number:
20220399027
A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency…