NEURAL NOISE REDUCTION WITH LINEAR AND NONLINEAR FILTERING FOR SINGLE-CHANNEL AUDIO SIGNALS
Granted: November 7, 2024
Application Number:
20240371389
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to speech enhancement techniques that combine statistical signal processing with neural network inferencing. In some aspects, a speech enhancement system may include a linear filter, a deep neural network (DNN), and a nonlinear post-filter. The linear filter and the nonlinear post-filter are configured to suppress noise in audio signals using…
AUDIO SOURCE SEPARATION FOR MULTI-CHANNEL BEAMFORMING BASED ON PERSONAL VOICE ACTIVITY DETECTION (VAD)
Granted: November 7, 2024
Application Number:
20240371386
This disclosure provides methods, devices, and systems for speech enhancement. The present implementations more specifically relate to utilizing personal voice activity detectors (VADs) to suppress audio originating from a distractor audio source without distorting audio originating from a target audio source. In some aspects, a speech enhancement system may receive a multi-channel audio signal via a microphone array and may further generate, based on a neural network, an inference about…
AUDIO SOURCE SEPARATION FOR MULTI-CHANNEL BEAMFORMING BASED ON FACE DETECTION
Granted: October 24, 2024
Application Number:
20240355349
This disclosure provides methods, devices, and systems for speech enhancement. The present implementations more specifically relate to utilizing multiple modalities to suppress audio originating from a distractor audio source without distorting audio originating from a target audio source. In some aspects, a speech enhancement system may receive a multi-channel audio signal via a microphone array and may further receive an image associated with a respective frame of the audio signal. The…
SPEECH ENHANCEMENT SYSTEM
Granted: October 24, 2024
Application Number:
20240355347
A method of suppressing noise may include receiving a sequence of audio frames representing a multi-channel audio signal. The method may include determining a likelihood of speech in a first audio frame of the sequence of audio frames based on a Gaussian mixture model. Further, the method may include generating a first audio signal based on the likelihood of speech in the first audio frame and a second audio signal representing a first speech component of a second audio frame. The second…
CASCADE AUDIO SPOTTING SYSTEM
Granted: October 10, 2024
Application Number:
20240339124
Systems and methods for identifying audio events in one or more audio streams include the use of a cascade audio spotting system (such as a cascade keyword spotting system (KWS)) to reduce power consumption while maintaining a desired performance. An example cascade audio spotting system may include a first module and a high-power subsystem. The first module is to receive an audio stream from one or more audio streams, process the audio stream to detect a first target sound activity in…
AUDIO SOURCE CLASSIFICATION FOR HANDSFREE COMMUNICATIONS
Granted: September 19, 2024
Application Number:
20240312472
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to speech enhancement techniques that utilize multi-channel audio signals for audio source classification. In some aspects, a speech enhancement system may include an adaptive filter, a feature extractor, and a feature classifier. The adaptive filter is configured to receive a multi-channel audio signal, via at least a first microphone and a second…
ADAPTIVE PROXIMITY SENSING DELAY
Granted: September 19, 2024
Application Number:
20240310955
An input device includes a proximity sensing panel including sensor electrodes and a proximity sensing circuit. The proximity sensing circuit is configured to acquire, for a sensing frame, sensing measurements of a sensing region using the sensor electrodes, process, for the sensing frame, the sensing measurements to obtain positional information, transmit the positional information to a processing system, and receive, from the processing system, vertical synchronization (Vsync) signal…
LOW-LATENCY SPEECH ENHANCEMENT
Granted: September 12, 2024
Application Number:
20240304204
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to low-latency speech enhancement. In some aspects, a speech enhancement system may receive a number (B) of frames of a signal, where each of the B frames include a number (N) of time-domain samples. The speech enhancement system may transform the B*N time-domain samples into B*N first frequency-domain samples based on an N-point fast Fourier transform…
NETWORK-AGNOSTIC REGION OF INTEREST (ROI) INFERENCING
Granted: August 8, 2024
Application Number:
20240265665
This disclosure provides methods, devices, and systems for object detection. The present implementations more specifically relate to region of interest (ROI) inferencing techniques that can be implemented using a single object detection model. In some aspects, a computer vision system maps a set of grid cells to an input image so that each grid cell includes a respective portion of the image, and where each of the grid cells is assigned a respective priority value. The system selects an…
VIDEO COMPRESSION BASED ON SPATIAL-TEMPORAL FEATURES
Granted: August 1, 2024
Application Number:
20240259576
This disclosure provides methods, devices, and systems for video compression. The present implementations more specifically relate to video compression techniques that account for spatial-temporal changes in pixel values. In some aspects, an encoder may determine a change importance factor (CIF) for each image tile of a current image to be encoded. The encoder may calculate the CIF for an image tile of the current image (the “current image tile”) based on a degree of variation among…
VIDEO COMPRESSION BASED ON SPATIAL-TEMPORAL FEATURES
Granted: August 1, 2024
Application Number:
20240259575
This disclosure provides methods, devices, and systems for video compression. The present implementations more specifically relate to video compression techniques that account for spatial-temporal changes in pixel values. In some aspects, an encoder may determine a change importance factor (CIF) for each image tile of a current image to be encoded. The encoder may calculate the CIF for an image tile of the current image (the “current image tile”) based on a degree of variation among…
IMAGE PROCESSING SYSTEM FOR REGION-OF-INTEREST-BASED VIDEO COMPRESSION
Granted: August 1, 2024
Application Number:
20240259571
An apparatus for remote processing of raw image data receives the raw image data from a camera, such as a security camera. The apparatus includes a detection module to detect portions of the image data that contain possible regions of interest. Information indicating the portions that contain the possible regions of interest is then used during a compression process so that the portions that contain the possible regions of interest are compressed using one or more compression algorithms…
NEURAL TEMPORAL BEAMFORMER FOR NOISE REDUCTION IN SINGLE-CHANNEL AUDIO SIGNALS
Granted: August 1, 2024
Application Number:
20240257827
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to multi-frame beamforming using neural network supervision. In some aspects, a speech enhancement system may include a linear filter, a deep neural network (DNN), a voice activity detector (VAD), and an IFC calculator. The DNN infers a probability of speech (pDNN) in a current frame of a single-channel audio signal based on a neural network model. The…
SPATIO-TEMPORAL BEAMFORMER
Granted: August 1, 2024
Application Number:
20240257822
This disclosure provides methods, devices, and systems for signal processing. The present implementations relate more specifically to a spatio-temporal beamformer. In some aspects, a beamforming system may receive an audio signal via a plurality of microphones, the audio signal including a number (B) of frames for each of the plurality of microphones, each of the B frames for each of the plurality of microphones including a number (N) of time-domain samples. For a first microphone, the…
DATA PRE-PROCESSING FOR LOW-LIGHT IMAGES
Granted: August 1, 2024
Application Number:
20240257303
This disclosure provides methods, devices, and systems for low-light imaging. In some implementations, an image processor may be configured to reduce or remove noise associated with an image based, at least in part, on a neural network. For example, the neural network may be trained to infer a denoised representation of the image. In some aspects, the image processor may scale the brightness level of the image to fall within a normalized range of values associated with the neural…
HYBRID INTER-FRAME CODING USING AN AUTOREGRESSIVE MODEL
Granted: July 18, 2024
Application Number:
20240244237
This disclosure provides methods, devices, and systems for video coding. The present implementations more specifically relate to hybrid coding techniques that combine aspects of inter-frame coding with aspects of intra-frame coding. In some aspects, a video encoder may perform inter-frame coding in a weighted manner so that the coded video frames (also referred to as “residual frames”) may include contributions from the current video frame to be transmitted over a communication…
MULTI-PASS NEURAL NETWORK FOR SPEECH ENHANCEMENT
Granted: July 18, 2024
Application Number:
20240242726
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to multi-pass neural networks configured for speech enhancement. In some aspects, a speech enhancement system may include a deep neural network (DNN) and a statistical signal processor (SSP). The DNN is configured to receive an input audio signal and infer a speech signal representing a speech component of the input audio signal based on a neural…
NEURAL NETWORK CACHING FOR VIDEO
Granted: June 27, 2024
Application Number:
20240212333
This disclosure provides methods, devices, and systems for machine learning. The present implementations more specifically relate to techniques for reducing the computational load of a convolutional neural network (CNN) when processing successive video frames. In some aspects, a machine learning system may cache or store the outputs (also referred to as “activations”) produced by one or more layers of a CNN so that one or more cached activations can be substituted for respective…
BACKLIGHT CONTROL FOR DISPLAY DEVICES
Granted: June 20, 2024
Application Number:
20240203368
A display driver includes a display panel, a backlight array, and backlight control circuitry. The backlight array includes a plurality of light sources and is configured to illuminate the display panel. The backlight control circuitry is configured to determine first base luminance for a first light source of the plurality of light sources based at least in part on pixel data for a first set of pixels associated with the first light source. The backlight control circuitry is further…
IMAGE DATA RECEPTION VIA NON-VIDEO INTERFACE
Granted: June 6, 2024
Application Number:
20240184737
This disclosure provides methods, devices, and systems for transmitting and receiving image data. The present implementations more specifically relate to repurposing non-video interfaces to receive image data. In some aspects, an image processing device may be coupled to an image source via an audio interface. For example, the audio interface may be an inter-IC sound (I2S) serial bus interface having at least a serial data input and a word select (WS) input. In some implementations, the…