SIGNAL LEVEL-INDEPENDENT SPEECH ENHANCEMENT
Granted: January 16, 2025
Application Number:
20250022479
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to speech enhancement techniques that are agnostic to varying signal levels in near-end audio signals. In some aspects, a speech enhancement system may include a delay estimator, an input normalizer, and an acoustic echo and noise (AEN) decoupling filter. The delay estimator receives a near-end audio signal via a microphone and a far-end audio signal…
OBJECT DETECTION NETWORKS FOR DISTANT OBJECT DETECTION IN MEMORY-CONSTRAINED DEVICES
Granted: January 2, 2025
Application Number:
20250005906
This disclosure provides methods, devices, and systems for object detection. The present implementations more specifically relate to techniques for improving distant object detection in memory-constrained computer vision systems. In some aspects, a computer vision system may include an ROI extraction component, a feature pyramid network (FPN) having a number (N) of pyramid levels, and N network heads associated with the N pyramid levels, respectively. The FPN extracts N feature maps from…
SYSTEM TO COLLECT TRAINING DATA FOR IMAGING UNDER DISPLAY
Granted: December 5, 2024
Application Number:
20240406580
This disclosure provides methods, devices, and systems for machine learning. The present implementations more specifically relate to automatons that can acquire input images and ground truth images for training neural network models. In some aspects, a system for acquiring training data may include a camera, an electronic display, and an apparatus configured to maintain the camera in a stationary position while moving the electronic display in and out of the camera's field-of-view (FOV).…
HIGH DYNAMIC RANGE (HDR) IMAGE PROCESSING WITH ADAPTIVE COLOR VOLUME MAPPING
Granted: November 28, 2024
Application Number:
20240394857
A method and apparatus for image processing. A data conversion and color-space mapping (DCM) circuit includes an inverse opto-electrical transfer function (IOETF), a color-space converter, and a color-space re-mapper. The IOETF receives image data for one or more frames acquired by an image capture device and transfers the image data from a non-linear domain to a linear domain. The color-space converter converts the linear image data from a first color space to a second color space,…
NETWORK-CAPABLE DOCKING STATION
Granted: November 28, 2024
Application Number:
20240393833
A method performed by a docking station operable in a plurality of modes is disclosed. The method may include obtaining first data via a first interface of the docking station and second data via a second interface of the docking station, responsive to operating in a first mode of the plurality of modes. The first interface may be configured to couple the docking station to a computing device, and the second interface may be configured to communicate with a network. The method may also…
INTERNET OF THINGS (IOT) SENSORS FOR AUTOMOBILES
Granted: November 21, 2024
Application Number:
20240386754
This disclosure provides methods, devices, and systems for achieving connected vehicle functionality. The present implementations more specifically relate to installation-free Internet of things (IoT) sensors for automobiles. In some aspects, an IoT device may include a housing having a power connector configured to interface with an auxiliary power outlet of an automobile and also may include one or more sensors, disposed within the housing, configured to detect changes to a surrounding…
THIRD PARTY APPLICATIONS FOR A NETWORK-CAPABLE DOCKING STATION
Granted: November 21, 2024
Application Number:
20240385971
A docking station operable in a plurality of modes is disclosed. An example method includes obtaining first data via a first interface of the docking station configured to couple the docking station to a computing device and second data via a second interface of the docking station configured to communicate with a network, responsive to operating in a first mode, obtaining third data via the second interface of the docking station, responsive to operating in a second mode, and…
NEURAL NOISE REDUCTION WITH LINEAR AND NONLINEAR FILTERING FOR SINGLE-CHANNEL AUDIO SIGNALS
Granted: November 7, 2024
Application Number:
20240371389
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to speech enhancement techniques that combine statistical signal processing with neural network inferencing. In some aspects, a speech enhancement system may include a linear filter, a deep neural network (DNN), and a nonlinear post-filter. The linear filter and the nonlinear post-filter are configured to suppress noise in audio signals using…
AUDIO SOURCE SEPARATION FOR MULTI-CHANNEL BEAMFORMING BASED ON PERSONAL VOICE ACTIVITY DETECTION (VAD)
Granted: November 7, 2024
Application Number:
20240371386
This disclosure provides methods, devices, and systems for speech enhancement. The present implementations more specifically relate to utilizing personal voice activity detectors (VADs) to suppress audio originating from a distractor audio source without distorting audio originating from a target audio source. In some aspects, a speech enhancement system may receive a multi-channel audio signal via a microphone array and may further generate, based on a neural network, an inference about…
AUDIO SOURCE SEPARATION FOR MULTI-CHANNEL BEAMFORMING BASED ON FACE DETECTION
Granted: October 24, 2024
Application Number:
20240355349
This disclosure provides methods, devices, and systems for speech enhancement. The present implementations more specifically relate to utilizing multiple modalities to suppress audio originating from a distractor audio source without distorting audio originating from a target audio source. In some aspects, a speech enhancement system may receive a multi-channel audio signal via a microphone array and may further receive an image associated with a respective frame of the audio signal. The…
SPEECH ENHANCEMENT SYSTEM
Granted: October 24, 2024
Application Number:
20240355347
A method of suppressing noise may include receiving a sequence of audio frames representing a multi-channel audio signal. The method may include determining a likelihood of speech in a first audio frame of the sequence of audio frames based on a Gaussian mixture model. Further, the method may include generating a first audio signal based on the likelihood of speech in the first audio frame and a second audio signal representing a first speech component of a second audio frame. The second…
CASCADE AUDIO SPOTTING SYSTEM
Granted: October 10, 2024
Application Number:
20240339124
Systems and methods for identifying audio events in one or more audio streams include the use of a cascade audio spotting system (such as a cascade keyword spotting system (KWS)) to reduce power consumption while maintaining a desired performance. An example cascade audio spotting system may include a first module and a high-power subsystem. The first module is to receive an audio stream from one or more audio streams, process the audio stream to detect a first target sound activity in…
AUDIO SOURCE CLASSIFICATION FOR HANDSFREE COMMUNICATIONS
Granted: September 19, 2024
Application Number:
20240312472
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to speech enhancement techniques that utilize multi-channel audio signals for audio source classification. In some aspects, a speech enhancement system may include an adaptive filter, a feature extractor, and a feature classifier. The adaptive filter is configured to receive a multi-channel audio signal, via at least a first microphone and a second…
ADAPTIVE PROXIMITY SENSING DELAY
Granted: September 19, 2024
Application Number:
20240310955
An input device includes a proximity sensing panel including sensor electrodes and a proximity sensing circuit. The proximity sensing circuit is configured to acquire, for a sensing frame, sensing measurements of a sensing region using the sensor electrodes, process, for the sensing frame, the sensing measurements to obtain positional information, transmit the positional information to a processing system, and receive, from the processing system, vertical synchronization (Vsync) signal…
LOW-LATENCY SPEECH ENHANCEMENT
Granted: September 12, 2024
Application Number:
20240304204
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to low-latency speech enhancement. In some aspects, a speech enhancement system may receive a number (B) of frames of a signal, where each of the B frames include a number (N) of time-domain samples. The speech enhancement system may transform the B*N time-domain samples into B*N first frequency-domain samples based on an N-point fast Fourier transform…
NETWORK-AGNOSTIC REGION OF INTEREST (ROI) INFERENCING
Granted: August 8, 2024
Application Number:
20240265665
This disclosure provides methods, devices, and systems for object detection. The present implementations more specifically relate to region of interest (ROI) inferencing techniques that can be implemented using a single object detection model. In some aspects, a computer vision system maps a set of grid cells to an input image so that each grid cell includes a respective portion of the image, and where each of the grid cells is assigned a respective priority value. The system selects an…
DATA PRE-PROCESSING FOR LOW-LIGHT IMAGES
Granted: August 1, 2024
Application Number:
20240257303
This disclosure provides methods, devices, and systems for low-light imaging. In some implementations, an image processor may be configured to reduce or remove noise associated with an image based, at least in part, on a neural network. For example, the neural network may be trained to infer a denoised representation of the image. In some aspects, the image processor may scale the brightness level of the image to fall within a normalized range of values associated with the neural…
VIDEO COMPRESSION BASED ON SPATIAL-TEMPORAL FEATURES
Granted: August 1, 2024
Application Number:
20240259575
This disclosure provides methods, devices, and systems for video compression. The present implementations more specifically relate to video compression techniques that account for spatial-temporal changes in pixel values. In some aspects, an encoder may determine a change importance factor (CIF) for each image tile of a current image to be encoded. The encoder may calculate the CIF for an image tile of the current image (the “current image tile”) based on a degree of variation among…
NEURAL TEMPORAL BEAMFORMER FOR NOISE REDUCTION IN SINGLE-CHANNEL AUDIO SIGNALS
Granted: August 1, 2024
Application Number:
20240257827
This disclosure provides methods, devices, and systems for audio signal processing. The present implementations more specifically relate to multi-frame beamforming using neural network supervision. In some aspects, a speech enhancement system may include a linear filter, a deep neural network (DNN), a voice activity detector (VAD), and an IFC calculator. The DNN infers a probability of speech (pDNN) in a current frame of a single-channel audio signal based on a neural network model. The…
SPATIO-TEMPORAL BEAMFORMER
Granted: August 1, 2024
Application Number:
20240257822
This disclosure provides methods, devices, and systems for signal processing. The present implementations relate more specifically to a spatio-temporal beamformer. In some aspects, a beamforming system may receive an audio signal via a plurality of microphones, the audio signal including a number (B) of frames for each of the plurality of microphones, each of the B frames for each of the plurality of microphones including a number (N) of time-domain samples. For a first microphone, the…