Google Patent Applications

SYSTEMS AND METHODS FOR WIRELESSLY PROVIDING AN AUDIO STREAM

Granted: May 2, 2024
Application Number: 20240143272
Features described herein pertain to systems and methods for wirelessly providing an audio stream. When audio that is to be output to an audio output device is associated with an application a set of parameters for modifying an established Connected Isochronous Stream (CIS) of a wireless link between an audio source and the audio output device can be determined and the CIS of the wireless link can be modified based on the set of parameters. The audio that is associated with the…

Camera Assembly with Concave-Shaped Front Face

Granted: May 2, 2024
Application Number: 20240147035
The various implementations described herein include a video camera assembly that includes: (1) a housing; (2) an image sensor positioned within the housing and having a field of view corresponding to a scene in the smart home environment; and (3) a concave-shaped front face positioned in front of the image sensor such that light from the scene passes through the front face prior to entering the image sensor; where the front face includes: (a) an inner section corresponding to the image…

METHODS, SYSTEMS, AND MEDIA FOR PRESENTING NOTIFICATIONS INDICATING RECOMMENDED CONTENT

Granted: May 2, 2024
Application Number: 20240146985
Methods, systems, and media for presenting notifications indicating recommended content are provided. A notification of recommended content can be provided. An indication that a user device has initiated a casting session with a display device can be received. A request for recommended content to be presented on the display device can be received. A media content item can be identified based on at least one media content item that has been previously selected by a user account associated…

Event Based Recording

Granted: May 2, 2024
Application Number: 20240146866
An electronic device comprises an image sensor, one or more processors, and memory storing instructions for receiving an event recording profile based on configuration data of the electronic device, the configuration data including a location type or a power type; receiving a plurality of images of a scene captured by the image sensor; detecting a trigger event based on one or more of the plurality of images of the scene; in response to detecting the trigger event, identifying an object…

FIREWALL INSIGHTS PROCESSING AND MACHINE LEARNING

Granted: May 2, 2024
Application Number: 20240146695
A computer-implemented method causes data processing hardware to perform operations for training a firewall utilization model. The operations include receiving firewall utilization data for firewall connection requests during a utilization period. The firewall utilization data includes hit counts for each sub-rule associated with at least one firewall rule. The operations also include generating training data based on the firewall utilization data. The training data includes unused…

END-TO-END SPEECH DIARIZATION VIA ITERATIVE SPEAKER EMBEDDING

Granted: May 2, 2024
Application Number: 20240144957
A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes selecting a respective speaker embedding for the respective speaker by determining a probability that the corresponding temporal embedding includes a…

EXPORTING MODULAR ENCODER FEATURES FOR STREAMING AND DELIBERATION ASR

Granted: May 2, 2024
Application Number: 20240144917
A method includes obtaining a base encoder from a pre-trained model, and receiving training data comprising a sequence of acoustic frames characterizing an utterance paired with a ground-truth transcription of the utterance. At each of a plurality of output steps, the method includes: generating, by the base encoder, a first encoded representation for a corresponding acoustic frame; generating, by an exporter network configured to receive a continuous sequence of first encoded…

Rejecting Biased Data Using A Machine Learning Model

Granted: May 2, 2024
Application Number: 20240144095
A method for rejecting biased data using a machine learning model includes receiving a cluster training data set including a known unbiased population of data and training a clustering model to segment the received cluster training data set into clusters based on data characteristics of the known unbiased population of data. Each cluster of the cluster training data set includes a cluster weight. The method also includes receiving a training data set for a machine learning model and…

Interpretable Tabular Data Learning Using Sequential Sparse Attention

Granted: May 2, 2024
Application Number: 20240144005
A method of interpreting tabular data includes receiving, at a deep tabular data learning network (TabNet) executing on data processing hardware, a set of features. For each of multiple sequential processing steps, the method also includes: selecting, using a sparse mask of the TabNet, a subset of relevant features of the set of features; processing using a feature transformer of the TabNet, the subset of relevant features to generate a decision step output and information for a next…

SCALABLE EXACTLY-ONCE DATA PROCESSING USING TRANSACTIONAL STREAMING WRITES

Granted: May 2, 2024
Application Number: 20240143469
A method for processing data exactly once using transactional stream writes includes receiving, from a client, a batch of data blocks for storage on memory hardware in communication with the data processing hardware. The batch of data blocks is associated with a corresponding sequence number and represents a number of rows of a table stored on the memory hardware. The method also includes partitioning the batch of data blocks into a plurality of sub-batches of data blocks. For each…

CONFIDENCE-BASED APPLICATION-SPECIFIC USER INTERACTIONS

Granted: April 25, 2024
Application Number: 20240134462
This application is directed to a method for controlling user experience (UX) operations on an electronic device that executes an application. A touchless UX operation associated with the application has an initiation condition including at least detection of a presence and a gesture in a required proximity range with a required confidence level. The electronic device then determines from a first sensor signal the proximity of the presence with respect to the electronic device. In…

Signal Adjustments in User Equipment-Coordination Set Joint Transmissions

Granted: April 25, 2024
Application Number: 20240137073
Techniques described herein describe aspects of signal adjustments in user equipment-coordination set, UECS, joint transmissions. A base station analyzes a first joint transmission from multiple user equipments, UEs, participating in a UECS, where the multiple UEs include a coordinating UE of the UECS and at least one non-coordinating UE participating in the UECS. The base station determines that the first joint transmission fails to meet a performance metric and directs the multiple UEs…

EVALUATION-BASED SPEAKER CHANGE DETECTION EVALUATION METRICS

Granted: April 25, 2024
Application Number: 20240135934
A method includes obtaining a multi-utterance training sample that includes audio data characterizing utterances spoken by two or more different speakers and obtaining ground-truth speaker change intervals indicating time intervals in the audio data where speaker changes among the two or more different speakers occur. The method also includes processing the audio data to generate a sequence of predicted speaker change tokens using a sequence transduction model. For each corresponding…

Universal Monolingual Output Layer for Multilingual Speech Recognition

Granted: April 25, 2024
Application Number: 20240135923
A method includes receiving a sequence of acoustic frames as input to a multilingual automated speech recognition (ASR) model configured to recognize speech in a plurality of different supported languages and generating, by an audio encoder of the multilingual ASR, a higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. The method also includes generating, by a language identification (LID) predictor of the multilingual ASR, a language…

Knowledge Distillation with Domain Mismatch For Speech Recognition

Granted: April 25, 2024
Application Number: 20240135918
A method includes receiving distillation data including a plurality of out-of-domain training utterances. For each particular out-of-domain training utterance of the distillation data, the method includes generating a corresponding augmented out-of-domain training utterance, and generating, using a teacher ASR model trained on training data corresponding to a target domain, a pseudo-label corresponding to the corresponding augmented out-of-domain training utterance. The method also…

RESIDUAL ADAPTERS FOR FEW-SHOT TEXT-TO-SPEECH SPEAKER ADAPTATION

Granted: April 25, 2024
Application Number: 20240135915
A method for residual adapters for few-shot text-to-speech speaker adaptation includes obtaining a text-to-speech (TTS) model configured to convert text into representations of synthetic speech, the TTS model pre-trained on an initial training data set. The method further includes augmenting the TTS model with a stack of residual adapters. The method includes receiving an adaption training data set including one or more spoken utterances spoken by a target speaker, each spoken utterance…

CONVERSATION-AWARE PROACTIVE NOTIFICATIONS FOR A VOICE INTERFACE DEVICE

Granted: April 25, 2024
Application Number: 20240135914
A method for proactive notifications in a voice interface device includes: receiving a first user voice request for an action with an future performance time; assigning the first user voice request to a voice assistant service for performance; subsequent to the receiving, receiving a second user voice request and in response to the second user voice request initiating a conversation with the user; and during the conversation: receiving a notification from the voice assistant service of…

METHOD FOR SPEECH-TO-SPEECH CONVERSION

Granted: April 25, 2024
Application Number: 20240135117
The present disclosure relates to a streaming speech-to-speech conversion model, where an encoder runs in real time while a user is speaking, then after the speaking stops, a decoder generates output audio in real time. A streaming-based approach produces an acceptable delay with minimal loss in conversion quality when compared to other non-streaming server-based models. A hybrid model approach for combines look-ahead in the encoder and a non-causal stacker with non-causal…

Using Memory Protection Data

Granted: April 25, 2024
Application Number: 20240135042
The present disclosure describes techniques and apparatuses that are directed to using memory protection data within a computing device. Techniques include allocating regions of a memory for storing application data and protection data. Techniques also include creating a bitmap having bit values corresponding to memory blocks within the allocated regions. The one or more bit values can be indicative of whether application data and/or protection data are present in a memory block. The…

IDENTIFY MALICIOUS SOFTWARE

Granted: April 25, 2024
Application Number: 20240134980
A method for identifying malicious software includes receiving and executing a software application, identifying a plurality of uniform resource identifiers the software application interacts with during execution of the software application, and generating a vector representation for the software application using a feed-forward neural network configured to receive the plurality of uniform resource identifiers as feature inputs. The method also includes determining similarity scores for…