Google Patent Applications

ATTENTIVE SCORING FUNCTION FOR SPEAKER IDENTIFICATION

Granted: January 25, 2024
Application Number: 20240029742
A speaker verification method includes receiving audio data corresponding to an utterance and processing the audio data to generate a reference attentive d-vector representing voice characteristics of the utterance. The evaluation ad-vector includes n_e style classes, each including a respective value vector concatenated with a corresponding routing vector. The method also includes generating, using a self-attention mechanism, at least one multi-condition attention score that indicates a…

Voice Query QoS based on Client-Computed Content Metadata

Granted: January 25, 2024
Application Number: 20240029740
A method includes receiving an automated speech recognition (ASR) request from a user device that includes a speech input captured by the user device and content metadata associated with the speech input. The content metadata is generated by the user device. The method also includes determining a priority score for the ASR request based on the content metadata associated with the speech input and caching the ASR request in a pre-processing backlog of pending ASR requests each having a…
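The scoring-and-backlog step described in this abstract can be pictured as a priority queue. The following is a minimal sketch only, not the claimed method: the metadata fields (`hotword_detected`, `speech_confidence`), the scoring rule, and the `Backlog` class are all hypothetical names chosen for illustration.

```python
import heapq
import itertools


def priority_score(metadata):
    """Hypothetical scoring rule: the abstract does not say how the score is computed."""
    score = 0.0
    if metadata.get("hotword_detected"):
        score += 1.0
    score += metadata.get("speech_confidence", 0.0)
    return score


class Backlog:
    """Pre-processing backlog of pending requests, served highest score first."""

    def __init__(self):
        self._heap = []
        self._arrival = itertools.count()  # tie-breaker: earlier arrivals first

    def add(self, request_id, metadata):
        score = priority_score(metadata)
        # heapq is a min-heap, so the score is negated to pop the largest first.
        heapq.heappush(self._heap, (-score, next(self._arrival), request_id))
        return score

    def pop_highest(self):
        _, _, request_id = heapq.heappop(self._heap)
        return request_id

    def __len__(self):
        return len(self._heap)


if __name__ == "__main__":
    backlog = Backlog()
    backlog.add("req-1", {"speech_confidence": 0.2})
    backlog.add("req-2", {"hotword_detected": True, "speech_confidence": 0.9})
    while len(backlog):
        print(backlog.pop_highest())
```

Requests with equal scores leave the backlog in arrival order, which is one possible tie-breaking choice among several.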

DEVICES AND METHODS FOR A SPEECH-BASED USER INTERFACE

Granted: January 25, 2024
Application Number: 20240029706
A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The…

Using Aligned Text and Speech Representations to Train Automatic Speech Recognition Models without Transcribed Speech Data

Granted: January 25, 2024
Application Number: 20240029715
A method includes receiving training data that includes unspoken textual utterances in a target language. Each unspoken textual utterance is not paired with any corresponding spoken utterance of non-synthetic speech. The method also includes generating a corresponding alignment output for each unspoken textual utterance using an alignment model trained on transcribed speech utterances in one or more training languages each different than the target language. The method also includes…

Streaming Automatic Speech Recognition With Non-Streaming Model Distillation

Granted: January 25, 2024
Application Number: 20240029716
A method for training a streaming automatic speech recognition student model includes receiving a plurality of unlabeled student training utterances. The method also includes, for each unlabeled student training utterance, generating a transcription corresponding to the respective unlabeled student training utterance using a plurality of non-streaming automated speech recognition (ASR) teacher models. The method further includes distilling a streaming ASR student model from the plurality…

Unified End-To-End Speech Recognition And Endpointing Using A Switch Connection

Granted: January 25, 2024
Application Number: 20240029719
A single E2E multitask model includes a speech recognition model and an endpointer model. The speech recognition model includes an audio encoder configured to encode a sequence of audio frames into corresponding higher-order feature representations, and a decoder configured to generate probability distributions over possible speech recognition hypotheses for the sequence of audio frames based on the higher-order feature representations. The endpointer model is configured to operate…

NETWORK-BASED REMOTE CONTROL

Granted: January 18, 2024
Application Number: 20240022781
The subject matter of the present disclosure can be implemented by, among other things, a remote control device configured to receive an identifier, present a plurality of controllable devices for selection on a display of the remote control device, and receive a selection of two or more controllable devices via the user interface. Using the identifier, a session with the two or more controllable devices is generated and performances of a same task by the two or more controllable devices…

ARBITRARILY GROUPING COMPUTER SYSTEM RESOURCES

Granted: January 18, 2024
Application Number: 20240022475
A method for arbitrarily grouping computer system resources includes receiving a resource policy to be applied to a group of resources associated with a user. The resource policy includes a unique user-defined label assigned to each resource of the group of resources. The method also includes identifying the group of resources among a plurality of resources using the user-defined label. Here, the plurality of resources is associated with the user and distributed across a plurality of…
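As a rough illustration of identifying a group of resources by a user-defined label, consider the sketch below. It assumes a simple in-memory model; the `Resource` and `ResourcePolicy` classes, their fields, and the example label are hypothetical and do not come from the application.

```python
from dataclasses import dataclass, field


@dataclass
class Resource:
    name: str
    labels: set = field(default_factory=set)  # user-defined labels on this resource


@dataclass
class ResourcePolicy:
    label: str  # the user-defined label that defines the group
    rule: str   # placeholder for whatever the policy enforces (assumption)


def select_group(resources, policy):
    """Return the resources that carry the policy's user-defined label."""
    return [r for r in resources if policy.label in r.labels]


if __name__ == "__main__":
    resources = [
        Resource("vm-1", {"team-a"}),
        Resource("bucket-7", {"team-b"}),
        Resource("db-3", {"team-a", "prod"}),
    ]
    policy = ResourcePolicy(label="team-a", rule="read-only")
    for resource in select_group(resources, policy):
        print(resource.name, "->", policy.rule)
```

Because the grouping depends only on the label, resources of different kinds can fall into the same group.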

Tracking Subsea Telecommunications Asset Capacity and Spectrum

Granted: January 18, 2024
Application Number: 20240022438
A method includes generating a first asset token that represents control, by a first entity, of a portion of a physical communication asset. The method includes publishing, to a distributed ledger, ownership of the first asset token and receiving, from a second entity, a request to control the portion of the physical communication asset represented by the first asset token. In response to receiving the request, the method includes removing the first asset token from circulation on the…

Data Transformations to Create Canonical Training Data Sets

Granted: January 18, 2024
Application Number: 20240021310
A method includes obtaining a dataset that includes health data in a Fast Healthcare Interoperability Resources (FHIR) standard. The health data includes a plurality of healthcare events. The method includes generating, using the dataset, an events table that includes the plurality of healthcare events and is indexed by time and a unique identifier per patient encounter. The method also includes generating, using the dataset, a traits table that includes static data and is indexed by the…

Sub-models for Neural Contextual Biasing with Attention and Embedding Space

Granted: January 18, 2024
Application Number: 20240021190
A method for training a sub-model for contextual biasing for speech recognition includes obtaining a base speech recognition model trained on non-biased data. The method includes obtaining a set of training utterances representative of a particular domain, each training utterance in the set of training utterances including audio data characterizing the training utterances and a ground truth transcription of the training utterance. The method further includes, for each corresponding…

ANALYZING LARGE-SCALE DATA PROCESSING JOBS

Granted: January 18, 2024
Application Number: 20240020215
Methods, systems, and apparatus for data analysis in a distributed computing system by accessing data stored at a first processing zone associated with a distributed data processing job, detecting information identifying a particular child job associated with the distributed data processing job, comparing the identifying information to data stored at a second processing zone, and identifying an additional child job as associated with the distributed data processing job based on a result…

Resource Block-Level Index Modulation

Granted: January 11, 2024
Application Number: 20240014986
Techniques and apparatuses are described for resource block-level index modulation. In aspects, a wireless transmitter modulates a first portion of data for a wireless receiver to provide modulation symbols that correspond to the first portion of the data. The wireless transmitter also selects, based on a value of a second portion of the data, respective index locations for one or more resource blocks by which to transmit the modulation symbols. The wireless transmitter then transmits…
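The general idea of index modulation, in which some bits choose the transmitted symbol and other bits choose where it is sent, can be shown with a toy example. This is a sketch under simplifying assumptions, not the scheme in the application: it uses four resource blocks with exactly one active, a QPSK mapping, a noiseless channel, and a receiver that the abstract does not describe.

```python
# Gray-coded QPSK constellation (an assumed mapping for illustration).
QPSK = {
    (0, 0): complex(1, 1),
    (0, 1): complex(-1, 1),
    (1, 1): complex(-1, -1),
    (1, 0): complex(1, -1),
}
NUM_RBS = 4  # assumed number of candidate resource blocks


def transmit(bits):
    """Map 4 bits to a grid: bits[0:2] pick the symbol, bits[2:4] pick the block."""
    symbol = QPSK[(bits[0], bits[1])]   # first portion of the data
    rb_index = bits[2] * 2 + bits[3]    # second portion selects the index
    grid = [0j] * NUM_RBS
    grid[rb_index] = symbol
    return grid


def receive(grid):
    """Recover the bits: find the active block, then the nearest QPSK point."""
    rb_index = max(range(NUM_RBS), key=lambda i: abs(grid[i]))
    symbol_bits = min(QPSK, key=lambda b: abs(QPSK[b] - grid[rb_index]))
    return [symbol_bits[0], symbol_bits[1], rb_index >> 1, rb_index & 1]


if __name__ == "__main__":
    sent = [0, 1, 1, 0]
    grid = transmit(sent)
    print(grid)
    print(receive(grid) == sent)
```

In this toy setup each transmission carries four bits while only one resource block holds a symbol: two bits ride on the symbol and two on the block position.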

Unsupervised Data Selection via Discrete Speech Representation for Automatic Speech Recognition

Granted: January 11, 2024
Application Number: 20240013777
A method includes obtaining a corpus of unlabeled training data including a plurality of spoken utterances, each corresponding spoken utterance of the plurality of spoken utterances including audio data characterizing the corresponding spoken utterance. The method also includes receiving a target domain. The method also includes selecting, using a contrastive data selection model, a subset of the utterances from the corpus of unlabeled training data that correspond to the target domain.…