Google Patent Applications

Text Injection For Training Auxiliary Tasks In Speech Recognition Models

Granted: September 5, 2024
Application Number: 20240296840
A joint auxiliary task and ASR model includes an encoder to receive a sequence of acoustic frames and generate, at each of a plurality of output steps, a higher-order feature representation for a corresponding acoustic frame. The model also includes a multi-output HAT decoder to generate at each of the plurality of output steps a probability distribution over possible speech recognition hypotheses, and an indication of whether the output step corresponds to an auxiliary token associated…

SYSTEMS AND METHODS FOR DIGITAL INK GENERATION AND EDITING

Granted: September 5, 2024
Application Number: 20240296603
Systems and methods for editing and generating digital ink. The present technology may provide systems and methods for training a handwriting model to generate digital ink that is stylistically and visually consistent with an original handwriting input, but which incorporates one or more changes to the text of the original handwriting input. In some examples, training may be performed using training examples that include an original handwriting sample and an original label representing…

Large-Scale Language Model Data Selection for Rare-Word Speech Recognition

Granted: August 29, 2024
Application Number: 20240290323
A method of training a language model for rare-word speech recognition includes obtaining a set of training text samples, and obtaining a set of training utterances used for training a speech recognition model. Each training utterance in the set of training utterances includes audio data corresponding to an utterance and a corresponding transcription of the utterance. The method also includes applying rare word filtering on the set of training text samples to identify a subset of…

Enlarging Active Areas of Displays Using Variable Pixel and/or Transistor Densities

Granted: August 29, 2024
Application Number: 20240292660
This document describes systems and techniques directed at enlarging active areas of displays using variable pixel and/or transistor densities. In aspects, a display includes a cover layer positioned as a topmost layer and an array of pixels positioned thereunder. A plurality of transistors, positioned under the array of pixels, may control an electrical activation of one or more pixels within the array of pixels. In implementations, the plurality of transistors define a smaller area…

Intra-User Equipment-Coordination Set Communication

Granted: August 29, 2024
Application Number: 20240292405
Methods, devices, systems, and means for intra-UECS communication by a coordinating user equipment, UE, in a user equipment-coordination set, UECS, are described herein. The coordinating UE allocates first air interface resources to a second UE and second air interface resources to a third UE for intra-UECS communication. The coordinating UE receives, using the allocated first air interface resources, an Internet Protocol, IP, data packet from the second UE in the UECS. The coordinating…

Network Reachability Impact Analysis

Granted: August 29, 2024
Application Number: 20240291720
A method includes obtaining a stream of consecutive network configuration snapshots each including network configuration information. The method also includes determining that first network configuration information of a first network configuration snapshot of the network from the stream of consecutive network configuration snapshots for the network is not the same as second network configuration information of a second network configuration snapshot of the network from the stream of…

Managing Data Availability on Encryption Key Status Changes in Replicated Storage Systems

Granted: August 29, 2024
Application Number: 20240291638
A method includes obtaining a key status for a first cryptographic key. The first cryptographic key is used to encrypt replicated data of a first replication instance. The method also includes determining, based on the key status, that the first cryptographic key is inaccessible which causes the first replication instance to be unavailable. In response to determining that the first cryptographic key is inaccessible, the method includes scheduling a second replication instance to be…

PROVIDING PRE-COMPUTED HOTWORD MODELS

Granted: August 29, 2024
Application Number: 20240290333
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining, for each of multiple words or sub-words, audio data corresponding to multiple users speaking the word or sub-word; training, for each of the multiple words or sub-words, a pre-computed hotword model for the word or sub-word based on the audio data for the word or sub-word; receiving a candidate hotword from a computing device; identifying one or more pre-computed hotword…

Language Model Prediction of API Call Invocations and Verbal Response

Granted: August 29, 2024
Application Number: 20240290327
A method includes obtaining an utterance from a user including a user query directed toward a digital assistant. The method includes generating, using a language model, a first prediction string based on the utterance and determining whether the first prediction string includes an application programming interface (API) call to invoke a program via an API. When the first prediction string includes the API call to invoke the program, the method includes calling, using the API call, the…

Distilling to a Target Device Based on Observed Query Patterns

Granted: August 29, 2024
Application Number: 20240290324
A method includes receiving user queries directed toward a cloud-based assistant service. For each received user query directed toward the cloud-based assistant service, the method also includes extracting one or more attributes from the user query and logging the user query into one or more of a plurality of category buckets based on the one or more attributes extracted from the user query. The method also includes determining when at least one of the plurality of category buckets…

CLUSTERING AND MINING ACCENTED SPEECH FOR INCLUSIVE AND FAIR SPEECH RECOGNITION

Granted: August 29, 2024
Application Number: 20240290322
A method of training an accent recognition model includes receiving a corpus of training utterances spoken across various accents, each training utterance in the corpus including training audio features characterizing the training utterance, and executing a training process to train the accent recognition model on the corpus of training utterances to teach the accent recognition model to learn how to predict accent representations from the training audio features. The accent recognition…

CHUNK-WISE ATTENTION FOR LONGFORM ASR

Granted: August 29, 2024
Application Number: 20240290321
A method includes receiving training data including a corpus of multilingual unspoken textual utterances, a corpus of multilingual un-transcribed non-synthetic speech utterances, and a corpus of multilingual transcribed non-synthetic speech utterances. For each un-transcribed non-synthetic speech utterance, the method includes generating a target quantized vector token and a target token index, generating contrastive context vectors from corresponding masked audio features, and deriving…

Semantic Segmentation With Language Models For Long-Form Automatic Speech Recognition

Granted: August 29, 2024
Application Number: 20240290320
A joint segmenting and ASR model includes an encoder to receive a sequence of acoustic frames and generate, at each of a plurality of output steps, a higher order feature representation for a corresponding acoustic frame. The model also includes a decoder to generate based on the higher order feature representation at each of the plurality of output steps a probability distribution over possible speech recognition hypotheses, and an indication of whether the corresponding output step…

Enlarging Active Areas of Displays in Electronic Devices

Granted: August 29, 2024
Application Number: 20240290272
This document describes systems and techniques directed at enlarging active areas of displays in electronic devices. In aspects, a display includes a grid of transistors positioned within a display panel module to control an illumination of one or more electroluminescent layers. Routing lines extend from one or more transistors of the grid of transistors to at least one electroluminescent layer. In this way, the at least one electroluminescent layer can be positioned away from the grid…

Radar Application Programming Interface

Granted: August 22, 2024
Application Number: 20240280669
Techniques and apparatuses are described that implement a radar application programming interface. The radar application programming interface provides a standardized language for external entities, such as an application of a smart device, to communicate with a radar system without having to understand the design and operational complexities employed by the radar system. With the radar application programming interface, a third party can readily interact with the radar system and customize…

Battery Connector

Granted: August 22, 2024
Application Number: 20240283179
The disclosure describes a battery connector that includes a bendable axis configured to enable a first portion of the battery connector to be electrically connected to a corresponding connector prior to connecting a second portion. A flexible printed circuit may connect the first portion to the second portion and the bendable axis may be located along the flexible printed circuit. A hinge may connect the first portion to the second portion and enable the first portion to bend with…

VOICE RECOGNITION SYSTEM

Granted: August 22, 2024
Application Number: 20240282309
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for voice recognition. In one aspect, a method includes the actions of receiving a voice input; determining a transcription for the voice input, wherein determining the transcription for the voice input includes, for a plurality of segments of the voice input: obtaining a first candidate transcription for a first segment of the voice input; determining one or more contexts associated with…

USING SPEECH RECOGNITION TO IMPROVE CROSS-LANGUAGE SPEECH SYNTHESIS

Granted: August 22, 2024
Application Number: 20240282292
A method for training a speech recognition model includes obtaining a multilingual text-to-speech (TTS) model. The method also includes generating a native synthesized speech representation for an input text sequence in a first language that is conditioned on speaker characteristics of a native speaker of the first language. The method also includes generating a cross-lingual synthesized speech representation for the input text sequence in the first language that is conditioned on…

SINGLE-SIDED DISTRIBUTED STORAGE SYSTEM

Granted: August 22, 2024
Application Number: 20240281404
A distributed storage system including memory hosts and at least one curator in communication with the memory hosts. Each memory host has memory, and the curator manages striping of data across the memory hosts. In response to a memory access request by a client in communication with the memory hosts and the curator, the curator provides the client a file descriptor mapping data stripes and data stripe replications of a file on the memory hosts for remote direct memory access of the file…

Independent Clocking for Configuration and Status Registers

Granted: August 22, 2024
Application Number: 20240281021
This document describes systems and techniques that enable independent clocking for configuration and status registers (CSRs). The described systems and techniques can provide a clock signal to a CSR set of an IP block with a derived clock rate that is an integer division slower than the clock rate of another clock signal that enables operation of the IP block, which may include communication between the IP block and an application processor. The derived clock rate is synchronous to but…