REDUCING POWER CONSUMPTION ASSOCIATED WITH FREQUENCY TRANSITIONING IN A MEMORY INTERFACE
Granted: December 19, 2024
Application Number:
20240419343
Methods and systems are disclosed for frequency transitioning in a memory interface system. Techniques disclosed include receiving a signal indicative of a change in operating frequency, into a new frequency, in a processing unit interfacing with memory via the memory interface system; switching the system from a normal mode of operation into a transition mode of operation; updating control and state register (CSR) banks of respective transceivers of the system through a mission bus used…
Atomic Execution of Processing-in-Memory Operations
Granted: December 19, 2024
Application Number:
20240419330
Scheduling processing-in-memory transactions in systems with multiple memory controllers is described. In accordance with the described techniques, an addressing system segments operations of a transaction into multiple microtransactions, where each microtransaction includes a subset of the transaction operations that are scheduled by a corresponding one of the multiple memory controllers. Each transaction, and its associated microtransactions, is assigned a transaction identifier based…
SPLIT BOUNDING VOLUMES FOR INSTANCES
Granted: December 12, 2024
Application Number:
20240412446
A technique for performing ray tracing operations is provided. The technique includes detecting intersection of a ray with a split bounding volume of an instance of a bounding volume hierarchy; determining whether the split bounding volume meets an instance traversal limiting criterion; and continuing BVH traversal based on the determining.
TRAVERSAL AND PROCEDURAL SHADER BOUNDS REFINEMENT
Granted: December 12, 2024
Application Number:
20240412445
A technique for performing ray tracing operations is provided. The technique includes, traversing through a bounding volume hierarchy for a ray to arrive at a well-fit bounding volume that is associated with first node, wherein the first node is one of a traversal node or a procedural node, and wherein the well-fit bounding volume comprises geometry other than a single axis-aligned bounding box for the first node; evaluating the ray for intersection with the well-fit bounding volume;…
Super-Temporal Cache Replacement Policy
Granted: December 12, 2024
Application Number:
20240411692
Cache replacement policies are described. In accordance with the described techniques, a request for data is received and a cache replacement policy controls how a controller responds to the request. The cache replacement policy assigns each cacheline a priority value, which indicates whether the cacheline should be preserved relative to other cachelines, in response to the request being a cache miss that necessitates eviction of at least one cacheline. The cache replacement policy…
Local Triggering of Processing-in-Memory Operations
Granted: December 12, 2024
Application Number:
20240411462
Local and dynamic triggering of operations executed by a processing-in-memory component is described. In accordance with the described techniques, a processing-in-memory component receives a command from a host for execution by the processing-in-memory component. The processing-in-memory component references a tracking table that includes at least one entry associated with an operation performed as part of executing the command and identifies at least one additional command to be…
PARALLELIZED BOOT SEQUENCE
Granted: December 5, 2024
Application Number:
20240403065
The disclosed device includes multiple special purpose processors that are configured to perform, in parallel, a power on transition sequence for the device, which can involve restoring a data state of components of the device using data stored in local storages of the special purpose processors. Various other methods, systems, and computer-readable media are also disclosed.
FLEXIBLE MEMORY SYSTEM
Granted: December 5, 2024
Application Number:
20240402907
A technique for operating a memory system is disclosed. The technique includes performing a first request, by a first memory client, to access data at a first memory address, wherein the first memory address refers to data in a first memory section that is coupled to the first memory client via a direct memory connection; servicing the first request via the direct memory connection; performing a second request, by the first client, to access data at a second memory address, wherein the…
Memory Control for Data Processing Pipeline Optimization
Granted: November 28, 2024
Application Number:
20240393943
Generating optimization instructions for data processing pipelines is described. A pipeline optimization system computes resource usage information that describes memory and compute usage metrics during execution of each stage of the data processing pipeline. The system additionally generates data storage information that describes how data output by each pipeline stage is utilized by other stages of the pipeline. The pipeline optimization system then generates the optimization…
JITTER REDUCTION IN MIXED-SIGNAL PROCESSING CIRCUITS
Granted: November 28, 2024
Application Number:
20240395295
A signal processing circuit includes an analog front-end circuit and a digital delay circuit. The analog circuit receives a clock signal and provides a compensated clock signal. The digital delay circuit is coupled to the analog front-end circuit and provides a compensated sample clock signal in response to delaying the compensated clock signal. The analog circuit measures a variation of a power supply voltage and adjusts a gain through the analog circuit according to a measured…
Aggregation and Scheduling of Accelerator Executable Tasks
Granted: November 21, 2024
Application Number:
20240385872
In accordance with the described techniques for aggregation and scheduling of accelerator executable tasks, an accelerator device includes a processing element array and a command processor to receive a plurality of fibers each including multiple tasks and dependencies between the multiple tasks. The command processor places a first fiber in a sleep pool based on a first task within the first fiber having an unresolved dependency, and the command processor further places a second fiber…
CLOCK DRIVER WITH DUTY CYCLE CORRECTION
Granted: November 14, 2024
Application Number:
20240380391
A clock driver with duty cycle correction includes a first driver circuit, a second driver circuit, and a correction logic circuit. The first driver circuit performs duty cycle correction on a clock input signal and has parameters selected for a first frequency range of the clock input signal. The second driver circuit is nested with the first driver circuit and performs duty cycle correction on the clock input signal with parameters selected for a second frequency range of the clock…
SYSTEM AGNOSTIC AUTONOMOUS SYSTEM STATE MANAGEMENT
Granted: November 7, 2024
Application Number:
20240370077
A computing device is provided which comprises memory and a processor in communication with the memory. The processor is configured to autonomously acquire input parameter values, comprising one of monitored device input parameter values from a component of the computing device and monitored user input parameter values. The processor is also configured to select, from a plurality of modes of operation, a mode of operation comprising parameter settings which are determined based on the…
Voltage Range for Training Physical Memory
Granted: October 24, 2024
Application Number:
20240355379
Voltage range for training physical memory is described. A device is configurable to include a PHY having an interface to support communication of command signals and data with a physical memory. The PHY implements a training mode to train the interface over a training voltage range to communicate the command signals or data and an operational mode to use the trained interface to communicate the command signals or data over an operational voltage range that is smaller than the training…
SYSTEM AND METHOD FOR EXECUTING A TASK
Granted: October 24, 2024
Application Number:
20240355044
A method, system, and computer-readable medium for executing a task is disclosed. The method includes receiving input data and computing instructions, launching a workgroup including wavefronts to execute the task, wherein the launching causes the wavefronts to process the input data by sharing intermediate results and resources, and adjusting the operation based on characteristics of the wavefronts. The characteristics include data dependencies, computational load, memory usage, and…
INPUT LOCALITY-ADAPTIVE KERNEL CO-SCHEDULING
Granted: October 3, 2024
Application Number:
20240330045
A technique for scheduling executing items on a highly parallel processing architecture is provided. The technique includes identifying a plurality of execution items that share data, as indicated by having matching commonality metadata; identifying an execution unit for executing the plurality of execution items together; and scheduling the plurality of execution items for execution together on the execution unit.
SYSTEMS AND METHODS FOR IMPROVING RESOURCE UTILIZATION AND SYSTEM PERFORMANCE IN END-TO-END ENCRYPTION
Granted: October 3, 2024
Application Number:
20240333519
The disclosed computing device can include super flow control unit (flit) generation circuitry configured to generate a super flit containing two or more flits having two or more requests embedded therein, wherein the two or more requests have the same destination node identifiers and the super flit has a variable size based on a flit size and a number of existing requests in a source node that target a same destination node. The device can additionally include authentication circuitry…
SPHERE-BASED RAY-CAPSULE INTERSECTOR FOR CURVE RENDERING
Granted: October 3, 2024
Application Number:
20240331266
Devices and methods for rendering curves using ray tracing are provided which include tessellating a curve, representing at least a portion of an object in a scene, into a chain of capsules each comprising two spheres and a connecting cone, generating an acceleration structure comprising the chain of capsules, casting a ray in a space comprising the curve, and performing, for a capsule of the chain of capsules, a closed-form intersection test to render the curve. In a first example, the…
Boot-Up and Memory Testing with Chipset Attached Memory
Granted: October 3, 2024
Application Number:
20240330134
A system that includes at least a system memory, a chipset link, and a chipset attached memory is powered down. A boot-up process is loaded in the chipset attached memory. The boot-up process is performed for the system, via the chipset link, by the chipset attached memory. The boot-up process includes loading one or more memory testing applications. The system memory is tested using the one or more memory testing applications loaded by the chipset attached memory.
Task Allocation with Chipset Attached Memory and Additional Processing Unit
Granted: October 3, 2024
Application Number:
20240330076
Task allocation with chipset attached memory and additional processing unit is described. In accordance with the described techniques, a computing device includes a main system and one or more sub-systems which are coupled to the main system via a chipset link. The main system includes at least a processing unit and a system memory. The one or more sub-systems each include at least a chipset attached processing unit and a chipset attached memory. Contents of the system memory are…