Systems and methods for optimizing nested loop instructions in pipeline processing stages within a machine perception and dense algorithm integrated circuit

A technology in the field of machine perception and integrated circuits. It addresses two problems: the processing capabilities behind sensors are not robust enough for high-performance, real-time computing of sensor signal data, and the architectures of most GPUs are not optimized for handling many complex machine learning algorithms. The stated effects are a reduced operational penalty to the integrated circuit and improved performance of the integrated circuit.

Active Publication Date: 2021-07-08
QUADRIC IO INC

AI Technical Summary

Benefits of technology

The patent describes a method and system for improving the performance of an integrated circuit by evaluating a target instruction set and identifying nested loop instructions within it. The system then determines whether the innermost loop body requires a loop optimization to mitigate an operational penalty to the integrated circuit. The loop optimization modifies the target instruction set to include loop optimization instructions that control, at runtime, the execution and termination of the innermost loop body, thereby reducing the operational penalty. The technical effect of this patent is improved performance and more efficient execution of complex integrated circuit functions.

Problems solved by technology

However, while many of these advanced and capable sensors may enable a robust capture of the physical environments of many autonomous implementations, the underlying processing circuitry that processes the sensor signal data often lacks correspondingly robust processing capabilities sufficient for high-performance, real-time computing of that data.
In many applications, GPUs are implemented rather than CPUs because GPUs are capable of executing bulky or large amounts of computations relative to CPUs.
However, the architectures of most GPUs are not optimized for handling many of the complex machine learning algorithms (e.g., neural network algorithms, etc.) used in machine perception technology.
Modern GPUs, however, are not constructed to handle these additional high-computation tasks.
This fragmented and piecemeal approach to handling the additional perception processing needs of robotics and autonomous machines results in a number of inefficiencies in performing computations including inefficiencies in sensor signal processing.

Examples

Embodiment Construction

[0031]The following description of preferred embodiments of the present application is not intended to limit the inventions to these preferred embodiments, but rather to enable any person skilled in the art to make and use these inventions.

[0032]I. Overview

[0033]In an integrated circuit configured with pipeline processing stages, a backwards branching instruction may cause stalls in the execution of one or more instruction sets, thereby increasing the number of clock cycles required for processing the instruction set and reducing the performance of the integrated circuit. Backwards branching instructions are typically found in loop instructions and the like, in which executing a new iteration of the loop may require that the integrated circuit jump from a terminal instruction of the loop body back to a branch for executing a new iteration of the loop instructions. Additionally, for certain instruction types such as tight nested loops, stalls may be extremely expensive ...
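To make the cost described in paragraph [0033] concrete, the following is a rough cycle-count sketch for a two-level loop nest. It assumes a simplified model that is not taken from the patent: every taken backward branch costs a fixed number of stall cycles, and there is no branch prediction. The function name and all numeric values are hypothetical.

```python
def nested_loop_cycles(outer_iters, inner_iters, body_cycles, stall_cycles):
    """Estimate cycles for a two-level loop nest under a fixed-stall model.

    Assumption (illustrative only): each taken backward branch stalls the
    pipeline for `stall_cycles` cycles, and only the inner body does work.
    """
    if outer_iters < 1 or inner_iters < 1:
        raise ValueError("both loops must run at least once")
    # Useful work: the inner body runs once per (outer, inner) iteration pair.
    work = outer_iters * inner_iters * body_cycles
    # Each loop branches back on every iteration except its last one.
    taken_branches = outer_iters * (inner_iters - 1) + (outer_iters - 1)
    return work + taken_branches * stall_cycles


# Hypothetical tight nest: 64 x 16 iterations, 2-cycle body, 3-cycle stall.
baseline = nested_loop_cycles(64, 16, body_cycles=2, stall_cycles=0)
stalled = nested_loop_cycles(64, 16, body_cycles=2, stall_cycles=3)
stall_share = (stalled - baseline) / stalled
```

With these made-up numbers, stalls account for roughly 60% of all cycles, which illustrates why a short inner body makes backward-branch stalls so expensive relative to the useful work.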

Abstract

In one embodiment, a method for improving a performance of an integrated circuit includes implementing one or more computing devices executing a compiler program that: (i) evaluates a target instruction set intended for execution by an integrated circuit; (ii) identifies one or more nested loop instructions within the target instruction set based on the evaluation; (iii) evaluates whether a most inner loop body within the one or more nested loop instructions comprises a candidate inner loop body that requires a loop optimization that mitigates an operational penalty to the integrated circuit based on one or more executional properties of the most inner loop instruction; and (iv) implements the loop optimization that modifies the target instruction set to include loop optimization instructions to control, at runtime, an execution and a termination of the most inner loop body thereby mitigating the operational penalty to the integrated circuit.
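The four compiler steps in the abstract can be sketched on a toy intermediate representation. This is a minimal illustration under stated assumptions, not the patented implementation: the `Instr` and `Loop` types, the `MAX_TIGHT_BODY` threshold used as the "executional property", and the `loop_setup` and `branch_back` pseudo-instructions are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Union

MAX_TIGHT_BODY = 4  # hypothetical threshold for a "tight" loop body


@dataclass
class Instr:
    op: str


@dataclass
class Loop:
    trip_count: int
    body: List["Node"] = field(default_factory=list)
    hw_controlled: bool = False  # set when the optimization is applied


Node = Union[Instr, Loop]


def innermost_nested_loops(nodes: List[Node], depth: int = 0) -> Iterator[Loop]:
    """Steps (i)-(ii): walk the program and yield innermost loops of a nest."""
    for node in nodes:
        if isinstance(node, Loop):
            if any(isinstance(child, Loop) for child in node.body):
                yield from innermost_nested_loops(node.body, depth + 1)
            elif depth > 0:  # only loops that sit inside another loop
                yield node


def is_candidate(loop: Loop) -> bool:
    """Step (iii): assumed property check, a short body that repeats."""
    return loop.trip_count > 1 and len(loop.body) <= MAX_TIGHT_BODY


def optimize(program: List[Node]) -> int:
    """Step (iv): mark candidate loops; returns how many were optimized."""
    optimized = 0
    for loop in innermost_nested_loops(program):
        if is_candidate(loop):
            loop.hw_controlled = True
            optimized += 1
    return optimized


def lower(nodes: List[Node], out: List[str], labels: List[str]) -> List[str]:
    """Flatten to pseudo-assembly (loop counter updates are omitted)."""
    for node in nodes:
        if isinstance(node, Instr):
            out.append(node.op)
        elif node.hw_controlled:
            # Hypothetical instruction that lets hardware run and end the loop.
            out.append(f"loop_setup count={node.trip_count} len={len(node.body)}")
            lower(node.body, out, labels)
        else:
            label = f"L{len(labels)}"
            labels.append(label)
            out.append(f"{label}:")
            lower(node.body, out, labels)
            out.append(f"branch_back {label}")
    return out


program = [Loop(8, [Instr("load"), Loop(16, [Instr("mac")]), Instr("store")])]
optimized_count = optimize(program)
listing = lower(program, [], [])
```

In this sketch the outer loop keeps its backward branch, while the inner loop's branch is replaced by a single setup instruction placed ahead of the body. Whether the actual loop optimization instructions take this form is not stated in the excerpt above.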

Description

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001]This application claims the benefit of U.S. Provisional Application No. 62/957,688, filed 6 Jan. 2020, and of the U.S. Provisional Application No. 63/050,971, filed 13 Jul. 2020, which are incorporated herein in their entireties by this reference.

TECHNICAL FIELD

[0002]The one or more inventions described herein relate generally to the integrated circuitry field, and more specifically to a new and useful perception and dense algorithm processing integrated circuitry architecture in the integrated circuitry field.

BACKGROUND

[0003]Modern applications of artificial intelligence and generally, machine learning appear to be driving innovations in robotics and specifically, in technologies involving autonomous robotics and autonomous vehicles. Also, the developments in machine perception technology have enabled the abilities of many of the implementations in the autonomous robotics' and autonomous vehicles' spaces to perceive vision, perceive hear...

Application Information

IPC(8): G06F9/32, G06F9/38, G06F8/41
CPC G06F9/325, G06F9/3804, G06F8/443, G06F9/3836, G06F9/3808, G06F8/4441, G06F8/445, G06F8/452, G06F9/30065, G06F9/30185
Inventor DREGO, NIGEL; RAVICHANDRAN, MRINALINI; CHANG, JIANMAN; FIRU, DANIEL; KHETERPAL, VEERBHAN
Owner QUADRIC IO INC