# Design Concepts with System Architect: Level 0

John Leidel

Chief Scientist, Tactical Computing Laboratories

ver 2019.03.19



Tactical Computing Laboratories

#### **Tutorial Series**

- Level 0: Introduction to System Architect
- Level 1: System Architect Design Concepts and Developing a basic RISC processor
- Level 2: Instruction-Level (StoneCutter) Implementation Concepts
- Level 3: Advanced Design Concepts
- Level 4: System Architect Plugins and Integrating External RTL



## System Architect Overview

Modular, High-Level Design Concepts



Tactical Computing Laboratories

#### What is System Architect?

- A family of tools, APIs and associated infrastructure to permit users to rapidly develop multi-faceted hardware
- Utilizes a combination of modular hardware design reuse principles, object oriented development and dependence analysis techniques (compiler theory) to provide an infrastructure for:
  - Design & Design Experimentation
  - High Level Verification
- The artifacts generated by a System Architect design flow include:
  - Chisel HDL and Verilog RTL
  - C++ cycle-based simulator
  - LLVM compiler



## What is System Architect **NOT**?

- System Architect is not the latest C-to-gates tool
  - It permits rapid design, verification and reuse
  - It does not auto-generate hardware based upon application code
- System Architect still relies upon the user to utilize reasonable design concepts
  - System Architect will not auto-generate optimized designs based upon unreasonable inputs
  - Users need to have a concept of the physical platform (FPGA, ASIC, etc)
- System Architect does **not** currently have a notion of physical layout
  - The generated output will not include LUT counts, physical design dimensions or power estimates
  - External FPGA/ASIC tools are required for this level of detail



## System Architect Design Flow



## Typical System Architect Design Flows

| <b>Rapid ISA Development</b>                                                                                                                                                                                                                                                                                       | Modular Design                                                                                                                                                                                                                                | <b>Compiler Environment</b>                                                                                                                                                                                                                                                                                                                                                                        |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <ul> <li>Rapid development of<br/>ISA's with backend RTL<br/>and LLVM compiler as<br/>artifacts</li> <li>Cycle-based simulator<br/>that supports<br/>immediate<br/>experimentation</li> <li>Rapid design evaluation<br/>and prototyping</li> <li>High level verification of<br/>design before synthesis</li> </ul> | <ul> <li>Multi-module<br/>design and<br/>integration</li> <li>Integration with<br/>other System<br/>Architect designs<br/>(sub-designs)</li> <li>External module<br/>integration</li> <li>Plugin support for<br/>custom-modularity</li> </ul> | <ul> <li>Auto-generated LLVM<br/>compiler infrastructure<br/>from design parameters</li> <li>Modern C/C++ frontend<br/>support         <ul> <li>Other frontends can be<br/>supported as well</li> </ul> </li> <li>Re-useable as LLVM<br/>mainline continues to<br/>develop         <ul> <li>Re-spinning entire<br/>compiler based on new<br/>LLVM versions is<br/>automated</li> </ul> </li> </ul> |

#### System Architect Infrastructure



## How does it work?

- We input designs in the user-facing tools and generate **CoreGen Intermediate Representation (IR)**
- The IR preserves the natural dependencies within the design
  - Register classes depend upon registers
  - Instruction formats depend upon register classes
  - Instruction sets depend upon instructions
  - Cores require instruction sets
- We execute high level verification "passes" against the IR
  - Similar in design to traditional compiler passes
  - Walks the IR dependence graph and derives properties of the design
  - Reports issues in the design infrastructure, outputs interesting data or optimizes the design infrastructure
- Users implement instructions in StoneCutter instruction implementation language
  - C-like integrated with CoreGen IR to define a <u>single</u> instruction
  - Optimized by a traditional compiler flow to generate Chisel HDL for a single instruction
- Following the high level verification phase, we execute generate downstream code (code generation)
  - Generates Chisel HDL
    - Compiled down to Verilog & C++ cycle-based simulator
  - Generates LLVM compiler for the target design



Example dependence graph from CoreGen design input



#### CoreGen IR Passes

- Passes can be selectively enabled or disabled by the user (just like a normal compiler)
- Four types of passes
  - Analysis
    - Analyze the connectivity and the structure of the graph
    - Reports back the identified state to the user
    - DOES NOT modify the graph (IR is unchanged)
  - Optimization
    - Optimizes connectivity and structure of the graph
    - Much like Kennedy/Callahan dependence analysis/optimization
    - MODIFIES the dependence graph (IR is changed)
  - Data
    - Generates statistical data/output based upon the structure/content of the graph
    - EG, outputs a LaTeX specification document based upon the respective ISA
    - DOES NOT modify the graph (IR is unchanged)
  - System
    - Mixtures of each of the aforementioned pass types
    - Must be manually instantiated by the tools/users

- The internal representation of the IR is stored as a directed acyclic graph (DAG)
- Contains four levels that describe varying levels of detail
  - This is a performance optimization
  - Each subsequent level is a superset of the previous level
  - $Level^{N+1} \supseteq Level^N$



#### CoreGen IR Passes: Example Analysis Passes

| RegSafetyPass/RegIdxPass                                                                                                                                                                                                                                                                                                                                                      | EncodingCollisionPass                                                                                                                                                                                                                                                                                                                                                                             |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <ul> <li>Find inconsistencies between registers within the same register file</li> <li>Multiple registers with the same index</li> <li>Registers with missing indicies</li> <li>Registers with prescribed values &gt; than their bit width</li> <li>Sub-register fields with overlapping names</li> <li>Sub-register fields with overlapping bit field definitions</li> </ul> | <ul> <li>Identifies potential ISA encoding issues</li> <li>Utilizes all instruction formats within an ISA</li> <li>Builds a table of every known instruction within the ISA <ul> <li>Initializes a bit vector for every instruction using the encoding (eg, opcode), immediate values and register fields (bitmask)</li> </ul> </li> <li>Examines collisions in the "loaded" encodings</li> </ul> |



#### CoreGen IR Passes: Example Data Passes

| StatsPass                                                                                                                                                                                                                                                                                                                      | SpecDoc                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <ul> <li>Walks the entire dependence graph<br/>and collects data about the<br/>connectivity of the graph and<br/>incidence of the nodes</li> <li>Reports the number of adjacent<br/>dependent nodes for each node in the<br/>graph (outdegree)</li> <li>Reports final counts of the incidence<br/>of each node type</li> </ul> | <ul> <li>Walks the entire dependence graph<br/>and collects data about the encoding<br/>infrastructure of the ISA and portions<br/>of the micro architecture</li> <li>Outputs a specification document in<br/>LaTeX that details each of: <ul> <li>Register Classes</li> <li>Register Encodings</li> <li>Subregister encodings (bit fields within<br/>register)</li> </ul> </li> <li>Instruction Formats</li> <li>Instruction Encodings</li> <li>Master instruction table</li> </ul> |

#### CoreGen IR Passes: Example System Passes

| SafeDeletePass                                                                                                                                                                                                                                                                                                                                                      | ASPSolverPass                                                                                                                                                                                                                                                                                                                             |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <ul> <li>Determines whether removing a node from the graph is safe</li> <li>For example: <ul> <li>"If I remove this register, will I break the dependence graph?"</li> <li>"If I remove this encoding format, will I break the dependence graph?"</li> </ul> </li> <li>Generally utilized within other tooling, but can be utilized directly by the user</li> </ul> | <ul> <li>Utilized when an existing analysis pass does not find a specific corner case</li> <li>Can be used to "programmatically" define new dependence solvers using a specific syntax</li> <li>Can be utilized as design constraints and/or regression tests to ensure specific functionality/connectivity exists in a design</li> </ul> |



## CoreGen Infrastructure

- All hardware modules/units are defined as DAG nodes
- Dependence graph between nodes is "lowered" in multiple stages in order to expose increasing levels of complexity
  - Similar to Open64 notion of multi-dimensional IR
- DAG Levels:
  - Level 0: Basic node connectivity
  - Level 1: Expands "extension" nodes to contain all their children
  - Level 2: Expands all communication links
  - Level 3: Expands all instruction and register encodings



## What type of nodes in the CoreGen IR?

- **Soc Nodes**: Defines a top-level system on chip (one per design)
- <u>Core Nodes</u>: Defines a single core with associated ISA and dependent nodes
- **ISA Nodes**: Defines an instruction set architecture container
- Instruction Format Nodes: Defines an instruction format with all of its subfields
  - Sub-fields have properties that define encoding fields, register fields, immediate value fields, etc
- Instruction Nodes: Defines a single instruction based upon a prescribed instruction format.
  - Ability to define encodings, register classes for register fields, immediate values, etc



## What type of nodes in the CoreGen IR (cont)?

- <u>Pseudo Instruction Nodes</u>: Defines specific encodings for existing instructions
  - Ability to define prescribed encoding fields as static (eg, immediate = 0x00)
  - DOES NOT induce instruction encoding collisions
  - EG: "MOV RT, Ra" = "ADD RT, Ra, \$0"
- <u>Register Class Nodes</u>: Defines a container node with multiple dependent registers. Register indices within a register class cannot overlap
- <u>**Register Nodes**</u>: Defines a single register node with an index, bit width, sub-register fields and attributes (RO, RW, etc)
- <u>Encoding Nodes</u>: Define encodings for parent nodes (EG, register encodings)
- Cache Nodes: Defines a single cache layer
  - Can be interconnected into multi-level caches



## What type of nodes in the CoreGen IR (cont)?

- <u>Communication Channel Nodes</u>: Interconnect multiple nodes via on-chip data+control paths
  - Multiple topologies supported
- Scratchpad Nodes: Represent addressable on-chip scratchpads
- <u>Memory Controller Nodes</u>: Represents a basic memory controller with multiple input/output ports
- Virtual to Physical Nodes: Handles virtual to physical memory translation
- \*\*<u>Extension Nodes</u>: Special node type that permits users to "import" other CoreGendeveloped project artifacts into other projects
  - Accelerators are excellent examples of extensions
- \*\* Plugin Nodes: Special node type that represents a third-party templated design
  - Can contain unlimited number of special properties (not defined by standard CoreGen IR spec)
  - May also contain custom code generation facilities to output any style of HDL/RTL, etc



## CoreGen Plugins

- CoreGen plugins are containers for selfcontained extensions
  - Each plugin is effectively its own template
  - Bundled as a shared library (can be licensed and distributed outside of System Architect)
- These can be:
  - Cores, ISAs, cache modules, periphery components, etc
- Each plugin can drive unique "code generators" to modify their internal HDL state and/or generate the source compiler
- Projects can import any number of plugins
- DAG analysis works across plugins



#### CoreGen IR Specification

- IR Spec is governed in the same manner as source code development
  - Changes to the spec must be received in the form of pull requests on Github
  - Adjacent pull requests (that include all the necessary tests) must also exist in CoreGen library tree
  - NO changes to the spec are accepted without support in CoreGen
- Entire IR spec is documented with examples
- Latest revision:
  - <u>http://www.systemarchitect.tech/index.php/coregenirspec/</u>



#### What now?

- Level 1 Tutorial: Describes basic design concepts and walks through the initial definition of a RISC-like design
- Level 2 Tutorial: Implementing individual instructions using the StoneCutter language and compiler
  - Extends the design from Level 1
- Level 3 Tutorial: Advanced design and implementation concepts
  - Extends the work done in Level 2
- Level 4 Tutorial: Building external plugins and integrating external RTL
  - How do we integrate existing IP?



## What do you need to continue?

- Linux/OSX system with the tools installed
  - Prebuilt packages are available:
  - <u>https://github.com/opensocsysarch/SystemArchitectRelease</u>
- Text editor
  - VIM, Emacs, Notepad, etc
- For those seeking to use the GUI
  - Graphics environment (X11, OSX, etc)
- Basic knowledge of computer architecture
- Basic knowledge of software architecture



## References

Where do I find more info?



Tactical Computing Laboratories

### Web Links

- System Architect Public Web
  - http://www.systemarchitect.tech/
- Documentation
  - Latest IR Specification:
    - <u>http://www.systemarchitect.tech/index.php/coregenirspec/</u>
- Tutorials
  - <u>http://www.systemarchitect.tech/index.php/tutorials/</u>
  - <u>https://github.com/opensocsysarch/CoreGenTutorials</u>



#### Source Code

- Main source code hosted on Github:
  - https://github.com/opensocsysarch
- CoreGen Infrastructure
  - https://github.com/opensocsysarch/CoreGen
- CoreGenPortal GUI
  - <u>https://github.com/opensocsysarch/CoreGenPortal</u>
- CoreGen IR Spec
  - <u>https://github.com/opensocsysarch/CoreGenIRSpec</u>
- System Architect Weekly Development Releases
  - <u>https://github.com/opensocsysarch/SystemArchitectRelease</u>





- Issues should be submitted through the respective Github issues pages (see source code links)
- Mailing Lists:
  - <u>http://www.systemarchitect.tech/index.php/lists/</u>
- Direct developer contacts
  - John Leidel: jleidel<at>tactcomplabs<dot>com
  - Frank Conlon: fconlon<at>tactcomplabs<dot>com

