## Embla Trasti Bygland

# Power Modeling of Complex Designs

Master's thesis in Electronics Systems Design and Innovation Supervisor: Snorre Aunet, Knut Austbø July 2020

nology ctrical Master's thesis

NTNU Norwegian University of Science and Technology Faculty of Information Technology and Electrical Engineering Department of Electronic Systems



Embla Trasti Bygland

# **Power Modeling of Complex Designs**

Master's thesis in Electronics Systems Design and Innovation Supervisor: Snorre Aunet, Knut Austbø July 2020

Norwegian University of Science and Technology Faculty of Information Technology and Electrical Engineering Department of Electronic Systems



## Abstract

In this project, a tool for making power models of designs at the Register Transfer Level (RTL) is implemented. The generated power model is intended to be used with a power estimation tool, to give an early, fast and accurate power estimate. Nordic Semiconductor ASA issues this masters project with the motivation of making RTL simulations power-aware. Discovering power bugs early in the implementation of a design may save iterations in the Application Specific Integrated Circuit (ASIC) design flow, and thus reduce time to market for a product.

The method for estimating power at the RTL called the top-down method was chosen for the implementation. Among other desired qualities, it does not require a gate-level representation of the design to produce a power estimate. This allows for power estimation to be done concurrently to simulations for functional verification of the RTL, before synthesis of the design.

The power modeling problem is divided into three tasks:

- 1. Extracting structural information from an elaborated SystemVerilog representation of the design.
- 2. Extracting information about available cells and their power consumption characteristics from the cell library.
- **3.** Combining the structural representation with the cell- and power information retrieved, in order to create a power model.

In the implementation, the structure of the design is represented by a node tree, while a cell library object was created to represent available cells from the cell library and their power data. In order to produce a power model, the implementation takes sequences of generic cells from the structure tree and replace them with cells obtained from the cell library. The power model consists of several power-aware node trees. The power model representation is more similar to the gate-level netlist than the elaborated SystemVerilog representation. However, more work is needed to obtain a proper comparison between them.

The implementation shows promise for accurate and fast power estimation. Several ab-

stractions are done in the process so that fast estimations can be made, and their effect on the power consumption have been evaluated together with other alternatives. When creating the power-aware node tree, cells from the generic cell library are grouped to more complex cells from the cell library. This grouping ensures a reduction in the number of cells, which brings the model closer to the gate-level representation.

Some work remains to complete the power model; the most complex generic cells from the elaborated SystemVerilog file need to be constructed from several cells from the cell library. Complex cells with no equivalent yet are those representing arithmetic operations, shifters and comparators. When these cells have a representation, switching activity can be propagated through the structure trees in order to get a power consumption estimate for each of them. The final job of the power estimation tool is to solely use the activity data from the RTL simulation, together with the power values from each structure tree to yield the power estimate.

## Sammendrag

I dette prosjektet implementeres et verktøy for å lage effektmodeller av RTL design. Den genererte effektmodellen er ment å brukes sammen med et effektestimeringsverktøy for å gi et tidlig, raskt og nøyaktig effektestimat. Nordic Semiconductor ASA utsteder dette masterprosjektet med motivasjonen å gjøre RTL simuleringer effektbevisste. Å oppdage power bugs tidlig i implementeringen av en design kan spare iterasjoner i ASIC designflyten, og dermed redusere tiden som kreves for å få et produkt på markedet.

Metoden for å estimere effekt på RTL kalt top-down metoden ble valgt for implementeringen. Blant andre ønskede kvaliteter krever det ikke en syntetisert nettliste-representasjon av designet for å produsere et effektestimat. Dette gjør at effektestimering kan gjøres samtidig med simuleringer for funksjonell verifisering av RTL, før syntesen av designet.

Effektmodelleringen er delt inn i tre deler:

- 1. Hente ut strukturell informasjon fra en prosessert SystemVerilog representasjon av designet.
- 2. Hente ut informasjon om tilgjengelige celler og deres effektforbruk fra cellebiblioteket.
- **3.** Kombinere den strukturelle representasjonen med celle- og effektinformasjonen, og lage en effektmodell.

I implementasjonen er strukturen til et design representert av et nodetre, mens et cellebibliotekobjekt er laget for å representere tilgjengelige celler fra cellebiblioteket og effektforbruket deres. For å produsere en effektmodell tar implementasjonen sekvenser av generiske celler fra strukturtreet og erstatter dem med celler hentet fra cellebiblioteket. Effektmodellen består av flere effektbevisste nodetrær. Den effekt-bevisste representasjonen har mange likheter med den syntetiserte nettlisten. Dog, mer arbeid er nødvendig for å lage en god sammenligning mellom representasjonene.

Implementasjonen er lovende for nøyaktig og rask høy-nivås estimering av effektforbruk. Flere abstraksjoner blir gjort i prosessen slik at estimasjonen er rask. Hvordan abstraksjonene påvirker effektestimatet er evaluert sammen med andre alternativer. Når et effektbevisst nodetre lages, grupperes generiske celler til mer komplekse celler fra cellebiblioteket. Denne grupperingen gjør at antall celler i representasjonen reduseres, noe som bringer modellen nærmere den syntetiserte nettlisten.

Noe arbeid gjenstår for å gjøre effektmodellen komplett; flere komplekse generiske cellene fra den prosesserte SystemVerilog-filen må settes sammen av tilgjengelige celler fra cellebiblioteket. Komplekse generiske celler som ennå ikke har noen ekvivalent effektbevisst representasjon er de som representerer aritmetiske operasjoner, skiftere og komparatorer. Når disse generiske cellene har en representasjon i effektmodellen, kan signaler propageres gjennom strukturtrærne, og et effektestimat lages for hvert nodetre. Jobben til effektestimeringsverktøyet som skal bruke effektmodellen er å kombinere aktivitetsdata fra en RTLsimulering med effektverdiene fra hvert nodetre i et vilkårlig design, og gi et effektestimat for designet.

## Preface

This Master's Thesis concludes a five-year M.Sc. degree at the Norwegian University of Science and Technology (NTNU) at the programme Electronics Systems Design and Innovation, with a specialisation in Design of Digital Systems.

Preliminary research was done during the fall semester of 2019, which resulted in an unpublished literary review on Register Transfer Level power estimation. Methods discussed in this review are reconsidered, and one method is selected for the implementation of a power model.

The thesis is written in cooperation with Nordic Semiconductor ASA. They have contributed with the required Synopsys licenses, a workplace with a computer, and a wonderful supervisor Knut Austbø, who does not seem to mind late-night readthroughs of text with too few commas in it. I also had great supervision from Snorre Aunet from the Institute of Electronic Systems at NTNU. They both have my gratitude.

I would also like to thank my friends from my study programme for providing companionship and focus through video chat during long days of working from home. I am grateful for their help, and the opposite of grateful to the corona virus and the backache working from my kitchen table has given me.

Lastly, I wish to thank my mom, and if there is a Best Mom Award given by any reader of this thesis, I hereby nominate her.

# Contents

| Al | bstra  | $\operatorname{ct}$                      | i |
|----|--------|------------------------------------------|---|
| Sa | mme    | endrag ii                                | i |
| Pı | reface |                                          | v |
| Co | onter  | ts                                       | v |
| G  | lossa  | x x                                      | i |
| A  | crony  | rms xii                                  | i |
| 1  | Intr   | oduction                                 | 1 |
|    | 1.1    | Motivation                               | 1 |
|    | 1.2    | Problem description                      | 3 |
|    | 1.3    | Report structure                         | 4 |
| 2  | The    | ory                                      | 7 |
|    | 2.1    | Terminology                              | 7 |
|    | 2.2    | The ASIC design flow                     | 8 |
|    | 2.3    | CMOS power consumption                   | ) |
|    |        | 2.3.1 Dynamic power consumption          | ) |
|    |        | 2.3.2 Static power consumption           | 2 |
|    | 2.4    | Process, Voltage and Temperature corners | 2 |
| 3  | Bac    | kground 1                                | 5 |
|    | 3.1    | Bottom-up power estimation               | ĵ |
|    | 3.2    | Top-down power estimation                | 7 |
|    |        | 3.2.1 Fast synthesis power estimation    | 9 |
|    | 3.3    | Prestudy                                 | ) |
| 4  | Sug    | gesting a solution 23                    | 3 |
|    | 4.1    | Structural information                   | 5 |
|    | 4.2    | Cell library information                 | 5 |

|   | 4.3            | The p   | ower modeling flow                             | 26 |
|---|----------------|---------|------------------------------------------------|----|
| 5 | Des            | ign to  | ols and file formats                           | 29 |
|   | 5.1            | Design  | n elaboration                                  | 29 |
|   | 5.2            | Libert  | y file format                                  | 33 |
|   |                | 5.2.1   | Power characteristics                          | 33 |
|   |                | 5.2.2   | Power related library attributes and groups    | 36 |
|   |                | 5.2.3   | Cell attributes and groups                     | 37 |
|   |                | 5.2.4   | Pin attributes and groups                      | 38 |
|   | 5.3            | Test fi | iles and modules                               | 39 |
|   |                | 5.3.1   | Test modules                                   | 39 |
|   |                | 5.3.2   | Calibration netlist                            | 39 |
|   |                | 5.3.3   | Liberty file                                   | 39 |
|   |                | 5.3.4   | Project files                                  | 39 |
| 6 | $\mathbf{Ext}$ | racting | g design structure                             | 41 |
|   | 6.1            | Elabo   | rated SystemVerilog                            | 42 |
|   | 6.2            | Struct  | ural representation of a design                | 43 |
|   | 6.3            | Abstra  | actions made                                   | 45 |
|   | 6.4            | Elabo   | rated SystemVerilog parser implementation      | 46 |
|   |                | 6.4.1   | Parsing                                        | 47 |
|   |                | 6.4.2   | Post-processing                                | 47 |
|   |                | 6.4.3   | Register levelised structure trees             | 48 |
|   | 6.5            | Comp    | aring cell counts                              | 49 |
|   | 6.6            | Struct  | ural representation discussion                 | 51 |
|   |                | 6.6.1   | Cell counts                                    | 51 |
|   |                | 6.6.2   | The register-levelised node tree               | 52 |
|   |                | 6.6.3   | Abstractions introduced by generic cell groups | 53 |
|   |                | 6.6.4   | Registers being optimised away                 | 53 |
|   |                | 6.6.5   | Possible optimisations                         | 54 |
| 7 | $\mathbf{Ext}$ | racting | g library information                          | 55 |
|   | 7.1            | Releva  | ant power data                                 | 56 |
|   | 7.2            | Abstra  | actions                                        | 56 |
|   |                | 7.2.1   | The difference between fall- and rise power    | 56 |
|   |                | 7.2.2   | The difference between data input pins         | 57 |
|   |                |         |                                                |    |

| в  | Tecl | nnical  | implementation of the liberty parser                                                                                          | B-1  |
|----|------|---------|-------------------------------------------------------------------------------------------------------------------------------|------|
| Α  | Tecl | nnical  | implementation of the elaborated SystemVerilog parser                                                                         | A-1  |
|    | 10.2 | Impler  | menting a power estimation tool                                                                                               | . 87 |
|    |      |         | ing the power model                                                                                                           |      |
| 10 | Futu | ure wo  | rk                                                                                                                            | 87   |
| 9  | Con  | clusio  | n                                                                                                                             | 85   |
|    |      | 8.8.5   | The accuracy/speed trade-off                                                                                                  | . 83 |
|    |      | 8.8.4   | Improvements to consider                                                                                                      | . 82 |
|    |      | 8.8.3   | Evaluating the power model                                                                                                    |      |
|    |      | 8.8.2   | Consequences of abstractions                                                                                                  | . 80 |
|    |      | 8.8.1   | The quality of the cell mapping                                                                                               | . 79 |
|    | 8.8  | Discus  | sion                                                                                                                          | . 79 |
|    | 8.7  | Result  | s                                                                                                                             | . 76 |
|    | 8.6  | Impler  | mentation                                                                                                                     | . 74 |
|    | 8.5  | Estima  | ating the switching power                                                                                                     | . 74 |
|    |      | 8.4.1   | The <i>select</i> cell                                                                                                        | . 72 |
|    | 8.4  | Generi  | ic cells with no library equivalent                                                                                           | . 72 |
|    |      | 8.3.1   | Need for optimisation                                                                                                         |      |
|    | 8.3  |         | ining the structural information and the liberty data                                                                         |      |
|    | 8.2  |         | ations introduced by the cell library representation                                                                          |      |
|    | 8.1  |         | ations introduced by the structural representation                                                                            | . 66 |
| 8  | Gen  | eratin  | g a power model                                                                                                               | 65   |
|    |      | 7.5.3   | On the calibration                                                                                                            | . 64 |
|    |      | 7.5.2   | Other representations                                                                                                         | . 64 |
|    |      | 7.5.1   | Choosing a cell from a group                                                                                                  | . 63 |
|    | 7.5  | Discus  | sion                                                                                                                          | . 63 |
|    |      | 7.4.3   | Summary                                                                                                                       |      |
|    |      | 7.4.2   | Putting together a cell library object                                                                                        | . 63 |
|    |      | 7.4.1   | Parsing Liberty and storing data                                                                                              |      |
|    | 7.4  |         | $mentation \dots \dots$ |      |
|    | 7.3  | Cells v | with same functionality                                                                                                       |      |
|    |      | 7.2.3   | The state-dependency of leakage power                                                                                         | . 59 |

| С            | Technical implementation of the power model | C-1 |
|--------------|---------------------------------------------|-----|
| D            | Code implemented in Chapter 6               | D-1 |
| $\mathbf{E}$ | Code implemented in Chapter 7               | E-1 |
| $\mathbf{F}$ | Code implemented in Chapter 8               | F-1 |

## Glossary

- **Dennard Scaling** A MOSFET scaling law claiming the power density stays constant as transistors scale, thus making it possible to reduce power consumption by reducing the design size. This has held until recently, as leakage power is not negligible anymore with the smaller gate lengths in newer technology
- fan-in is the reduction of signals caused by several signals being connected to a cell with fewer outputs than inputs. E.g. a 3-inputs AND gate has a fan-in of 3.
- fan-out is the number of input gates that is driven by an output of a logic gate
- **JSON** stands for JavaScript Object Notation and is a format for representing structured data.
- **Liberty** is a widely adopted library format. The format is managed by the Liberty Techincal Advisory Board, which is sponsored by Synopsys [1]
- one-hot is form of signal encoding where only one bit of the signal can be high at a time
- **power bug** is a fault with the design causing the power consumption to behave unexpectedly. It may cause the design to violate its power constraints.
- SystemVerilog is a hardware description- and hardware verification language
- VHDL is a hardware description- and hardware verification language

# Acronyms

- ASIC Application Specific Integrated Circuit
- **BDD** Binary Decision Diagram
- ${\bf BN}\,$ Boolean Network

CDFG Control flow Data Flow Graph

CMOS Complementary Metal-Oxide-Semiconductor

HDL Hardware Descriptive Language

I/O Input/Output

- IC Integrated Circuit
- **IEEE** Institute of Electrical and Electronics Engineers

LUT Lookup Table

**RT-level** Register Transfer Level

**RTL** Register Transfer Level

 $\mathbf{SV}$  SystemVerilog

# List of Tables

| 5.1 | Elaborated cells                                                                 | 9 |
|-----|----------------------------------------------------------------------------------|---|
| 5.2 | Groups of elaborated SystemVerilog constructs                                    | 2 |
| 5.3 | Library group and attribute overview                                             | 6 |
| 5.4 | Power related cell groups and attributes overview                                | 7 |
| 5.5 | Pin power related groups and attributes overview                                 | 8 |
| 5.7 | Code listings and code documentation                                             | 0 |
| 6.1 | Gate counts from elaborated structure and synthesised file                       | 9 |
| 7.1 | Increase in leakage power from least consuming to most consuming state 59        | 9 |
| 8.1 | Power consumption in AND cells of different sizes using AND2 as the reference 69 | 9 |
| 8.2 | Power consumption in AND4 optimisations, in comparison to the three              |   |
|     | AND2 gate implementation in Figure 8.3a                                          | 0 |
| A.1 | Overview of the functions in the elaborated SystemVerilog (SV) parser A-2        | 2 |
| A.2 | Helper functions for the elaborated SV parser                                    | 3 |
| A.3 | Overview of classes in the elaborated SV parser and their variables and          |   |
|     | procedures                                                                       | 4 |
| B.1 | functions for processing the liberty file information                            | 2 |
| B.2 | Class overview for processing the liberty file information                       | 3 |
| C.1 | functions for making the power model                                             | 2 |
| C.2 | Class overview for the power model                                               | 2 |

# List of Figures

| 1.1        | Graph relating design abstraction level and power estimation accuracy                                                                                                      | 2  |
|------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----|
| 2.1<br>2.2 | Illustration of the iterative ASIC design flow, [2] $\ldots \ldots \ldots \ldots \ldots$<br>Illustration of the short-circuit power in CMOS logic [3]. When $V_{IN}$ rises | 9  |
|            | and falls $I_{SC}$ will flow from $V_{DD}$ to ground for a short period of time                                                                                            | 11 |
| 2.3        | Complementary Metal-Oxide-Semiconductor (CMOS) design corners [4]                                                                                                          | 13 |
| 3.1        | The estimation flow in the case of bottom-up power estimation $\ldots \ldots \ldots$                                                                                       | 17 |
| 3.2        | A top-down estimation flow                                                                                                                                                 | 18 |
| 3.3        | The estimation flow in the case of fast synthesis estimation $\ldots \ldots \ldots$                                                                                        | 20 |
| 4.1        | The intended estimation flow of the top-down power estimation. The blocks<br>highlighted in green are already existing, while the orange ones have to be                   |    |
| 1.0        | implemented to make the top down power estimator.                                                                                                                          | 24 |
| 4.2        | A refined flow for the top-down power estimation. The already existing<br>Liberty parser is highlighted in yellow. The part of the flow that is out of                     |    |
|            | scope is drawn in dotted lines                                                                                                                                             | 27 |
| 4.3        | An overview of the synthesis process from RTL to netlist                                                                                                                   | 28 |
| 6.1        | The modeling flow with the part of the flow relevant to this chapter highlighted                                                                                           | 41 |
| 6.2        | Different methods to levelise a logic circuit                                                                                                                              | 44 |
| 6.3        | The elaborated SystemVerilog file is parsed and a set of <i>structure</i> class objects are made                                                                           | 46 |
| 6.4        | A structural representation of the circuit in Figure 6.2b as a tree of <i>structure</i>                                                                                    | 40 |
|            | objects                                                                                                                                                                    | 48 |
| 7.1        | The modeling flow with the part related to retrieving power information from                                                                                               |    |
|            | the cell library highlighted.                                                                                                                                              | 55 |
| 7.2        | The impact on power estimation when summarising rise- and fall power $\ .$ .                                                                                               | 57 |
| 7.3        | ANDOR21                                                                                                                                                                    | 58 |
| 7.4        | The dataflow of retrieving the relevant Liberty data.                                                                                                                      | 61 |
| 8.1        | The modeling flow with the flow relevant to this chapter highlighted                                                                                                       | 65 |

| 8.2 | Different representations of an AND4 gate                                                                      | 67  |
|-----|----------------------------------------------------------------------------------------------------------------|-----|
| 8.3 | Different implementations of a 4-input AND gate                                                                | 69  |
| 8.4 | A common CMOS schematic for a NAND2 and an AND2 gate. The AND2                                                 |     |
|     | schematic is the same as the NAND2 but with an added inverter                                                  | 71  |
| 8.5 | One-hot multiplexers with different data<br>width                                                              | 73  |
| 8.6 | An AND4 gate as made by the power model generator                                                              | 75  |
| A.1 | The function hierarchy of the elaborated SV parser. Functions at the same level are called from left to right. | A-1 |
| B.1 | The function hierarchy of the liberty parser                                                                   | B-1 |
| B.2 | The function hierarchy of the liberty power data retrieving                                                    | B-2 |
| C.1 | Function hierarchy for the power model implementation                                                          | C-1 |

# 1 Introduction

### 1.1 Motivation

Power consumption is becoming increasingly important in Integrated Circuit (IC) design with the emergence of more and more battery-driven devices [5]. Transistor dimensions have continuously been shrinking to lower the power consumption of ICs. However, with the breakdown of Dennard Scaling [6] in the 2000s, leading to an increase in power density with smaller dimensions, downscaling has less effect on power consumption than it used to. Designers are now pushed to focus more on power consumption in their designs, and designing circuits for low power usage is becoming just as important as designing for high performance. The latter may be easier for designers, while many may lack the intuition to create circuits with low power in mind. To aid designers in this endeavour, tools for estimating the power consumption are essential.

Power estimation can be done at all design stages, until, in the end, it can be measured on the physical IC. The closer one is to the final implementation; the more accurate the power estimation typically can get. The less abstract the design representation is, the more one knows about parameters critical to power consumption. This is illustrated in Figure 1.1.

The system-level representation of a design is very abstract, and few aspects of the physical endproduct are known. The RTL representation is less abstract than the system-level representation, but still much remains unknown about the physical IC. The gate-level representation is closer to the endproduct than the two others, and many parameters relating to power consumption are determined at this level. The accuracy of power estimation will typically follow the trend of the graph; being more accurate the less abstract the design representation is.



Figure 1.1: Graph relating design abstraction level and power estimation accuracy.

The ASIC design process is an iterative process, described in Section 2.2. Discovery of issues at a particular stage might bring one back to earlier stages in the design process, where more significant changes can be done. Each iteration is costly in development time and effort, and may increase the time to market for a product. Ideally, the design should be made with as few iterations as possible. Discovering and fixing power bugs already at the RTL is thus beneficial, possibly reducing the number of design iterations necessary.

There is a lack of suitable tools for estimating power at the RTL. They tend to be either too time consuming to run or too inaccurate to give assured results. Many also output an average power estimate with no granularity in time and space, which is needed if one is to use this estimate to deal with power bugs.

### 1.2 Problem description

This project aims to investigate and develop power models for use in power estimation at the RTL. A general method for making these models is found and it holds for all types of RTL designs. To make this model, information about the cell library used is necessary together with an RTL description of the design.

Nordic Semiconductor ASA requested this project, and their motivations are to be able to discover power bugs early and enhance their design flow by developing a power estimation tool able to yield power estimates corresponding to the RTL simulations. It is necessary to have a low spatial and temporal granularity in the model made, in order to discover power bugs.

In Chapter 4, different approaches to RTL power estimation are investigated, and an implementation using the top-down method is decided upon. An advantage of the top-down method is its ability to yield a power estimate before having a gate-level representation of the design. This way, the estimation method does not introduce extra iterations to the design flow described in Section 2.2.

The top-down method tends to be less accurate than other options for RTL power estimation. The approach presented in this thesis tries to atone for this by using the Liberty file to get accurate power information about the cells to be used in the design, and combine this cell information with elaborated Hardware Descriptive Language (HDL) structural information, which potentially yields a better structural representation than an unprocessed HDL representation. The HDL elaboration will be done using Synopsys HDL Compiler.

This project implementation is divided into three main tasks:

#### 1. Analysing the structure of a design

By using an elaborated HDL implementation, information about the structure of a design, necessary for estimating power, will be retrieved. This information could be the number of gates, number of registers, amount of combinatorial logic, how the signals are connected, and so on.

#### 2. Obtaining power characteristics from the technology library

Finding a means to retrieve information about the available cells and their power characteristics from the cell library. Power characteristics being the leakage power and the dynamic power of the cells. It is also necessary to retrieve information about the cells in question to be able to relate their functionality to the power data.

#### 3. Creating a power model

Combining information about the structure retrieved in task **1** and information about the available cells retrieved in task **2** to make a power model of a design, representing all the signals and logic in the HDL representation.

The novelty of this power model is its generality and its use of the elaborated HDL and a cell library. The generality allows power models being made for any RTL as long as it can be elaborated by Synopsys HDL Compiler. The power models can be made with any cell library, their dimensions being irrelevant. The use of a cell library in the power model generation and a structural representation derived from elaborated SystemVerilog aims to achieve a high accuracy to future power estimations at the RTL.

## **1.3** Report structure

After this introduction this report consists of the following chapters:

| 2 Theory                         | In this chapter some relevant and useful theory for<br>the project is presented.                              |
|----------------------------------|---------------------------------------------------------------------------------------------------------------|
| 3 Background                     | This chapter presents related work and gives an introduction to RTL power estimation.                         |
| 5 Design tools and file formats  | Here relevant design tools and file formats are pre-<br>sented.                                               |
| 6 Extracting design structure    | Describes how the structural information is re-<br>trieved from the RTL representation.                       |
| 7 Extracting library information | Describes how the power relevant information in<br>the cell library is found, stored and used.                |
| 8 Generating a power model       | This chapter combines the information retrieved<br>in the two preceeding chapters to create a power<br>model. |

## 9 Conclusion

Concludes the work done.

#### 10 Future work

Suggestions towards future work of improving the power model and applying it in top-down RTL power estimation.

# 2 Theory

## 2.1 Terminology

Some terminology that will be used in this report is shown below.

| GATE"N"           | Logic gates will be referred to in capital letters annotated with the<br>number of inputs the gate has. For instance, a 2-input and gate will<br>be written AND2. For more complex gates the numbers annotated<br>refer to clusters of inputs, if this number is one it often skips the<br>firs operator. For example ANDOR21 is a AND2 gate followed<br>by an OR2 gate, where one of the OR2 inputs is the output of the<br>AND2 gate. These more complex gates will be explained with logic<br>functions or figures to make this clearer. |
|-------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                   | functions of figures to make this clearer.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| *                 | Logical AND operation                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
| +                 | Logical OR operation                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
| !                 | Logical NOT operation                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
| cell              | A building block in ASIC design. A transistor circuit encapsulated<br>into a logic function, such as an AND gate. Could also describe<br>building blocks with other purposes, but in this project this will<br>not be visited. All available cells are gathered in a cell library.                                                                                                                                                                                                                                                          |
| generic gate/cell | A cell of the generic cell library used by Synopsys HDL Compiler<br>when doing the design elaboration. In cases where the cell repre-<br>sents a logic gate, it may be referred to as a generic gate instead of<br>a generic cell.                                                                                                                                                                                                                                                                                                          |

## 2.2 The ASIC design flow

The ASIC design flow is a mature design flow used in the making of Integrated Circuits. This flow allows one to, step-by-step, go from an abstract design description, towards the layout sent to a foundry for manufacturing the physical IC. The flow is iterative and may, at any point before the physical IC is produced, return to an earlier stage, where larger changes can be made [2].

In Figure 2.1 different ways to represent the circuit with decreasing abstraction is shown. The steps in the design flow is described below.

| System Level            | At this level the design is described as a set of functionalities,<br>characteristics and constraints.                                                                                                                                                                                                               |
|-------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Algorithmic level       | Here the design is described and verified on an algorithmic level,<br>often using high level programming languages.                                                                                                                                                                                                  |
| Register Transfer Level | This representation makes use of a hardware descriptive lan-<br>guage, to describe the design as digital signals, logic operations<br>and registers, and verified as such, for instance using SystemVer-<br>ilog or VHDL.                                                                                            |
| Logic Level             | The Register Transfer Level description of the design can be <i>synthesised</i> into a Logic level description. Here the description is mapped to the available logic cells in the cell library. The synthesis process also checks timing and area constraints so one knows whether these hold or not at this level. |
| Physical Layout         | For the physical layout representation the cells from the logic<br>level representation are placed and connected on a theoretical<br>chip. Analog phenomenon, eg. wire capacitances, are taken into<br>concern in an attempt to model the physical IC.                                                               |
| IC                      | Here the endproduct of the process is made in a foundry. based<br>on a GDSII file from the physical layout, which contains all<br>information necessary to produce the IC.                                                                                                                                           |



Figure 2.1: Illustration of the iterative ASIC design flow, [2]

## 2.3 CMOS power consumption

The power consumption in digital CMOS-based circuits can be divided into dynamic and static power consumption. The dynamic power consumption is caused by the switching activity in the system, while the static power consumption is caused by leakage in the CMOS transistors [4]. The total power consumption of the system is the sum of these two as is given by Equation (2.1).

$$P_{total} = P_{dynamic} + P_{static} \tag{2.1}$$

#### 2.3.1 Dynamic power consumption

The dynamic power consumption can be divided into switching power and short-circuit power [4].

The main contributor to the dynamic power consumption is the switching power, which is the power it takes to charge and discharge the output capacitance of a logic gate. It can be calculated as shown in Equation (2.2), where  $\alpha$  is an activity factor describing how often the output switches (changes value).  $C_L$  is the load capacitance on the gate output,  $V_{dd}$  is the supply voltage and  $f_{clock}$  is the clock frequency.

$$P_{SW} = \frac{\alpha}{2} C_L V_{dd}^2 f_{clock} \tag{2.2}$$

Another contributor to dynamic power consumption is the short-circuit current. When CMOS logic is in the middle of switching both the NMOS and the PMOS transistor will be partially open, allowing some current to flow from  $V_{dd}$  to ground. This is illustrated in Figure 2.2.



Figure 2.2: Illustration of the short-circuit power in CMOS logic [3]. When  $V_{IN}$  rises and falls  $I_{SC}$  will flow from  $V_{DD}$  to ground for a short period of time.

The short-circuits contribution to power consumption can be calculated using the expression shown in Equation (2.3), where  $t_{sc}$  is the duration of the short circuit current,  $V_{dd}$  is the supply voltage of the system,  $I_{sc}$  is the average short-circuit current and  $f_{clock}$  is the clock frequency.

$$P_{SC} = t_{sc} V_{dd} I_{sc} f_{clock} \tag{2.3}$$

The total dynamic power consumed in the circuit will be the sum of the switching power and the short-circuit power consumed by all the transistors in a design, shown in Equation (2.4). N is the number of transistors,  $P_{SW_t}$  is the switching power- and  $P_{SC_t}$  is the short-circuit power of transistor t.

$$P_{dynamic} = \sum_{t=0}^{N} \left( P_{SW\_t} + P_{SC\_t} \right)$$

$$(2.4)$$

#### Dynamic power consumption from a logic circuit perspective

A different way of viewing switching power consumption, more suited for logic designs, is gotten from dividing the power consumption in two contributions: The contribution from switching of nets in a design, the **switching power**, and the contribution from the switching of internal signals in a logic cell, which also includes the short-circuit contribution, **internal power**. The total dynamic power of a design can thus be seen as a sum of the **switching power**,  $P_{SW}$ , of all nets, and the **internal power**,  $P_{IN}$ , of all cells. This is shown in Equation (2.5).

$$P_{dynamic} = P_{SW} + P_{IN} \tag{2.5}$$

#### 2.3.2 Static power consumption

The static power consumption in the CMOS transistors is caused by leakage current. This leakage has traditionally been negligible compared to the switching power, but the downscaling of the technologies and the lower supply voltages, which in turn has lead to lower threshold Voltage,  $V_t$ . Nowadays the static power consumption of transistors, is just as significant as the switching power. Contributions to the leakage current come from the sub-threshold leakage, the gate leakage and the junction leakage [4].

- Sub-threshold leakage is current leaking from source to drain while the transistor is operating in the weak inversion region ( $V_G < V_t$ ). It increases exponentially when lowering  $V_t$  [4] and is the largest contributor to the static power consumption.
- Gate leakage is current caused by electrons tunnelling through the oxide layer of the gate.
- Junction leakage is caused by potential differences between the drain diffusion region and the substrate. It is often negligible compared to the other two contributors.

## 2.4 Process, Voltage and Temperature corners

Variations in the manufacturing and the environment will lead to significant changes in the characteristics of a transistor. These changes may cause the IC behaviour to vary. To make a circuit operate as expected these variations should be taken into account. The sources of these variation are **process variation**, **supply voltage** and **temperature** [4].

The **process variation** is caused by slight variations in the manufacturing process, like the concentration of dopants or the oxide thickness.

These variations lead to manufactured transistors having varied characteristics. These are described as; F (fast) and S (slow) for the corner-cases, where the transistor will operate faster and slower than expected, and T (typical) describing an average transistor. For a CMOS transistor, consisting of one PMOS and one NMOS transistor this yields four operating corner-cases describing a constricted area, in which the pair og transistors will always operate within. FF, SS, SF and FS. The center of this area is (TT), the average transistor. This is illustrated in Figure 2.3.



Figure 2.3: CMOS design corners [4]

The variation in **temperature** also affects the transistor's operation significantly as it lowers the threshold voltage. If the operating temperature is high the transistor will have a higher leakage, which increase its power consumption.

Lastly, the **supply voltage** can deviate from the intended value for many reasons, such as the tolerance of the voltage regulators and noise.

Thus, it is not enough to only take an average transistor in the TT corner, operating in room temperature with the intended supply voltage into account. One also needs to consider the transistor in its slow corner, operating on a high temperature with a low voltage, and all other corner-cases.

# 3 Background

As demands to power consumption rise, the size of battery-driven devices sink and smaller transistor dimensions lead to higher power density, the need for power estimation tools rise. The designers wish to optimise the ASIC design flow and minimise the time to market, while still keeping up with state of the art power demands. Power can be estimated at all stages in the design process described in Section 2.2.

At the system level, accurate power estimation tools are few, but maybe not for much longer. In 2019 Institute of Electrical and Electronics Engineers (IEEE) released a new standard for power modeling at the system level [7]. There has not been a standard way of representing power data at the system level before. The organisation suggests the lack of such a standard could be why the industry is still inadequate in this field.

At the functional level, there power estimation tools exist, but they are mainly meant to speed up the simulations. One can argue that there are two main reasons one can wish to estimate power at a high level;

- 1. One wants to get an approximate indication of the power consumption at this level before lower-level representations are made.
- 2. Simulations at this level is faster than low-level simulation

In 2. one returns to a more abstract design representation to run faster simulations. Increasing the simulation speed is the primary motivation for the functional level power estimation tools. They are based on already existing gate-level representations.

Zhong et al. [8] try to estimate power at the functional level using some RTL power models derived from a gate-level representation of the design. Here a cycle-accurate functional description is merely an abstraction of the known RTL, in order to to speed up the RTL power estimation by going up an abstraction level for the simulation. In another paper, Zhong et al. [9] further improve their solution. Lee and Gerstlauer [10] annotates a functional model of a design with constructs allowing the capturing of activity. Using machine learning, power models can be synthesised from this functional model. An advantage of this method is that it allows for high-speed simulations. However, the functional model requires an existing gate-level representation of the design to train the power model. The methods for estimating power at the RTL can be divided into two main methods of implementation. They will be referred to as bottom-up and top-down methods. The bottom-up method starts with a less abstract representation of the design, such as the gate-level representation, and tries to relate power estimates done at this level to factors that are also known at the RTL [11], [12], [13]. The estimation method then returns to the RTL representation of the design and does power estimation on different scenarios there. The top-down method, on the other hand starts at the abstract RTL and tries to estimate lower-level information about the design in order to estimate power directly [14], [15], [16].

## **3.1** Bottom-up power estimation

In Figure 3.1 a typical estimation flow of bottom-up power estimation can be seen. The available input data is a gate-level netlist with corresponding simulation data. However, this requires that synthesis and layout with the desired technology library have been performed. A power estimation tool is then run on the gate level representation with a broad set of activity data. This results in a set of power estimates and simulation data that can be used to characterise the design, often relating the Input/Output (I/O)-switching to the power consumption. To get a power estimate, characterisation variables or a Lookup Table (LUT) are then fed to a general power model at the RTL, together with the simulation data of the scenario from which one wishes to estimate power.

Ravi et al. [11] makes an extensive set of macromodels from RTL components. These models are then translated into simulatable power model libraries. The creation of new designs then solely make use of these components for which power estimates are available.

Gupta et al. [12] made a macromodel relating gate-level power estimates to the hamming distance between consecutive input vectors. A complicated characterisation stage is necessary to exploit this relation.

Mehta et al. [13] also takes basis in making a macromodel for every possible RTL component. A clustering algorithm is used to group input vectors leading to similar power consumption in the circuit. These groups are then placed in a LUT. This clustering makes their model faster, as there are fewer values to look up.



Figure 3.1: The estimation flow in the case of bottom-up power estimation

## 3.2 Top-down power estimation

The estimation flow of a top-down power estimation approach can be seen in Figure 3.2. The method needs to take in information about the structure, readily available at the RTL, for instance, a HDL description. It also needs to take the cell library into account. The cell library can be considered by, for example, knowing the power characteristics of a standard gate from the cell library, or by processing the entire cell library as an input. It could also be possible to do some characterisation. If one, for instance, has a design that will be synthesised with strict timing constraints, this will increase its power consumption compared to a design with less strict timing constraints.

Zafalon et al. [14] have developed both a top-down and a bottom-up technique for power



Figure 3.2: A top-down estimation flow.

estimation. Their top-down approach is based on using a Binary Decision Diagram (BDD) to represent the circuit. Representing a design as a BDD is the same as making the design using only 2-to-1 multiplexers. This design is then optimised to some degree decided by the user, and the power estimate is tuned to the target technology. The user decides whether the actual synthesis will focus on power, timing or area and the model is also tuned based on that input.

Buyuksahin and Najm [15] make use of a Boolean Network (BN), a directed acrylic graph where each node is a boolean function, and its edges represent the connection between nodes. They use this network to estimate the gate count of the design, which yields an estimate of the circuit's total capacitance.

Sambamurthy et al. [16] use a Control flow Data Flow Graph (CDFG) to represent the circuit. This graph allows for modeling both the data operations done and conditionals. The number of stages necessary to implement a function is then estimated from the maximum input number of gates in the target technology and the function's size to be computed. The probability of switching at each node is then estimated from input switching from simulation

or the input switching probabilities and the likelihood of that switching propagating all the way to the logic depth of the function. The method of Logic Effort is used to make a capacitance estimate. All of the above is then combined into a power estimate.

#### 3.2.1 Fast synthesis power estimation

Several vendors provide tools for estimating power at the RTL. To mention a few; Ansys has PowerArtist [17], Synopsys has Spyglass Power [18], Mentor Graphics has PowerPro [19] and Cadence has Joules RTL Power Solution [20]. These are typically based on some variant of fast synthesis power estimation, mapping the RTL description to cells in a cell library and estimating the power consumption based on these cells. This method is applied by vendors already providing synthesis tools to provide a power estimation tool faster than gate-level estimation.

The power estimation flow of such tools is shown in Figure 3.3. The figure is simplified as the internal synthesis, and power estimation flow is undisclosed information private to the tool vendors. It is based on a synthesis tool that omits information not crucial to power estimation in order to speed up the synthesis. After the fast synthesis, an estimation tool will be used to estimate power. It gets its parameters from the "synthesised" design, activity data and possibly calibration data. As these methods bring the design closer to a gate-level representation, they allow for accurate power estimates but introduces a synthesis process which, though it is faster than a regular synthesis, may still be slow.



Figure 3.3: The estimation flow in the case of fast synthesis estimation

# 3.3 Prestudy

This thesis is written in collaboration with Nordic Semiconductor ASA. An unpublished literary review has been conducted on RTL power estimation to find a method suiting their motivations, which can be summarised as:

- Wanting to make RTL simulation power-aware.
- Being able to use this power awareness to detect power bugs.

The prestudy can be found on GitHub [21]. The following is a quick outline of the main differences between the top-down and bottom-up estimation flows and a summary of the prestudy conclusion.

The bottom-up methods have their foundation at the gate level and thus tend to have a

more accurate power estimation due to more information about the design being available as the power model is made. The challenge of bottom-up power estimation is to get the power estimates to correlate well with the input and output switching statistics of the design so that the model can be used at the RTL. The top-down methods tend to be less accurate, but lack the time-consuming characterisation stage of the bottom-up methods, making them faster for new designs and possibly more suited for design exploration if they take the internals of the RTL description into account.

To make the RTL simulation power-aware a power estimation tool for the RTL is needed. It is a significant advantage if this model is available before the design has been synthesised. Otherwise, it will introduce an extra iteration into the design flow, which may be avoided using the top-down method.

To detect power bugs with this power estimation, it needs time/cycle awareness. It could either work for smaller time-frames or do estimation cycle-by-cycle in the simulation. The latter is preferable. In addition to this temporal granularity, the tool should also have some spatial granularity. When running simulations on larger modules and observing unexpected power behaviour, it is an advantage to see where this behaviour occurs.

If the desire had been to increase simulation speeds when running power scenarios, then going from a gate-level representation to a RTL representation makes sense. Otherwise, this introduces an extra iteration to the design flow, which may be avoided using the top-down method.

If a top-down estimation approach does not provide enough accuracy, it could be supported by bottom-up models for existing design blocks to increase the estimation accuracy.

# 4 Suggesting a solution

The top-down method has been chosen for implementation due to its desirable estimation flow. The top-down flow is simple and starts at the RTL and makes a power estimate directly. For a bottom-up flow, on the other hand, a gate level representation of the design is needed **before** a RTL power estimate can be made. Using the top-down method a design can be changed or discarded because of power concerns early in the design flow, without ever needing to be synthesised, if the power estimates are accurate enough. With the topdown method it is possible to verify the power behaviour concurrently to the functional verification of the RTL.

The suggested estimation flow can be seen in Figure 4.1. Here the RTL representation of a design and data from the cell library is retrieved and processed separately, to later be combined into a power model. The power model is used together with activity data by a power estimation tool to yield a power estimate. Already existing data and tools are highlighted in green, while the parts highlighted in orange would have to be implemented.

It is necessary to implement a system processing the structural information found in the RTL representation, and another system processing power information related to the cell library. Then, the retrieved information from both systems can be combined into a power model, which will serve as an input to a power estimator together with simulation- or activity data.



Figure 4.1: The intended estimation flow of the top-down power estimation. The blocks highlighted in green are already existing, while the orange ones have to be implemented to make the top down power estimator.

## 4.1 Structural information

The processed structural info-block in Figure 4.1 should contain information about which operations are done on which signals and how they are all connected. Later, in the Power model generator-block this will be related to power information. The structural information should also allow for some estimation of activity in the structure, depending on observable, (input, output and/or register), switching activity. It is also important that the structure remains relatable to the RTL it represents.

Most synthesis tools have an *elaboration* stage where they retrieve structural information from the RTL as a pre-processing stage for the *synthesis*. This is done by breaking down coding constructs and compiler directives and mapping the code to cells from a *generic* library. This library does not correspond to any physical library and the *generic cells* represent logic- and arithmetic functions on the signals only. With this representation as a foundation the *operations* are the generic cells in the elaborated netlist and the *signals* are their connections.

Using the elaborated structural information, rather than unprocessed RTL, brings one a bit closer to the gate level representation of the design and possibly towards more accurate power estimates. It is not desirable to go all the way to a gate level representation as the synthesis process is time consuming, especially for larger designs. It is interesting to see what kind of power model can be developed with this elaborated design as a starting point rather than the RTL it is elaborated from or the netlist it is synthesised into. Detailed information about the elaborated SystemVerilog format can be seen in Section 5.1

## 4.2 Cell library information

A common approach in high level power estimation is to abstract away the cell library by using a general gate representing all the gates in the design instead of differentiating between gates. Such a cell is commonly a NAND2 cell with the correct gate length and power characteristics corresponding to the cell library. This project attempt to lay the foundation of accurate RTL power estimation and thus want differentiate between the cells in the design to some extent. Knowing what cells are where and what they are affected by will possibly improve the accuracy of temporal and spatial power estimates even if the average power estimate remains the same. Finding out *what* cells are available and what power consumption these cells have will be the job of the *Library Processing*-block in Figure 4.1. The library power information is commonly stored in a Liberty file. Liberty is a standard format for representing timing and power characteristics of a cell library. More information on the format is found in Section 5.2.

Nordic Semiconductor ASA has a Liberty parser that can retrieve information from the Liberty file, but further processing is necessary to structure and select the information necessary to do power estimates, which is information relating to the static and dynamic power consumption of the cells. Synopsys has a HDL compiling tool doing design elaboration, but it will be necessary to retrieve structural information from the elaborated SystemVerilog file. Lastly this project will combine the structural information from the elaborated SystemVerilog and the power-focused information from the cell library into a power-aware representation of the design, a *power model*, which can in turn be used for power estimation.

## 4.3 The power modeling flow

Figure 4.2 is a refined version of Figure 4.1. It goes more into detail on the estimation flow adding the Liberty parser and the elaborated SystemVerilog. The highlighted blocks are those involved in developing a power model, and thus the scope of this project. The power model will combine the structural information retrieved from the elaborated SystemVerilog and the power- and cell information retrieved from the Liberty file.

The elaborated file can be made using Synopsys HDL Compiler. A tool part of the Synopsys synthesis flow shown in Figure 4.3. In their flow the HDL is first compiled into an elaborated SystemVerilog netlist. The elaborated design is then fed to Synopsys DesignCompiler together with the Liberty file to yield the gate level netlist.



**Figure 4.2:** A refined flow for the top-down power estimation. The already existing Liberty parser is highlighted in yellow. The part of the flow that is out of scope is drawn in dotted lines.



Figure 4.3: An overview of the synthesis process from RTL to netlist

# 5 Design tools and file formats

## 5.1 Design elaboration

Wire

wire

When synthesising a design, the constructs in the RTL are mapped to cells in the cell library, creating a hardware design with equivalent functionality as the one described in the RTL. This representation is called a netlist. Most synthesis tools do this by going through an elaboration stage. Here the constructs in the RTL are first optimised and mapped to cells from a *generic* cell library. A *generic* cell library is a library with functional cells not corresponding to physical ones. They do not have any power- or timing data. The elaboration also goes through the compiler directives, which are direct instructions on how to process the HDL, such as **'ifdefs**. The results of the elaboration stage is an intermediate file, similar to the netlist, using cells from a generic library, rather than cells from the library used in synthesis.

To complete the synthesis process, the elaborated file is optimised further and mapped to the cells in the cell library. In this project, Synopsys HDL Compiler has been used to get an elaborated representation of the design. In Table 5.1, a simplified list of these elaboration constructs made by this synthesis tool can be seen. The *module*, *input*, *output* and *assign* constructs are the same as in the RTL file. The *wire* represents all connections between objects. The rest of the objects have replaced the more complex RTL with simple, generic gates like an AND2 gate. The elaborated netlist is not technology-specific and, thus, does not contain any power information.

| Construct | generic cells | Description                           |
|-----------|---------------|---------------------------------------|
| Module    | module        | A SystemVerilog module declaration or |
|           |               | instantiation                         |
| Input     | input         | An input port of variable bitwidth    |
| Output    | output        | An output port of variable bitwidth   |
|           |               |                                       |

Table 5.1: Elaborated cells

A wire of variable bitwidth

| Assign       | assign                                                                                                                                                                                      | Assigning one wire to another wire or a constant                                                                    |
|--------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------|
| Multiplexer  | MUX_OP                                                                                                                                                                                      | A multiplexer with variable datawith and select signal width                                                        |
| Register     | SEQGEN                                                                                                                                                                                      | A one bit register                                                                                                  |
| AND2         | GTECH AND2                                                                                                                                                                                  | A two-input AND gate                                                                                                |
| OR2          | GTECH OR2                                                                                                                                                                                   | A two-input OR gate                                                                                                 |
| XOR2         | GTECH_XOR2                                                                                                                                                                                  | A two-input XOR gate                                                                                                |
| Select       | SELECT_OP                                                                                                                                                                                   | This sends one of several data signals out,<br>depending on a control signal                                        |
| Adder        | ADD_UNS_OP,<br>ADD_UNS_CI_OP,<br>ADD_TC_OP,<br>ADD_TC_CI_OP                                                                                                                                 | Adder with inputs and outputs of variable width                                                                     |
| Subtractor   | SUB_UNS_OP,<br>SUB_UNS_CI_OP,<br>SUB_TC_OP,<br>SUB_TC_CI_OP                                                                                                                                 | Subtractor with inputs and outputs of variable width                                                                |
| Shift        | ASH_UNS_UNS_OP,<br>ASH_UNS_TC_OP,<br>ASH_TC_UNS_OP,<br>ASH_TC_TC_OP,<br>ASHR_UNS_UNS_OP,<br>ASHR_UNS_TC_OP,<br>ASHR_TC_UNS_OP,<br>ASHR_TC_TC_OP,<br>SRA_UNS_OP,<br>SRA_UNS_OP,<br>SRA_TC_OP | Shifting a signal in a certain direction,<br>possible to take the sign into account                                 |
| Barrel shift | BSH_UNS_OP,<br>BSH_TC_OP,<br>BSHL_TC_OP,<br>BSHR_UNS_OP,<br>BSHR_TC_OP                                                                                                                      | Shifting, rolling the bit shifted out to<br>the opposite side of the signal instead of<br>shifting in zeros or ones |

| Shift-and-add | SLA_UNS_OP,    | Shift signal before adding              |
|---------------|----------------|-----------------------------------------|
|               | SLA_TC_OP      |                                         |
| Multipliers   | MULT_UNS_OP,   | Multiply two signals and output the re- |
|               | MULT_TC_OP     | sult                                    |
| Division      | DIV_UNS_OP,    | Divide a signal by another and output   |
|               | MOD_UNS_OP,    | the result                              |
|               | REM_UNS_OP,    |                                         |
|               | DIVREM_UNS_OP, |                                         |
|               | DIVMOD_UNS_OP, |                                         |
|               | DIV_TC_OP,     |                                         |
|               | MOD_TC_OP,     |                                         |
|               | REM_TC_OP,     |                                         |
|               | DIVREM_TC_OP,  |                                         |
|               | DIVMOD_TC_OP   |                                         |
| Comparators   | LT_UNS_OP,     | Compare two signals of variable width   |
|               | LT_TC_OP,      |                                         |
|               | GT_UNS_OP,     |                                         |
|               | GT_TC_OP,      |                                         |
|               | LEQ_UNS_OP,    |                                         |
|               | LEQ_TC_OP,     |                                         |
|               | GEQ_UNS_OP,    |                                         |
|               | GEQ_TC_OP,     |                                         |
|               | EQ_UNS_OP,     |                                         |
|               | NE_UNS_OP,     |                                         |
|               | EQ_TC_OP,      |                                         |
|               | NE_TC_OP       |                                         |
|               |                |                                         |
| Not           | GTECH_NOT      | An single bit inverter                  |

Many of the more complex generic cells are grouped into the same constructs. These cells differ depending on the representation of their input signal representations but are otherwise similar in functionality. The Synopsys elaboration differs between *unsigned*, UNS, and *twos'* complement, TC, representations. In the table, cells with different signal representations but otherwise the same functionality is put in the same group.

Many comparators have been grouped together into one. It can be argued that their power consumption is quite similar, as the logic needed to implement them are the same. However, the larger-than and smaller-than comparisons introduce more complexity than the equal and not equal, so it can also be an option to divide the comparators into two (or more) groups.

Table 5.2 shows the elaborated cell groups sorted by functionality.

| Group                | Construct     |
|----------------------|---------------|
| Connects             | inputs        |
|                      | outputs       |
|                      | wire          |
|                      |               |
| Buffers              | Buffer        |
|                      |               |
| Multiplexer          | Multiplexer   |
|                      |               |
| Register             | Register      |
|                      |               |
| Logic operators      | AND2          |
|                      | OR2           |
|                      | XOR2          |
|                      | Comparator    |
|                      | Not           |
|                      | Shifter       |
|                      | Barrel shift  |
|                      |               |
| Arithmetic operators | Adder         |
|                      | Subtractor    |
|                      | Shift-and-add |
|                      | Multiplier    |
|                      | Divisor       |

Table 5.2: Groups of elaborated SystemVerilog constructs

The generic *select* cell is unique as it does not have a cell equivalent in any cell library.

An *if* or *case* statement is elaborated into a generic *select* cell by Synopsys HDL compiler unless it is specified in the HDL representation that one wants it to be inferred as a multiplexer. The *select* statement is then synthesised by Synopsys DesignCompiler into either logic or a multiplexer depending on the available cells and unknown DesignCompiler conditions.

## 5.2 Liberty file format

The cell library's timing and power characteristics are found in a Liberty file. The Liberty file format is an industry standard used to describe cells of a particular technology. Information regarding timing, power, area, functionality and operating conditions of cells in the cell library can be found in this file.

The Liberty file consists of three types of statements:

#### • Group statements

A collection of statements grouped together. In a library, the uppermost group is a *library* group, and no other such groups can be made in a Liberty file. A group internal to the library can, for example, be a *cell* group, and a group internal to the cell can be a *pin* group.

#### • Attribute statements

A statement used to describe the characteristics of objects (groups) in the library. Such attributes can, for instance, be the size of a cell or the unit of leakage current in the library.

#### • Define statements

Used to define new attributes. Which kind of group they are meant to describe is also specified.

Values are often specified without units, and the units of different values are described at a higher level, as library attributes.

### 5.2.1 Power characteristics

The same cell library is characterised in the different design corners described in Section 2.4, resulting in different Liberty files for different process conditions.

In Chapter 2, it was described how power consumption could be divided into dynamic and

static power consumption. Here, the Liberty groups and attributes relating to the two types of power consumption will be investigated.

### Static power

The liberty **cell** group has a sub-group called **leakage\_power**. In this group, a leakage power value is given. The group also has an optional **when** attribute and a **related\_pg\_pin** attribute, set to the supply pin of the cell. The **when** attribute describes the different states of the input pins. For instance, if the cell is a two-input **AND** gate, the attribute would be one of the 4 possible input cases; A1 & A2, A1 & !A2, !A1 & A2 or !A1 & !A2. If the **when** attribute is not given, the average leakage power is the one given. Depending on how accurate data one wants, one can choose to retrieve the average leakage power or all the state-specific leakage power values from the liberty file. In Listing 5.1, an example of the **leakage\_power** group can be seen.

Listing 5.1: leakage\_power group example

```
1 leakage_power () {
2 value : 93.1982;
3 when : "A1 & A2";
4 related_pg_pin : "VDD";
5 }
```

The unit of values of different groups are described at a higher level in the library.

#### Dynamic power

For a logic design, one can say that there are two contributions to the switching power:

• Switching power

The charging and discharging of the output load capacitance, which is determined by the input pins the output is connected to.

### • Internal power

The internal switching of transistors within the cell, both as a result of an input transition leading to an output transition and an input transition only causing some transistors *within* the cell to switch.

Both of these contributions are found in the **internal\_power** group in the Liberty file. This group is a sub-group of the **pin** group, which in turn is part of a **cell** group. The internal\_power group of an input pin will describe the internal power consumption of the cell, while the internal\_power group of an output pin will describe the switching power.

Listing 5.2: pin group examples

```
pin (Z) \{
1
           direction
                                  : "output";
2
           related power pin
                                 : "VDD";
3
           related ground pin
                                 : "VSS";
4
           power down_function : "(!VDD) + (VSS)";
5
           function
                                  : "A1*A2";
6
           max capacitance
                                 : 0.078;
7
           timing () \{\ldots\}
8
           timing () \{\ldots\}
9
           internal_power() {
10
                related pin
                                  : "A1";
11
                                  : "A2";
                when
12
                related_pg_pin : "VDD";
13
                rise power (lookup table template) {
14
                    // lookup table data
15
                }
16
                fall power (lookup table template) {
17
                    // lookup table data
18
                }
19
       }
20
           internal power() {...}
21
       pin (A1) {
22
           direction
                                 : "input";
23
           related power pin
                                 : "VDD";
24
           related_ground_pin
                                 : "VSS";
25
           max transition
                                 : 10;
26
           capacitance
                                 : 0.0268;
27
           rise capacitance
                                 : 0.0045;
28
           rise capacitance range (0.0071, 0.0089);
29
           fall capacitance
                                 : 0.0067;
30
           fall capacitance range (0.0032, 0.0045);
31
           receiver_capacitance () {...}
32
           internal power() \{\ldots\}
33
34
```

# 5.2.2 Power related library attributes and groups

In Table 5.3 some groups and attributes related to power consumption at a library level can be seen.

| Group/attribute                         | Description                                   |  |
|-----------------------------------------|-----------------------------------------------|--|
|                                         |                                               |  |
| voltage_unit                            | the voltage unit used for the cell library    |  |
|                                         | voltage values                                |  |
|                                         |                                               |  |
| $capacitive\_load\_unit$                | The unit for capacitive loads in the cell li- |  |
|                                         | brary                                         |  |
|                                         |                                               |  |
| ${\bf library\_features}~({\rm group})$ |                                               |  |
| default_cell_leakage_power              | default value for cell leakage power if cell  |  |
|                                         | lacks this group, if not specified it is zero |  |
|                                         |                                               |  |
| lu_table_template (group)               | Describes buildup of a lookup table that      |  |
|                                         | can be filled with characterisation values    |  |
|                                         |                                               |  |
| <b>cell</b> (group)                     | See Section 5.2.3                             |  |

| Table 5.3: | Library grou | p and attribute | overview |
|------------|--------------|-----------------|----------|

## 5.2.3 Cell attributes and groups

In Table 5.4 some groups and attributes related to the power consumption on a cell level can be seen. The cell in itself is a group in the library.

| Group/attribute       | Description                                  |  |
|-----------------------|----------------------------------------------|--|
|                       |                                              |  |
| footprint             | used to relate cells with same functionality |  |
|                       |                                              |  |
| area                  | the area of the cell                         |  |
|                       |                                              |  |
| leakage_power (group) |                                              |  |
| value                 | the leakage power value                      |  |
| when                  | pin logic values for value to be valid       |  |
| related_pg_pin        | related supply voltage pin                   |  |
|                       |                                              |  |
| <b>pin</b> (group)    | See Section 5.2.4                            |  |

 Table 5.4:
 Power related cell groups and attributes overview

# 5.2.4 Pin attributes and groups

Important groups and attributes of the pin group are shown in Table 5.5. The pin group itself is a group in a cell.

| Group/attribute                              | Description                                  |
|----------------------------------------------|----------------------------------------------|
|                                              |                                              |
| direction                                    | Whether pin is input or output pin           |
|                                              |                                              |
| related_power_pin                            | What is the power pin relative to this pin   |
|                                              |                                              |
| related_ground_pin                           | What is the ground pin relative to this pin  |
|                                              |                                              |
| capacitance (input pin)                      | Capacitance of pin                           |
|                                              |                                              |
| function (output pin)                        | Boolean function describing pin function     |
| max capacitance (output pin)                 | Maximum capacitance the pin can drive        |
|                                              |                                              |
| $\mathbf{internal\_power}\;(\mathrm{group})$ |                                              |
| $related_input$                              | input relating to this group instantiation   |
| when                                         | conditions of other related pins             |
| related_pg_pin                               | the related power-ground pin                 |
| rise_power (group)                           | the power consumption if related_input rises |
|                                              | (a LUT)                                      |
| fall_power (group)                           | the power consumption if related_input falls |
|                                              | (a LUT)                                      |
|                                              |                                              |

# 5.3 Test files and modules

## 5.3.1 Test modules

The system will be tested on several of Nordic Semiconductors designs. Here they are listed together with a short description of their functionality:

| Module1 | An activity monitor        |
|---------|----------------------------|
| Module2 | A memory management module |
| Module3 | A queue module             |
| Module4 | A data management module   |
| Module5 | A filter module            |

## 5.3.2 Calibration netlist

The system will in Chapter 7 make use of a *calibration netlist*. This file is the netlist of a full chip made by Nordic Semiconductor ASA.

## 5.3.3 Liberty file

The Liberty parser developed in Chapter 7 has been tested on one Liberty file. This file is representing a library in the sub-micro dimensions, with typical process values and operating conditions.

## 5.3.4 Project files

The code implemented as a part of this project can be found on GitHub [21] and also in the Appendix of this report. References to the Appenix are given in Table 5.7

Description Documentation Code listing Implementation for retrieving the structural Appendix D Appendix A information from the HDL description Implementation of retrieving and organising Appendix E Appendix B library information from the Liberty file Implementation of power model generator combining structural information from the Appendix C Appendix F HDL description of a design with the cell library information from the Liberty file

 Table 5.7:
 Code listings and code documentation

# 6 Extracting design structure

This chapter presents how the structural information of a design is retrieved, what comprises this information, and how it is shaped into a useful representation. The scope of this chapter, relative to the rest of the project, is highlighted in Figure 6.1.



Figure 6.1: The modeling flow with the part of the flow relevant to this chapter highlighted

The structural representation consists of information about operations done on signals and how the signals are connected as described in Section 4.1. It is needed as an input to the power model generator. In this power model generator, the structural representation will be combined with information about the cell library, such as which cells are available and information on the power consumption of these cells.

In Chapter 4 an estimation flow was settled upon. The flow makes use of the elaborated SystemVerilog made by Synopsys HDL compiler, rather than unprocessed RTL. The elaborated SystemVerilog format is presented in Section 5.1. Using a gate-level representation would introduce a slow synthesis process, and is thus undesirable. Using the elaborated RTL the code constructs and compiler directives are dealt with and a structural libraryindependent netlist is available. Assuming the elaboration tool does this well, using this representation as an input to get a structural representation of the design is ideal, and will save some implementation time.

# 6.1 Elaborated SystemVerilog

The available structural information from the elaborated HDL is low level and fine-grained, being a netlist of the design using a generic cell library. This representation does not introduce any abstraction, except the abstraction already present as a gap between the Register Transfer Level (RT-level) and the gate-level. On the contrary, it reduces the gap between these representations by removing code constructs, like *if*, *case* and *generate* statements in the HDL, transforming it to a netlist of generic gates. However, if any compiler directives or parameters change, the design will have to be re-elaborated, and the structural representation regenerated.

The information needed for the structural representation is:

- What building blocks make up the design
- How they are connected to compose the design

In the elaborated SystemVerilog, the *building blocks* are the cells from the generic cell library and their *connections* are represented as wires, input and outputs in the elaborated design representation, listed in Table 5.1.

It is necessary to retrieve enough information about the arrangement of generic cells, so that a good representation of the design can be made. A goal for this project as a whole is to be able to differentiate between the different types of cells and their power consumption. This requires distinguishing between different types of cells in the structural representation.

In Table 5.2 groups are made of the generic cells in elaborated SystemVerilog. These groups will be used, rather than representations for every generic cell type being differentiated between.

# 6.2 Structural representation of a design

There are many ways to represent the structure of the design, a few methods have been evaluated:

- The **levelised** circuit representation annotates gates with a level value. This annotation makes it possible to deal with each level of switching separately and propagate switching probabilities through the design. In [12], Gupta and Najm use this representation to estimate power by calculating the average capacitance at each level. However, this works for combinatorial circuits, not sequential. An illustration of this representation can be seen in Figure 6.2a
- If one can monitor the switching of registers during simulation, another way to structurally represent the circuit is to **levelise it by register**. One can monitor each register and the logic on each register output affected by its switching. This representation is illustrated in Figure 6.2b. One thing to take notice of is how each cell may be part of more than one register level.
- Another option is **large-scale register levelisation**, putting all the logic between a set of registers in one group and making a power estimate for this group depending on switching activity. This is illustrated in Figure 6.2c. A problem this introduces can also be seen in the figure. As "Level 2" in the figure technically also is "Level 1" due to the input of R6 not going through "Level 1". Clear definitions on how to group registers must be in place.
- It is also possible to represent the design **module-by-module**. This representation is made by having a model/representation for each module in the design. This representation can be useful for calculating average power, but power variations over time will be hard to consider. An advantage of this method compared to the register levelisation is that the module borders are clear and non-ambiguous.



(c) A circuit with groups of registers making up one level.

Figure 6.2: Different methods to levelise a logic circuit

The problem with logic being in more than one level is present in all the design representations except the **module-by-module** representation as the boundaries between modules are strict, and the **register-levelised** method, as the boundaries here are very loose. Nemani and Najm [22] eliminates this problem by differentiating between the lowest level that a gate is used at and other possible levels by saying the first level a gate is encountered is where it is *generated*, while, on other levels the gate is *used*.

When settling on a structural representation, it must have the right level of complexity. A too complicated representation would need more time to process the simulation data presented to it by the power estimation tool. At the same time, a too simple model might not be able to give accurate enough power estimates. It is necessary to make abstractions to simplify the design representation, as this will reduce the time required to run a power estimation. On the other hand, it is essential to retain enough information to give useful power estimates.

For the **levelised** representation each gate would have to be considered which would result in a lot of data processing during simulation, if we want the simulation to be fast this is not suitable, even if considering each gate would give the most accurate results. The **largescale register levelisation** and the **module-by-module** representation both abstract away too much information for the power estimation to be accurate as too much logic will be in the same groups, and the different impacts of different signals will not be seen. Their simulations would be fast, but the results not satisfactory. The **register-levelised** representation considers the impact of one register at a time, which is a manageable granularity, still being fine-grained enough the possibility for accurate estimation remains.

# 6.3 Abstractions made

As mentioned making abstractions are important to achieve the desired simulation speeds. As information can later be abstracted away by the *power model generator*, doing too many abstractions while making the structural representation is, however, deemed unnecessary. It is better to abstract away information after the structural representation is related to power information. Some abstractions done at this level are given below:

#### • Grouping of generic cells

In Section 5.1 generic cells are grouped together based on their basic functionalities. This is done in Table 5.2. The biggest generic cell groups were the ones representing complex arithmetic operations such as multiplication, but also shifters and comparators have big groups.

### • Register levelisation

Depending on how it is implemented, the register levelisation may introduce abstractions. If all information outside the register level loses its relation to information within that level, the correlation between inputs and the impact these correlations have on power consumption is lost. It will also make it harder to predict switching activity as inputs from different register levels may not be able to relate to each other, leading to a more inaccurate output switching probability.

Before relating the representation to the cell library used, it is good to retain as much information as possible to make the relation simpler.

## 6.4 Elaborated SystemVerilog parser implementation

The elaborated SystemVerilog parser is implemented in Python. It is class-based and has one class for each of the constructs listed in Table 5.1. It takes in the elaborated HDL representation of the design, created by Synopsys HDL Compiler and yields a set of node trees as the structural representation. The flow of the parser is illustrated in Figure 6.3.



**Figure 6.3:** The elaborated SystemVerilog file is parsed and a set of *structure* class objects are made

The functions of the system, their hierarchy and the different classes are described in depth in Appendix A. The problem can be divided into two parts;

- 1. Parsing the elaborated SystemVerilog and storing the information retrieved in objects.
- 2. Processing these objects to make a register-levelised structural representation.

The implementation is briefly described in the sections below. The code of the parser is given in Appendix D and on GitHub [21].

## 6.4.1 Parsing

The parser begins by going through the whole input file line-by-line. It makes objects for each instantiation of a generic cell or connection it encounters and stores them in lists.

The lists are part of a module object, representing a module instantiation. When a new module declaration is reached, a new module object is made. A module instantiation inside another module is another object type referring to the module objects they represent. After a new object is created from a generic cell instantiation, its connection ports are registered and found in the existing input-, output- or wire- object lists. The cell connection is then registered in the related connection object.

When all the objects are made, and the parsing is done, one has all the structural information in the lists of inputs, outputs and wires as all generic cell objects are now connected to these.

### 6.4.2 Post-processing

The information obtained from the parsing needs to be made into the register-levelised structural representation.

The object lists in the module object make it possible to start from an input or register and see how the signals propagate, fan-in and fan-out until they do not propagate any further. A signal stops propagating when it reaches a top-module output, another register or a *select* or *control* signal for a SELECT or MUX generic cell. This propagation is done, and a structure tree is made from the elements.

The structure tree is a node tree with one parent and multiple children per node. As the goal of the register-levelised representation is to see all the cells affected by the switching of a cell fan-out is necessary to take into consideration. Fan-in is not considered as different inputs to a cell, may originate with different registers.

This structural representation leads to every cell with more than one input having itself and everything connected to its output duplicated as many times as its number of inputs. To deal with this expansion, and reduce the processing time caused by it, the output node tree from a cell is stored in the cell the first time it is encountered. Later, if the cell is reencountered as part of another structure, the node tree stored in the cell object will be reused.

### 6.4.3 Register levelised structure trees

The circuit from Figure 6.2b represented as the structure trees developed from parsing elaborated SystemVerilog and processing the retrieved data is shown in Figure 6.4. The circuit becomes four structure trees, one for each register. Gates with inputs coming from elsewhere will also be part of other structure trees.

It is only possible to go through the tree in the direction of the arrows, and each tree head, starting with each of the four registers, are oblivious to the sharing of structures with the other trees.



**Figure 6.4:** A structural representation of the circuit in Figure 6.2b as a tree of *structure* objects

A text representation of this structural representation is shown in Listing 6.1. Here the relation between the structure trees are more obscured.

Listing 6.1: Structural representation example

```
R1

reg

| gtech_or2

| gtech_and2

R2

reg

| gtech_xor2

| gtech_and2

R3

reg

| gtech_and2

R4

reg

| gtech_and2

R4

reg

| gtech_and2
```

The first time a generic cell instantiation is encountered, the structure object created from it is stored in the cell object. If this cell has more than one input, it will be reencountered. When this happens, the structure object stored in the cell object will be reused.

# 6.5 Comparing cell counts

logic

In Table 6.1 cell counts from the elaborated SystemVerilog parser and the synthesised file are given. The counts are sorted into different groups based on cell functionality. The *select* statement has no equivalent in the synthesised file, and *others* is a group representing cells in the cell library with no generic equivalent functions, such as *decap* cells, preventing an IR drop by providing current when much of the logic switches at once.

|         | constructs | elaborated SV parser count | synthesised file count |
|---------|------------|----------------------------|------------------------|
| Module1 | registers  | 164                        | 175                    |
|         | muxes      | 6                          | 56                     |
|         | inverters  | 406                        | 88                     |
|         | buffers    | 332                        | 6                      |
|         | arithmetic | 2                          | 0                      |

Table 6.1: Gate counts from elaborated structure and synthesised file

1128

1196

|         | selects                     | 2450 | -    |
|---------|-----------------------------|------|------|
|         | others                      | -    | 2    |
|         | total                       | 4488 | 1521 |
| Module2 | registers                   | 15   | 13   |
|         | muxes                       | 4    | 0    |
|         | inverters                   | 92   | 13   |
|         | buffers                     | 57   | 8    |
|         | $\operatorname{arithmetic}$ | 0    | 0    |
|         | logic                       | 411  | 183  |
|         | selects                     | 81   | -    |
|         | others                      | -    | 0    |
|         | total                       | 660  | 217  |
| Module3 | registers                   | 12   | 12   |
|         | muxes                       | 0    | 11   |
|         | inverters                   | 22   | 9    |
|         | buffers                     | 20   | 1    |
|         | $\operatorname{arithmetic}$ | 5    | 0    |
|         | logic                       | 23   | 55   |
|         | selects                     | 26   | -    |
|         | others                      | -    | 0    |
|         | total                       | 108  | 88   |
| Module4 | registers                   | 16   | 16   |
|         | muxes                       | 1    | 16   |
|         | inverters                   | 99   | 22   |
|         | buffers                     | 70   | 0    |
|         | arithmetic                  | 0    | 0    |
|         | logic                       | 89   | 156  |
|         | selects                     | 51   | -    |
|         | others                      | _    | 0    |
|         | total                       | 326  | 210  |
| Module5 | registers                   | 180  | 180  |
|         | muxes                       | 0    | 0    |
|         | inverters                   | 6    | 99   |
|         | buffers                     | 5    | 11   |
|         | arithmetic                  | 19   | 226  |

| logic   | 5   | 708  |
|---------|-----|------|
| selects | 4   | -    |
| others  | -   | 1    |
| total   | 219 | 1225 |

# 6.6 Structural representation discussion

### 6.6.1 Cell counts

The most noticeable trend in Table 6.1 is the difference in total cell count. In most cases, the number of cells after synthesis is drastically reduced. The exception to this trend is the modules containing arithmetic cells. The generic cells for arithmetic operations may be large, as their input signals' width can vary. For example, a 32-bits adder could be represented as one generic cell instantiation. This is seldom the case for arithmetic cells in the non-generic cell library. These cells are often small and brilliantly combined to perform complex arithmetic operations. One generic cell adding together two 32-bits non-constant values will, after synthesis, very likely be represented by more than one arithmetic cell.

The elaborated SystemVerilog of Module5 contains 19 *arithmetic* cells. It is a filtering module performing several multiplications. The synthesis of the arithmetic cells leads to an increase to 226 *arithmetic* cells. Together with four select statements, they contribute to increasing the logic gate count from 5 to 708. The buffer counts also sink considerably in all test modules except Module5 where the buffer count increases from 5 to 11.

The select cells do not exist in the synthesised design, and the synthesis process transforms these to either multiplexers or logic cells. In most cases, it seems they become logic cells. The large Module1 begins with 2450 selects in its elaborated representation, and less than 56 of these are synthesised into multiplexers. All the other modules have similar trends, except for Module3. Here 26 selects become 11 multiplexers and the logic count increase from 23 to 55, but this may also be due to the modules' 5 arithmetic cells.

For modules where arithmetic operations are not done, the total gate count is reduced with more than 2/3 from the elaborated netlist to the synthesised one.

These general trends will hold, but the difference between the elaborated netlist and the synthesised netlist is heavily dependent on the cell library. If the cell library is similar to the generic cell library used during elaboration the reduction in gate count will mostly be due to clever optimisation. If, on the other hand, the cell library contains a wide selection of more complex cells, the mapping from the generic library which contains simple cells to the more complicated cells will in itself reduce the gate count significantly. The further from the synthetic cells the actual cell library is, the more inaccurate the structural representation made by the parser will be.

An example of this can be seen from the inverter count. It is lowered due to many gates inverting the output being present in the cell library compared to the generic library, like NAND-, NOR- and XNOR gates. Cells with several inputs also lowers the gate count, as one OR5 gate can be used instead of four OR2 gates and so on.

### 6.6.2 The register-levelised node tree

The structural representation as a node tree introduces some limitations to the possibility of optimising the structural representation. It allows one to follow one bit through the circuit, but is unaware of the fan-in introduced by cells. It only sees the fan-out introduced by wires and is thus ever-expanding but never shrinking.

When re-encountering objects, their structure will be reused. This reuse of structures prevents this expansion from impairing the size of the structural representation, reducing the processing time and preventing duplication of structural representations for one cell.

Some data is lost in this representation, as only one bit is considered at a time in multipleinput cells. This loss of data could introduce an issue when this representation is later used for power estimation due to the status of all inputs impacting the probability of an output switching.

This data loss could be worked around, either by assuming a probability for the other signals being 0 or 1 or by storing some switching information in the structure object before propagating a switching probability. The latter of these will yield a more accurate result than assuming probabilities and considering the correlation between inputs. However, propagating input probabilities this circumspectly may make the power estimation slow, which is not desired.

Another obstacle the structure tree faces is later being combined with power data from the cell library. If cells with more operations and higher numbers of inputs exist in the cell library, switching to these cells in the structural representation will be hard as we are only able to move forward in the structure, not backward (from parent to children, not from child to parent).

### 6.6.3 Abstractions introduced by generic cell groups

Already in Section 5.2 some choices were made abstracting away information. Several generic cells were grouped together based on their functionality. For instance, multipliers using signed- or unsigned input signals, (or a combination of the two), has been put into a *multiplier* group. All comparator cells were also grouped together.

These generic cells often have no equivalent in the cell library. It is the job of the synthesis tool to combine available cells to create the functionalities needed. These types of generic cells will be subject to massive optimisations in the synthesis. For instance, if one of the inputs to an addition or multiplication is constant, this can significantly reduce the logic needed.

Predicting some of these optimisations may be more important than whether the inputs to the cell are signed or unsigned.

The problem of how to represent these cells aside, grouping cells that will be represented differently, will introduce inaccuracy. For the comparators the *equality* and *inequality* comparators are grouped together with the "greater than", "lesser than", "greater than or equal" and "lesser than or equal" comparators. The logic needed for different types of comparators will vary and dividing this group in three should be considered in the future.

### 6.6.4 Registers being optimised away

In Table 6.1, it can be seen that the register counts are not always the same between the synthesised design and the elaborated SystemVerilog file.

In some cases, if input signal buses are more narrow than the module is made to handle, some registers within the module will be optimised away as they are not needed. Predicting this with the structural representation is hard. As each register is the head of a structure tree, it is hard to know whether registers will be inferred. A *has\_parent* variable introduced in the register object and only registers having a data input get this variable set to *True*. If a register has a parent, and its corresponding structure tree has children, the register is assumed inferred. In most cases, this assumption is valid, but in Module1 it results in fewer registers in the elaborated count than in the synthesised one, and in Module2 it results in more.

### 6.6.5 Possible optimisations

A disadvantage with the chosen structural representation is that different branches in the node tree do not have any relation to each other even if they are parents of the same structure. More ideally, they would be aware of each other or even be part of the same representation of a cell in the power model.

Such awareness would make propagating the activity data more accurate. One would have a probability for each of the inputs rising, and thus be able to calculate a more accurate probability of the output switching. It could also make it easier to relate the information on available cells in the cell library to the structure trees of generic cells.

On the other hand taking several inputs into account before propagating switching activity is complicated and such an approach may be too time consuming.

# 7 Extracting library information

This chapter presents and discusses how the information related to power consumption is retrieved from the cell library. An illustration of the flow, highlighting the steps relevant to this chapter, can be seen in Figure 7.1.



Figure 7.1: The modeling flow with the part related to retrieving power information from the cell library highlighted.

The resulting representation of cells in the cell library and their power consumption will later be used in the power model generator. The power model generator will relate the cell information to the structural representation of the design.

The motivation for using the Liberty file for this is given in Section 4.2. As information about the different cells available and their power characteristics is not normally introduced to a design before synthesis. Introducing it earlier and trying to use it in power estimation will lead to better accuracy of the power estimations if this information is related to the design one tries to estimate the power consumption of in a good way.

### 7.1 Relevant power data

The information to be retrieved from the cell library consists of certain groups and attributes. These are described in Section 5.2. Different cells have different power consumptions. An AND2 gate and a Multiplexer, for instance, will not have the same leakage power or switching power. Differentiating between types of cells will lead to more accurate power estimates, than, for instance, using the power data of an average cell for all cells. Using the average cell may yield the same average power consumption for the design, with a loss of spatial and temporal accuracy. Being able to improve accuracy with this differentiation is depending on relating the cell information well to the RTL representation of the design. Using the structural representation developed in Chapter 6, relating different cells and their power data to different generic cells in the structure trees is possible.

This information will allow for estimating the switching power by adding together the power values from the input pins' internal\_power group, which makes up the *internal power* of the cell, and the power value in the output pins internal\_power group, which depends on the capacitance of the pins they drive (if any) and constitutes the *switching power* of the cell. It also allows for estimating the leakage power by looking up the value in the leakage power group.

# 7.2 Abstractions

### 7.2.1 The difference between fall- and rise power

Often it is either the fall- or the rise power consuming power in a transition. The rise power is described in the rise\_power group. It describes the power consumed as the output pin rises when a related input pin rise. The fall power is part of the fall\_power group and describes the power consumed as an output pin falls after the related input pin rises. In this implementation, these two contributions to power, the rise- and fall power, are added together, as a signal rising implies that it has previously fallen, and conversely. This way, one needs only observe rising transitions, but can still take the fall power into account. This combination of the rise- and fall power effectively halves the amount of data needing to be represented. The temporal accuracy of the power estimation will, however, be affected by this combination. If a cell consumes the most switching power when an input falls, instead of when the input rises, the power consumption will happen later than estimated. After a signal rises, it is impossible to say when it will fall, and the power related to the fall of the input will be consumed.

This inaccuracy may be worth having half the amount of data to process in a simulation. An illustration of the abstraction is shown in Figure 7.2.



Figure 7.2: The impact on power estimation when summarising rise- and fall power

### 7.2.2 The difference between data input pins

The difference between data inputs in a cell will be abstracted away. The first input pin is the only input pin for which power data will be stored. This will lead to power estimated for different input pins in a cell to all be based on the power characteristics of the first input pin.

A good example of this abstraction can be given with the ANDOR21 cell, shown in Figure 7.3, two of the input signals go through the AND2 gate, before the OR2 gate, while the third input signal only goes through the OR gate.

The ANDOR21 cell does not exist in the generic cell library. Its equivalent is a generic AND2 gate, combined with an OR2 gate. The two first inputs of these cells will go through

both the AND2 gate and the OR2 gate and have the structural representation given in Listing 7.1, while the third input will only see the OR2 gate and be oblivious to the AND2 gate, as shown in Listing 7.2. The two first input transitions will naturally consume similar power, but the last input is not going through the same operations and will have different power characteristics.



Figure 7.3: ANDOR21

Listing 7.1: ANDOR21 gate seen from the two first inputs

```
gtech_and2
| gtech_or2
| ...
```

Listing 7.2: ANDOR21 gate seen from the last input gtech or2

|...

A comparison between two cells from the cell library; an ANDOR21 cell, as shown in Figure 7.3, and a regular OR2 cell is done. The comparable scenarios are the third input of the ANDOR21 rises while the result of the AND operation is 0, and a rising input on the OR2 cell while the other input is 0. It turns out that the ANDOR21 consumes approximately 41% more power in this switching.

Being able to take the found difference in power consumption into account would be advantageous as a more accurate power model could be made. With the structural representation from Chapter 6, where only one bit is considered at a time, it is impossible to recognise a cell with more operations than those the bit in question propagates through. Thus, abstracting away all inputs but the most complex one, will not make the representation more abstract, as the same limitation is already present from the structural representation. If a way to work around this abstraction is found in the power model generator, or the structural representation is re-implemented, however, only saving data from the first input pin of a cell is introducing some inaccuracy.

### 7.2.3 The state-dependency of leakage power

The leakage power in a cell is state-dependent. The library used contains leakage power groups with the *when* condition for all possible states of the cell to model this. There is also one leakage group containing the average leakage power between all the other leakage groups. In this project, the average leakage power group will be used, and the state of the cell will not be taken into account. Making use of the average leakage power removes the need to calculate the states of cells in the power model. Thus, it is still an opportunity for the power model to abstract away cells entirely, and only have a sum of power values in their place when calculating the power consumption in simulation.

Knowing the leakage power as a cycle variant contribution would increase the temporal accuracy of the power estimations, but it may introduce more computation than it is worth. Cells that spend equal time in all states will contribute a total power equal to the leakage power value used in this project. It is unlikely this is the case for any cell, but the total average power contribution from leakage power will likely be close to the total leakage power consumption nevertheless, as some cells will consume more than expected and others less.

The difference between the states of cells is significant and shown in Table 7.1. For a regular AND2 gate the leakage power can vary with up to 161%. If the AND2 gate is always open, this will result in a larger actual leakage power consumption than the estimated one, and if the gate is always closed it will result in a smaller actual leakage contribution than the estimated value.

| Cell | Increase in power consumption |
|------|-------------------------------|
| AND2 | 161%                          |
| MUX2 | 51.9%                         |
| NOT  | 147%                          |

 Table 7.1: Increase in leakage power from least consuming to most consuming state

Knowing how the leakage power varies in time is not relevant in most cases as the leakage power contribution to power is several orders of magnitudes less than the switching power. It is the steady, relentless power contribution every cycle that makes leakage power a big part of a design's power consumption. As long as the average leakage power is accurate enough, knowing the leakage variation in a cell over time is deemed redundant for this project. When qualitative results are obtained, the accuracy of the estimated leakage power contribution should be investigated.

# 7.3 Cells with same functionality

In a cell library, there are several cells with the same functionality, that have different properties. They can differ in timing-, area- and power characteristics, and the load capacitance the cell can handle and so on. Depending on requirements to a specific location in the design, the synthesis tool may choose any of these cells.

For large fan-outs, cells with high enough driving capacity are needed. For critical paths, faster cells may be necessary to avoid breaking the timing constraints. If it is a priority to reduce the power consumption of a design, cells with low power consumption will be used wherever possible.

As these cells have varying power characteristics, it is necessary to know which of them are most likely to be used in synthesis to improve the estimation accuracy.

From the pin attributes, the maximum capacitance an output pin can drive is given. If the total capacitance of the pin(s) this output is connected to exceed this capacitance for any cell, it can be discarded as a possible candidate.

Trying to choose a cell only based on the driving capacitance, however, ignores design constraints. These will be reflected in the cells used at the gate level representation.

A representative design can be used as a *calibration netlist* for the Liberty parsing to take design constraints into account. It must have been synthesised with the same frequency and voltage as intended for the design one wants to model as which cells chosen during synthesis depend heavily on this information. The *calibration netlist* needs to be large enough to give a clear indication of which cells are likely to be used and which are less likely.

In the *calibration netlist*, the number of occurrences for all the available cells is counted. Later, when comparing cells of the same functionality, this count can be compared to the counts of other cells, and used as an indication of whether or not a cell is more likely to be used than another.

# 7.4 Implementation

Running the Liberty parser is time-consuming. The liberty file used in this project exceeds 11 million lines, containing around 300 cells. For each cell approximately 20 lines of information is needed. This amounts to 6000 lines constituting the information wanted. Temporarily storing the power- and cell information outside the Liberty file is deemed necessary. The flow of the implementation is shown in Figure 7.4.



Figure 7.4: The dataflow of retrieving the relevant Liberty data.

The implementation has been divided in two:

1. Reading out data from the Liberty file and calibration data from the *calibration netlist* and storing it in JSON object lines.

**2.** Reading out the JSON lines from the intermediate file and construct an object representing the cell library.

The implemented code can be found in Appendix E and on GitHub [21]. A detailed overview of the functions and classes in the implementation can be found in Appendix B.

### 7.4.1 Parsing Liberty and storing data

Using the Liberty parser provided by Nordic Semiconductor ASA, the Liberty file was parsed, and the information interesting for power estimation of a cell was put into a list then transformed to a JSON object. In addition to the cell information, each cell's occurrences are counted from a *calibration netlist*, a large synthesised design used by Nordic Semiconductor ASA. This count can later be used to see which variations of each cell type are more probable to be used in synthesis.

Each cell is thus made into one object on one line and all unwanted information is removed from the cell representation. A function for reading out the representations from the JSON file was also made. A JSON line looks like this:

```
([cellName, footprint,leakagePower, occurences_in_calibration_file, [
    input_pins, output_pins])
```

The output pins is a list of output pin objects:

```
[pinName, pin_direction, pin_function, pwrPin, gndPin, related_pin,
when condition, [rise cap, powerSumList]]
```

rise\_cap is an array of load capacitance values, and PowerSumList is an array of power values relating to each capacitance value in rise\_cap. The values in PowerSumList is the sum of the rise- and fall power values for the capacitance value in question. The input pin object is identical to the output\_pin object, except for having a capacitance value instead of a function.

Running the liberty parser of Nordic Semiconductor ASA on the more than 11 million line Liberty file takes 11-12 minutes. If in addition to this calibration is done on an almost 3 million line calibration netlist, the time required to get the JSON library representation gets close to 20 minutes.

### 7.4.2 Putting together a cell library object

When the power library representation is needed, the file with the JSON objects is parsed, and each cell found is put in a cell object. This object contains all the information on the JSON line and a sequence of generic gates from the elaboration library corresponding to the cell behaviour. The cells are then grouped into a cell\_group object based on their functionality. All the cell groups are then put in a cell\_library class object representing the cell library.

### 7.4.3 Summary

The relevant power information from the cell library can now be retrieved and stored in a library object. The library object contains lists of groups sorted after the number of inputs and separate lists for registers, multiplexers and empty groups for the more complex generic cells. This library object can be used to find the power information one wants, which was previously found in the liberty file.

To avoid the time-consuming parsing of the liberty file every time the power estimation is done, the Liberty information is intermediately stored in JSON lines. This intermediate format reduces the time it takes to get the information from the liberty file and the calibration data from almost 20 minutes to instantaneous.

# 7.5 Discussion

### 7.5.1 Choosing a cell from a group

When deciding which cell in a cell group to use in the power model, there are different ways to do so. One can use the characterisation data to get the cell in a group with the highest weight, or one can use the weights to calculate some average cell in a group based on a weighted average.

Another option is to not care about the characterisation data and choose a cell with suitable driver strength depending on the total input capacitance it has to drive. Although, this selection may depend just as much on the speed needed for switching.

As this power representation contains no timing information, choosing cells based on timing is not an option. This may introduce inaccuracy if the circuit is synthesised with strict performance constraints. The calibration data tries to make up for this but depends on the design subject to power estimation being synthesised under similar constraints as the calibration netlist.

Choosing a cell in a cell group can be up to the user of the library by adding procedures to it, giving out a cell depending on the user's choice. Such a procedure could ask for a weighted average cell or a cell corresponding to some load capacitance.

### 7.5.2 Other representations

A representation that does not abstract away the fall power and leakage power states is also a possibility. Differentiating between rise- and fall power should allow for more cycle accurate power estimation. The fall power can lead to more power consumption than the rise power and signals may remain high for a long or short duration before falling again. A more state-dependent estimation of the leakage power could also be possible.

The abstracting away of power data from all inputs, but the first one could be skipped to make the library representation more general. When it is going to be combined with the structural information from Chapter 6, however, having power data from only the first cell input is sufficient.

### 7.5.3 On the calibration

The calibration script is simplistic and goes through the entire calibration netlist for each cell to count how many times it appears. If runtime is critical, the algorithm should be improved. A good alternative is to go through the calibration netlist looking for all cells in a group at once, or even going through it only once counting occurrences of all library cells simultaneously.

It could also be an option to move the calibration to a different pointing the flow, especially if its runtime is improved. This way, the Liberty parsing remains indifferent to synthesis settings, just extracting the information from the liberty file, and the parsing of the Liberty file will not have to be done again if one wants to test the estimation with another calibration netlist. The calibration data could, for example, be an input to the power model generator instead. Using the calibration data as an input here requires the power model generator user to be aware of the cell library, so the calibration netlist is undoubtedly from the same library as the one used for power estimation.

# 8 Generating a power model

This chapter will present how the structural information retrieved from the elaborated SystemVerilog file can be combined with the cell library information retrieved from the Liberty file in order to create a power model of the design. Figure 8.1 illustrates the scope of this chapter.



Figure 8.1: The modeling flow with the flow relevant to this chapter highlighted.

The *processed structural info* will, in this project, be the *structure* tree from the elaborated SystemVerilog parser. The *power relevant library info* is the *cell\_library* object acquired from the Liberty file. Combining the data retrieved is necessary to relate the structural representation to realistic power data and is the last step towards getting a power model.

# 8.1 Limitations introduced by the structural representation

The most major limitation introduced by the implementation in Chapter 6 is the nature of the structure trees. The nodes in the structure tree allow for finding the children of a structure, but not its parents. It fans out whenever a signal fans out, but does not fan in when a cell does so. Two inputs to the same cell remain oblivious to each other in such a structural representation.

On one hand this allows one to easily parse through all the structures affected by a register or input changing value and estimate the amount of logic. On the other hand, not knowing the other parents of a node can make it harder to estimate the switching activity.

Not knowing the parents of a node in the structure tree makes doing many optimisations on the structure tree impossible. Only sequences of cells will be recognised and possibly replaced with more complex cells. For instance, the three AND2 gates in Figure 8.2a in an elaborated SystemVerilog representation is very likely to be optimised to one AND4 cell during synthesis, if one is available in the cell library. As the last AND2 gate in the structure is unaware of its parents the parents remain oblivious to each other. The structure objects created by the implementation in Chapter 6 is shown in Figure 8.2c. And the tree made up of the structure objects is illustrated in Figure 8.2d. All the four input pins, will, even though they share structure objects be unaware of their relation to each other.



(c) AND4 in structural representation. Structure objects (d) AND4 structure are represented as dotted circles around the object they represent.

Figure 8.2: Different representations of an AND4 gate

# 8.2 Limitations introduced by the cell library representation

Three abstractions were introduced in the implementation of the library processing in Chapter 7. They are discussed further in Section 7.2.

### • Combining rise- and fall power

Reducing the temporal accuracy of the power estimates. Combining the power contributions will not introduce any inaccuracy on the switching power values, but the time where power contributions happen will be inaccurate.

### • The difference between input pins

Power data is only saved for the first input pin of a cell. Ignoring the difference may complicate grouping of generic cells to one cell from the cell library, as one is unable to differentiate between inputs and their impact on power. It was seen in Section 7.2.2 that the OR2 operation done in a regular OR2 cell and the OR2 operation that is part of an ANDOR21 cell consume different power.

### • State dependency of leakage power

The different leakage power values for different states of a cell were not retrieved from the cell library. Only the average value between all the states is available after processing the library data. As the leakage power can increase with as much as 160% from a lowoutput state to a high-output state as shown in Table 7.1, the inaccuracy using only the average leakage power of cells may introduce significant inaccuracy.

# 8.3 Combining the structural information and the liberty data

The cells in the actual cell library are different from the cells in the generic cell library used in the structural representation. To combine the power information with the structural representation, a mapping of the generic cells to cells from the cell library is necessary.

As a cell is not aware of its parents, this can only be done by going through the structural representation and replacing generic cells or sequences of generic cells with cells from the cell library.

After being able to determine which *cell group* fits one or more of the generic cells, a *cell* has to be chosen from that group to get power data. As the calibration data states, the probabilities of each cell in a group being used, some weighted average power consumption of the *cell group* can be made from this. The driver strength of cells could also be used when choosing a cell, excluding those that can drive loads differing by some margin from the actual load.

For the generic arithmetic cells, there is seldom a suitable cell in the cell library. This is discussed further in Section 8.4.

As so few of the buffers were still present after synthesis, as seen in Table 6.1, it is chosen to remove buffer cells entirely from the power representation of the system. Another option could be to let buffer cells remain only if their fan-out is high enough for it to be deemed necessary.

### 8.3.1 Need for optimisation

When Synopsys DesignCompiler elaborates the HDL, the representation is optimised and mapped to the generic cell library. When mapping it to corresponding cells, one option is to find an equivalent cell for each of the generic cells, and simply replace all generic cells with their cell library equivalent from the Liberty file. Systematic errors this may introduce can be investigated and adjusted for to the best ability. Another option is to do some "optimisation" and for instance map an AND2 cell followed by a NOT cell to a NAND2 cell if one is available in the cell library.

To investigate whether the generic cells can be mapped directly to library cells by using only the cell library equivalents of generic cells, or if several generic cells should be mapped to one more complex cell from the cell library when possible, three options of implementing an AND4 gate have been examined: One representation consists of three AND2 gates, one consists of one AND2 gate and one AND3 gate, and lastly, one consists of one AND4 gate. Power information for each of these have been retrieved from the cell library that is used in this case. The three AND4 implementation options are shown in Figure 8.3.



Figure 8.3: Different implementations of a 4-input AND gate

The different cells in the cell library have different power characteristics as shown in Table 8.1. Here the cells are all driving the same output capacitance having one of their inputs rise as the other rises for the dynamic power and the average leakage power value is used. The leakage power values are a bit lower for the bigger cells and the dynamic power consumption is a bit higher.

Table 8.1: Power consumption in AND cells of different sizes using AND2 as the reference

|               | AND3   | AND4   |
|---------------|--------|--------|
| Leakage power | -8.4%  | -20.1% |
| Dynamic power | +14.4% | +18.9% |

The average leakage power for Figure 8.3b is 36% lower than for the three AND2 gates in Figure 8.3a. For the AND4 gate in Figure 8.3c, it is 73% lower. This is due to the leakage power being the average of all the states in the cell and the gates with higher inputs having more states with low power consumption. Combining the three generic cells into one will reduce the calculated leakage power, simply by making use of the cell with more inputs.

In reality, however, the three AND2 gates in Figure 8.3a will not all consume their average leakage power. As one input from the second AND2 gate comes from another, the probability of that input being high is lower than if it was coming from an input propagating through less cells. For the third AND2 gate the probability of the gate output being high is lowered once again. As the model does not monitor states or take state-dependency into account when calculating leakage power this is not taken into account, and the estimated leakage power of the three AND2 cells will be higher than in reality. However, the reduction in leakage power from combining generic cells when mapping them to the cell library is so significant that it improves the power consumption either way. The leakage power that of three open AND2 cells.

For the switching power, the scenarios that leads to the output of the AND4 gate switching has been considered. This means that three inputs are already high, and one rises. For the AND gate combination in Figure 8.3a, this can cause one-, two- or all of the AND2 cells to switch. The mean case will be considered. For the second AND4 gate implementation, in Figure 8.3b, this means either both cells will switch, or only the AND3 cell will. Lastly, for the AND4 cell in Figure 8.3c, one cell will switch.

Using the unoptimised combination of three AND2 gates in Figure 8.3a as a basis, the switching power will on average be 19% lower for the option in Figure 8.3b, and 41.3% lower for the AND4 gate in Figure 8.3c if they drive the same output. The comparison in shown in Table 8.2.

**Table 8.2:** Power consumption in AND4 optimisations, in comparison to the three AND2gate implementation in Figure 8.3a

|               | 1 AND2 1 AND3, Figure 8.3b | 1 AND4, Figure 8.3c |
|---------------|----------------------------|---------------------|
| Leakage power | -36%                       | -73%                |
| Dynamic power | -19%                       | -41%                |

Thus, combining the generic cells to suit more complex cells from the cell library is desirable to get a more accurate power estimate. This will reduce the error in leakage power by introducing cells with more states and reduce the error in dynamic power by using cells more likely to be chosen by the synthesis tool.

The generic library does not contain logic cells with inverted outputs, such as *NAND*, *NOR* and *XNOR*. Typically these cells need less transistors than their non-inverting equivalents. A NAND2- and an AND2 gate will be used to understand the significance of this. In figure 8.4 CMOS logic for a NAND2 gate and an AND2 gate is shown. An AND2 gate consists of a NAND2 gate and an inverter. The generic cell equivalent of the NAND2 gate is a AND2 gate followed by an inverter. If the NAND2 is implemented directly as such, four extra transistors are needed, compared to using a NAND2 gate directly, if one is available in the cell library.



Figure 8.4: A common CMOS schematic for a NAND2 and an AND2 gate. The AND2 schematic is the same as the NAND2 but with an added inverter.

The amount of logic needed, and thus, the power consumed, is reduced by optimisations in the synthesis process as shown in Table 6.1. It is necessary not only to map the structure from Chapter 6 to cells from the cell library, but to also do this intelligently as many cells in the cell library are complex and does not have generic cell equivalents. The same goes for the other way around, as many generic cells, specially the arithmetic ones will never have an equivalent in the cell library.

### 8.4 Generic cells with no library equivalent

In the power model generator it is assumed that all the logic cells in the generic library have an equivalent. This means cell libraries used **must** have the following cells: *NOT*, *AND2*, *OR2*, *XOR2*. It is also assumed that the cell library contains a *register* cell and a *multiplexer* cell.

It is possible to make a power model generator without these assumptions, but then alternatives to the generic cells would have to be found and proposed. An alternative to an AND2 cell, for instance, will be a NAND2 cell followed by an inverter.

Several of the generic cells are complex and have no equivalent in the cell libraries, as their implementation will depend heavily on the inputs to the cells and their size. Most of the generic arithmetic cells; *multipliers*, *divisors*, *adders* and *subtractors* fall under this category and need an alternative implementation with cells from the cell library. For now, all of these are left as empty shells, containing no power information and no cells from the cell library. Alternative representations for *selects*, *shifters* and *comparators* will have to be made as well.

In Section 8.4.1 a representation is investigated for the generic *select* cell. A similar approach can be used upon making representations for the other cells, however, the synthesis of the arithmetic generic cells will depend highly on their inputs. An addition or multiplication of an arbitrary number and a constant, for instance, require less logic than an addition or multiplication of two arbitrary numbers.

### 8.4.1 The *select* cell

The elaborated SystemVerilog netlist contains the 'select' cell, which does not have an equivalent in any cell library. It has the functionality of a one-hot multiplexer and is at times synthesised using a multiplexer, and at other times synthesised using logic cells.

It is necessary with a consistent way to represent the 'select' cell to incorporate it into the power model. From the results in 6.5, it can be seen that in most cases the select statement is not made into a multiplexer, but rather implemented in logic. A 2-input one-hot multiplexer can be represented as two AND2 gates and one OR2 gate, which is also equivalent to an AO22 gate. This representation can be seen in Figure 8.5. This implementation can also be extended to an N-bits one-hot multiplexer by using N AND2 gates and one N-inputs OR gate or its equivalent. For datawidths larger than one, the whole MUX-structure will be duplicated for each data bit



(a) One-hot MUX2 logic representation with datawidth 1



(b) One-hot MUX2 logic representation with datawidth 2. Twice the amount of logic is needed compared to the MUX2 with half the datawidth

Figure 8.5: One-hot multiplexers with different datawidth

The power consumption of possible replacements has been calculated, using a MUX2 as a baseline. The AO22 gate is a suitable replacement with a 4.9% increase in the leakage power and a 7.9% decrease in switching power. Two AND2 gates followed by an OR2 gate performs worse with 37% higher leakage and a 40.8% increase in the switching power. If an AO22 gate is available in the cell library, replacing the SELECT statement with that one, rather than a MUX, is a good option. As a second option, if no such gate is available, substituting the *select* operator with a MUX would be better than the three-cell alternative in Figure 8.5.

The AO22 gate is equivalent to a one-hot MUX, but has two separate select signals instead of a select input and the same select input inverted, compared to a regular MUX. Using the AO22 gate solely to replace select statements should not introduce more than this error in the cases where a MUX is used instead of other logic cells. The optimisation done by the synthesis tool, concerning *select* operators, is still unaccounted for, however.

## 8.5 Estimating the switching power

When estimating the switching power of a design, how switching activity propagates through the cells is important. With the register-levelised structural representation each input- and register bit is the head of a structure tree containing all the logic affected by the inputor register bit switching. Two methods for calculating the switching probabilities will be investigated. Calculating the complete signal propagation, and calculating the signal propagation for each structure tree separately.

### • Determine the complete propagation of the signals

It is possible to know the exact propagation of signals by waiting with calculating the output switching probability of a cell until all input switching activity is known. Then one would know exactly how many gates switched each cycle and can estimate power from it. However this is very complex, especially for larger modules and would take a considerable amount of time. This is what a simulator does.

### • Calculate propagation in each structure tree separately

Another option, that do not require as much calculation each cycle would be to calculate the propagation probabilities beforehand. Then *one* power value can be present for each structure tree. During simulation this value can be added to the total power consumption each time the head of the structure tree (a register or an input) rises.

As determining the complete propagation of signals is too complex for the method to be used efficiently in parallel with RTL simulations the second option is better suited for propagating the switching activity through a structure tree. Having one switching power value per structure tree is a huge advantage when it comes to calculating the power consumption with activity data from a simulation.

# 8.6 Implementation

A description of the system functions, classes, and behaviour can be seen in Appendix C. The full implementation can be found in Appendix F and on GitHub [21].

The power model generator is implemented in Python. It scans the structure trees from the structural representation implemented in Chapter 6 and replaces generic cells, or sequences of generic cells, with cells from the cell library and their power information. These cells and corresponding power information is stored in a *cell\_group* object by the library processing

implemented in Chapter 7. The *cell\_group* is a group of cells with equivalent functionality. All cell groups existing in the cell library are stored in a *cell\_library* object.

The structure from the elaborated SystemVerilog parser, implemented in Chapter 6, is scanned and a corresponding structure is made containing power information, in the form of a *cell\_group* object. This new structure tree is also doing optimisations where more cells follow each other without fanning out. If any sequence of cells correspond to the functionality of a more complex cell, the sequence will be replaced with that cell from the cell library. For instance if an AND2 gate is followed by an OR2 gate, and that OR2 gate only the sequence will be optimised to an ANDOR21 gate if one is present in the cell library. If the AND2 gate output fans out and drives more inputs, however the optimisation will not be done.

In Figure 8.6a the power structure representation of the AND4 gate in Figure 8.2 is shown. The sequences of AND2 gates are combined together and optimised to two AND3 gates.



(a) AND4 in optimised structural representation.



Figure 8.6: An AND4 gate as made by the power model generator

The implementation of the switching propagation through the structure tree to yield a power

estimate for each structure tree was deemed too time consuming. A bit more work done on the propagation on switching probabilities can be seen in Chapter 10. The implemented power model generator is a structure tree of cell groups and contain all the necessary information to implement the switching power estimate suggested in Section 8.5.

## 8.7 Results

The structural representation of the elaborated SystemVerilog and the library information retrieved from the Liberty file have been successfully combined into a tree structure of cells from the cell library.

One structure tree from Module2 is given in Listing 8.1 and the same structure from Module2, after it is processed by the power model generator is given in Listing 8.2. It can be seen that all *select* operators have been replaced with an AO22 cell and that all buffers are removed. In addition the module is able to unite sequences of generic cells corresponding to a sequence of operations that can be done by a more complex cell from the cell library, granted there is no fan-out between the generic cells that are being merged. The first OR gate after the register on line 2 in Listing 8.1 is followed by a NOT gate, in the power structure these are put together into a NOR2 operation.

Another structure tree from the elaborated SystemVerilog parser is shown in Listing 8.3, and can be compared to the power structure tree in Listing 8.4. Here the generators ability to combine cells is once again demonstrated as for instance the sequence on line 11-13 of Listing 8.3, becomes the NOR3 cell on line 6 in Listing 8.4.

| 10 |                             |
|----|-----------------------------|
| 1  | 'reg '                      |
| 2  | $ 'gtech_or2' $             |
| 3  | 'gtech_not'                 |
| 4  | $ $     gtech_buf '         |
| 5  | $ $ $ $ $ $ 'select_op '    |
| 6  | $ $ $ $ $ 'gtech_or2'$      |
| 7  | $ $ $ $ $ $ $ $ gtech_or2 ' |
| 8  | 'reg '                      |
| 9  | 'reg '                      |
| 10 | 'gtech_not'                 |
| 11 | 'gtech_and2'                |
| 12 | $ $ $ $ $ $ gtech_buf'      |
| 13 | $ $ $ $ $ $ $ $ select_op ' |
| 14 | $ $ $ $ $ $ gtech_or2 '     |
| 15 | 'gtech_or2'                 |
| 16 | 'reg '                      |
| 17 | 'reg '                      |
| 18 | 'gtech_or2'                 |
| 19 | $ $ $ $ $ 'gtech_not'$      |
| 20 | 'gtech_buf'                 |
| 21 | 'select_op'                 |
| 22 | 'gtech_or2'                 |
| 23 | 'reg'                       |
| 24 | 'reg '                      |
| 25 | 'gtech_or2'                 |
| 26 | 'gtech_not'                 |
| 27 | 'gtech_buf'                 |
| 28 | 'select_op'                 |
| 29 | 'select_op'                 |
| 30 | 'select_op'                 |
| 31 | 'gtech_buf'                 |
| 32 | 'select_op'                 |
| 33 | 'select_op'                 |
| 34 | 'select_op'                 |

Listing 8.1: Structure from the elaborated SV model

**Listing 8.2:** Power structure from the power model generator

| 1  | 'reg '     |  |
|----|------------|--|
| 2  | 'nor2 '    |  |
| 3  | 'andor22 ' |  |
| 4  | 'or3 '     |  |
| 5  | 'reg '     |  |
| 6  | 'reg '     |  |
| 7  | 'not '     |  |
| 8  | 'and2 '    |  |
| 9  | 'andor22 ' |  |
| 10 | 'or3 '     |  |
| 11 | 'reg '     |  |
| 12 | 'reg '     |  |
| 13 | 'nor2 '    |  |
| 14 | 'andor22 ' |  |
| 15 | 'or2 '     |  |
| 16 | 'reg '     |  |
| 17 | 'reg '     |  |
| 18 | 'or2 '     |  |
| 19 | 'not '     |  |
| 20 | 'andor22 ' |  |
| 21 | 'andor22 ' |  |
| 22 | 'andor22 ' |  |
| 23 | ''         |  |
| 24 | 'andor22 ' |  |
| 25 | 'andor22 ' |  |
| 26 | 'andor22 ' |  |

Listing 8.3: Structure from the elaborated SV model

| 1        | 'reg '                                   |
|----------|------------------------------------------|
| 2        | 'gtech_or2'                              |
| 3        | 'gtech_not'                              |
| 4        | gtech_and2 '                             |
| 5        | $ $ $ $ $ $ 'select_op '                 |
| 6        | 'reg '                                   |
| 7        | $ $ $ $ $ $ $ $ gtech_and2 '             |
| 8        | 'gtech_or2'                              |
| 9        | 'gtech_buf'                              |
| 10       | 'select_op'                              |
| 11       | 'gtech_or2'                              |
| 12       | 'gtech_or2'                              |
| 13       | 'gtech_not'                              |
| 14       | 'select_op'                              |
| 15       | 'gtech_not'                              |
| 16       | 'gtech_and2'                             |
| 17       | 'select_op'                              |
| 18       | 'gtech_and2'                             |
| 19       | gtech_and2 '                             |
| 20       | 'select_op'                              |
| 21       | 'gtech_and2'                             |
| 22       | 'gtech_or2'                              |
| 23       | gtech_or2'                               |
| 24       | 'gtech_not'                              |
| 25       | 'select_op'                              |
| 26       | 'gtech_and2'                             |
| 27       | 'select_op'                              |
| 28       | 'gtech_not'                              |
| 29       | 'gtech_and2'<br>            'gtech_and2' |
| 30<br>31 | green_andz                               |
| 32       | 'gtech or2'                              |
| 33       | 'reg '                                   |
| 34       | $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $      |
| 35       | 'select op '                             |
| 36       |                                          |
| 37       | 'sub op'                                 |
| 38       | 'select_op'                              |
| 39       | 'reg '                                   |
| 40       | 'select_op'                              |
| 41       | 'reg '                                   |
|          |                                          |

Listing 8.4: Power structure from the power model generator

| 1  | 'reg '                                         |
|----|------------------------------------------------|
| 2  | 'nor2 and2 '                                   |
| 3  | 'andor22 reg '                                 |
| 4  | 'and2 or2 '                                    |
| 5  | 'andor22 '                                     |
| 6  | 'nor3 andor22 '                                |
| 7  | ' not '                                        |
| 8  | $\mid \mid \mid \mid \mid$ and and or 22 '     |
| 9  | $\mid \mid \mid \mid \mid$ and 3 and or 22 $'$ |
| 10 | 'and2 '                                        |
| 11 | 'nor3 andor22 '                                |
| 12 | $\mid \mid \mid \mid$ and and or 22 '          |
| 13 | $\mid \mid \mid $ 'not and 3 and or 22 '       |
| 14 | 'or2 reg '                                     |
| 15 | 'and2 andor22 reg '                            |
| 16 | 'andor22 reg '                                 |
| 17 | 'andor22 reg '                                 |

## 8.8 Discussion

Some work remains before the power model is ready to be integrated into a power estimation tool, and the remaining work is outlined in Chapter 10, together with suggestions on how the power estimation tool making use of the power model can be implemented. Discussions regarding the implementation done so far is given in the subsections below.

#### 8.8.1 The quality of the cell mapping

Mapping the structure shown in Listing 8.3 to cells from the cell library, one is left with the structure shown in Listing 8.4. As the generic cells are not directly mapped but grouped when possible the cell count is reduced. It was concluded in Section 8.3.1 that reducing the number of logic cells needed would increase the overall accuracy of the representation.

Some mappings done in the power model generator are evidently increasing the accuracy of the estimations, like combining a sequence of AND2, NOT into a NAND2 gate, as this reduces the number of transistors needed by four. In other cases, evaluating whether the cell mapping is bringing the representation closer to the synthesised one, is harder. The AND4 gate in Figure 8.2 is represented as two AND3 gates in the power structure, as shown in Figure 8.6a. Here the original elaborated representation had three AND2 gates. Changing the AND2 gate to an AND3 gate will reduce both the leakage power and the switching power, as the number of cells is reduced from three to two. The difference between the power consumption of AND cells with differing numbers of inputs are not big, as seen in Table 8.1. Further reducing these two AND3 cells to one AND4 cell would be beneficial. This last combination is harder to do as the power structure trees implemented are oblivious of their parents and can only be parsed in one direction. It is necessary to change the structure trees and the cell mapping algorithm if one wishes to increase the accuracy of the estimation further. Then, before deciding which cell from the cell library to map a structure to, one would be able to look at the parents of a generic cell.

When mapping the structural representation to a power-aware structure, the complex cells with many inputs and logic operations will only be chosen where a match to their most complex paths is found. This is discussed in Section 7.2.2, using an ANDOR21 cell as an example. Calculations done there show the difference between the OR2 gate and the input only going through the OR operation in an ANDOR21 gate consumes differentiating switching power.

Another option is to range the library cells by complexity and tag the generic cells with the

library cell of which they are included. This tag can then be changed if a more complex cell is used. Then, the generic OR2 cell that is part of the ANDOR21 operation would know it was part of an ANDOR21 and not just an OR2. If this method is used, it is necessary to keep track of which input pin of a library cell is connected to a structure and to differentiate between the power consumption of different cell inputs.

Such a method would also require a more sophisticated algorithm for mapping the generic cells to the library cells and that the different data inputs are kept track of and their power information stored when parsing the Liberty file.

The choice of removing all generic buffer cells in the power model has also been made. As buffers, when present, consume a significant amount of power, how to determine when buffers will be inferred could be advantageous. In most cases, as observed in Table 6.1, generic buffers are not inferred. A generic buffer cell will be inferred if the cell output has a load capacitance it is not able to drive by itself. Such a load is typically present where large fanouts occur. Buffer cells consume considerable power, and trying to predict their inference is a possibility, rather than assuming they will never be inferred.

### 8.8.2 Consequences of abstractions

#### No state retention

The leakage power in cells from the cell library is state-dependent and the implemented system retains no state information. Only the average leakage power of a cell is used. If it is discovered that this has a very negative impact on the accuracy of the estimated leakage power, attempting to predict cell states can be considered. As the leakage power contribution is several magnitudes lower than the switching power it was argued that estimating leakage power with a spatial and temporal accuracy may not be necessary to get good power estimates.

Adding state information can be done by introducing a probability of how likely a gate is to be in a specific state. Calculating that probability is hard unless fall power is also considered, as the state of a cell depends on the cell input values. When only considering the signal rise times, and adding the contribution from rise- and fall power together, the time between rising and falling transitions is lost. This means one loses the information on how long the cell has been in an open state versus a closed state.

When storing state probabilities in objects, simplifying the power model will become harder as the power consumption will be dependent on variables in each structure object rather than a constant power value. An option could be to try to estimate some state probabilities in cells as the switching probabilities are calculated, as this could remove the need to include the fall power contribution by itself while still increasing the accuracy of power estimates.

#### Fall power and rise power combined

Abstracting away the fall power by combining it with the rise power in the implementation in Chapter 7 allows for much faster estimation later on, by only needing half the amount of data, but temporal information is lost. The estimates will have a lower cycle accuracy as fall power can contribute more than the rise power to power consumption in many cases, depending on the cell. Knowing when a cell output is probable to rise, but not when it is likely to fall also makes state prediction harder if it is later decided that this is necessary.

### 8.8.3 Evaluating the power model

The power model generated has been visually evaluated in Chapter 8.7, but more evaluation is needed to determine the quality of the model. An option to do such an evaluation would be to create a power estimation tool using the power model, and compare the power estimates yielded to those of state-of-the-art tools for power estimation, both at the RTL and the gate-level. To get the power model to work with a power estimation tool the library cell implementation of the arithmetic operations comparisons and shifts must first be made, and the signal propagation through structure trees must be implemented. After that is done, a power estimation tool making use of the power model must be made. When implementing the power estimation tool, the register- and input switching must be monitored and related to its respective structure tree. Each cycle the contribution to power can be added together, and give a cycle-by-cycle estimate, or transitions within a time frame can be counted and added up to a power estimate for that frame.

It would also be useful to use smaller and simpler designs than those used in this project to be able to investigate the relationship between the elaborated- and the synthesised netlist further, and see how the model made compares to this. These investigations could also be useful when developing the internals of the arithmetic cells, comparators and shifters as one could see how such generic cells are treated by the synthesis tool in different cases.

After evaluating the power model, it would also be possible to determine if there are systematic errors. These can be atoned for by adjusting the resulting power values, and the model can be improved.

### 8.8.4 Improvements to consider

#### Improved optimisation

An improvement that could have a positive effect on power estimates is to add more optimisation to the power model generator. It is currently only looking for sequences of cells corresponding to larger, more complex cells. If it were able to reduce the amount of logic needed in other ways, we would get closer to the synthesised representation.

If parents of nodes in the structure tree were known, the structure would be able to do optimisations also in that direction. For an ANDOR21 gate, with logic function (A1 \* A2) + B this would mean the B input knows it is an input to the ANDOR21 gate and not to an OR2 gate. Such optimisations would lower the power consumption and bring the representation closer to the synthesised one. For an AND4 gate, such as the one in Figure 8.2a this would mean that the structure tree could transform it to an AND4 gate if one is available in the cell library, rather than the two AND3 gates it currently becomes.

Other kinds of optimisations could also be considered. If two inverters are placed after each other, there is a possibility they could be removed, and operation reordering may also reduce the number of logic. Such optimisations could be done by viewing the gates as logic operations and using boolean algebra to simplify the expressions. How effective such optimisations are in bringing the design closer to its synthesised representation, depends on the optimisations done during the elaboration of the HDL.

#### Constant signals

Another factor the implementation in this project is not considering is constant values. If a signal value is set to 1 or 0, it is not necessary to consider it a signal that may have either value. For instance, if a signal set to 1 enters an AND2 gate, the other input will always be propagated through, and the AND2 gate can be removed. Similar conclusions can be reached with a signal being set to 0.

#### Alternatives if generic cells do not exist

In the implementation it was assumed that *multiplexers*, AND2, OR2, XOR2 and NOT cells were present in the cell library as equivalents to the generic logic cells. Such an assumption may not always be the correct, and a possibility of replacing some of these cells

with logic equivalents should be implemented, to make the power modeling available for cell libraries diverging from this assumption.

### Not giving all registers a power structure

As the model is now **all** registers become the top structure of a structure tree. For modules with many registers, this may result in too many structure trees and a more fine-grained power estimation than needed. If the system is modified such that which registers have their own structures can be chosen by the user instead of taking all registers into account, the module would have modifiable granularity. The propagation of switching probabilities through registers will have to be implemented for this method, as registers can now be part of structure trees. Otherwise only minor changes are required in the elaborated SystemVerilog parser.

#### Using switching probability to adjust leakage power

The difference in leakage power in different states of a cell is significant, as seen in Table 7.1. An attempt to adjust for this could be added after the switching probabilities are calculated. If a cell output is likely to be high as a result of a register or input rising, the leakage power increase after a transition changing its state happen. This could be taken into account.

Still, when not considering the fall power contribution by itself when the state changes back to a state with low leakage is unknown. It could be that by assuming a state for a given amount of cycles after a rising output, or by calculating some state duration from the activity data higher accuracy can be reached, than by simply using the average leakage values.

### 8.8.5 The accuracy/speed trade-off

Making the power estimates more accurate and bringing the representation closer to the IC it will become is advantageous, but solely increasing the accuracy of RTL power estimation is not the goal of this project. Some accuracy can be sacrificed to yield fast power estimates, as the estimation speed is also critical. Finding a balance between execution speed and estimation accuracy is paramount.

Separating the structure trees and modeling the power without state-dependency are important steps toward fast power estimation. Both abstractions seem promising, but it remains to be seen whether they are accurate enough. Steps towards improving the model accuracy can be taken if it proves to be inaccurate. For instance using the predicted activity of structure trees, (depending on the register- or input bit switching), to partly give the leakage power state dependency, or improving the mapping from the generic cells to the cells from the cell library, to better imitate the power characteristics of the synthesised design.

If the power estimation yields systematic errors, such as estimating a consequently too high switching probability, they can easily be adjusted for.

# 9 Conclusion

A power estimation flow for top-down power estimation at the RTL have been chosen. Parts of this flow have been implemented to make a power model generator. The power model generator makes use of an elaborated SystemVerilog representation of the design, and process the format to yield a node tree for each input or register bit. This node tree representing the design structure is then combined with information from the cell library to be used when manufacturing the IC. The Liberty file is parsed and information regarding all the cells available in the cell library is organised and stored in a *cell\_library* object. When combining the structural information and the library information some optimisation is done on the structure tree depending on the available cells in the cell library. The resulting structure trees contain the information needed to estimate the power consumption of the design, and the representation has similarities to the netlist.

The implementation shows promise in finding a power-aware representation of a design without first synthesising it. By parsing the Liberty file of a cell library, realistic power values are obtained and integrated into the structure tree representing the design. The grouping and mapping from generic cells to library cells ensures a reduction in the number of cells inferred, which is shown to have a positive effect on the accuracy of power estimates and brings the design representation closer to that of the synthesised design.

Thorough evaluation of the power model is needed to determine whether the speed/accuracy trade-off is satisfactory. Such an evaluation would be more easily conducted after integrating the model into the suggested power estimation flow. The speed and accuracy of power estimations done using the implemented power model can then be compared to power estimates from state-of-the-art tools for power estimation at the RTL and the gatelevel.

Some work remains in the implementation. Power-aware replacements for the arithmetic generic cells in the structural representation will have to be made in the power model generator. Replacements for shifters and comparators are also needed. Lastly, it is necessary to combine the power information in each node together into one power value representing the structure tree for a rising input or register bit. To do so, the propagation of the signal have to be estimated. In addition to this the leakage power should be added together for

all the cell objects present to represent the leakage per cycle.

## 10 Future work

## 10.1 Finishing the power model

Some work remains for the power model to be finished. That work is listed below:

#### • Compose arithmetic cells from cell library

The generic, arithmetic cells need an equivalent. The amount of logic one bit from a signal with a certain width must go through needs to be estimated together with the switching- and leakage power of such propagation. This must be done for

- Multipliers
- Dividers
- Adders
- Subtractors
- Make equivalent for other complex generic cells

The shifter and comparator generic cells also need an equivalent consisting of cells from the cell library with corresponding power data.

#### • Make estimate for each structure tree

The switching activity needs to be propagated through the structure tree for the corresponding input- or register and a power estimate for each tree must be made in addition to a leakage power per cycle estimate.

### 10.2 Implementing a power estimation tool

The power model, when finished, must be integrated into a power estimation tool to yield a power estimate. Such a tool must be made. The key points in developing such a tool are given below:

#### • Connect registers and inputs to structure trees

The registers and inputs corresponding to the structure trees in the power model must be found and "connected".

#### • Use activity data to calculate switching data

The activity data from the simulation must be used to add up switching power consumptions from the structure trees.

#### • Calculate leakage power

The structures in the structure trees must have their leakage power added together to get a leakage-per-cycle estimate.

## Bibliography

- I. S. IEEE and T. Organiztion. https://ieee-isto.org/member\_programs/ liberty-technical-advisory-board/, 2020. [Online; accessed 27-May-2020].
- [2] S. Palnitkar, Verilog HDL: A Guide to Digital Design and Synthesis, Second Edition. USA: Prentice Hall PTR, 2nd ed., 2003.
- [3] J. Rabaey, Low Power Design Essentials. Integrated Circuits and Systems, Springer US, 2009.
- [4] N. Weste and D. Harris, CMOS VLSI Design: A Circuits and Systems Perspective. USA: Addison-Wesley Publishing Company, 4th ed., 2010.
- [5] J. Rabaey, "Introduction," in Low Power Design Essentials, pp. 1–23, Boston, MA: Springer US, 2009. Series Title: Integrated Circuits and Systems.
- [6] R. Dennard, F. Gaensslen, H.-N. Yu, V. Rideout, E. Bassous, and A. LeBlanc, "Design of ion-implanted MOSFET's with very small physical dimensions," *IEEE Journal of Solid-State Circuits*, vol. 9, pp. 256–268, Oct. 1974. Conference Name: IEEE Journal of Solid-State Circuits.
- [7] Design Automation Committee, "IEEE Standard for Power Modeling to Enable System-Level Analysis," *IEEE Std 2416-2019*, pp. 1–63, July 2019.
- [8] L. Zhong, S. Ravi, A. Raghunathan, and N. Jha, "Power estimation for cycle-accurate functional descriptions of hardware," in *IEEE/ACM International Conference on Computer Aided Design*, 2004. ICCAD-2004., pp. 668–675, Nov. 2004. ISSN: 1092-3152.
- [9] L. Zhong, S. Ravi, A. Raghunathan, and N. Jha, "RTL-Aware Cycle-Accurate Functional Power Estimation," *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems*, vol. 25, pp. 2103–2117, Oct. 2006.
- [10] D. Lee and A. Gerstlauer, "Learning-Based, Fine-Grain Power Modeling of System-Level Hardware IPs," ACM Transactions on Design Automation of Electronic Systems (TODAES), vol. 23, pp. 30:1–30:25, Feb. 2018.

- [11] S. Ravi, A. Raghunathan, and S. Chakradhar, "Efficient RTL power estimation for large designs," in 16th International Conference on VLSI Design, 2003. Proceedings., pp. 431–439, Jan. 2003. ISSN: 1063-9667.
- [12] S. Gupta and F. Najm, "Energy and peak-current per-cycle estimation at RTL," *IEEE Transactions on Very Large Scale Integration (VLSI) Systems*, vol. 11, pp. 525–537, Aug. 2003.
- [13] H. Mehta, R. M. Owens, and M. J. Irwin, "Energy characterization based on clustering," in 33rd Design Automation Conference Proceedings, 1996, pp. 702–707, June 1996.
- [14] R. Zafalon, M. Rossello, E. Macii, and M. Poncino, "Power macromodeling for a high quality RT-level power estimation," in *Proceedings IEEE 2000 First International Symposium on Quality Electronic Design (Cat. No. PR00525)*, pp. 59–63, Mar. 2000. ISSN: null.
- [15] K. Buyuksahin and F. Najm, "Early power estimation for VLSI circuits," *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems*, vol. 24, pp. 1076–1088, July 2005.
- [16] S. Sambamurthy, J. A. Abraham, and R. S. Tupuri, "A Robust Top-Down Dynamic Power Estimation Methodology for Delay Constrained Register Transfer Level Sequential Circuits," in 21st International Conference on VLSI Design (VLSID 2008), pp. 521–526, Jan. 2008. ISSN: 2380-6923.
- [17] Ansys. https://www.ansys.com/products/semiconductors/ansys-powerartist, 2019. [Online; accessed 22-June-2020].
- [18] Synopsys. https://www.synopsys.com/verification/ static-and-formal-verification/spyglass/spyglass-power.html, 2019. [Online; accessed 22-June-2020].
- [19] Mentor. https://www.mentor.com/hls-lp/powerpro-rtl-low-power/ power-estimation, 2019. [Online; accessed 22-June-2020].
- [20] Cadence. https://www.cadence.com/en\_US/home/tools/ digital-design-and-signoff/power-analysis/joules-rtl-power-solution. html, 2019. [Online; accessed 22-June-2020].
- [21] E. T. Bygland. https://github.com/emblatbyg/thesis, 2020. [].

[22] M. Nemani and F. N. Najm, "Towards a high-level power estimation capability [digital ICs]," *IEEE Transactions on Computer-Aided Design of Integrated Circuits and* Systems, vol. 15, pp. 588–598, June 1996.

## A Technical implementation of the elaborated SystemVerilog parser

Figure A.1 shows the function hierarchy of the elaborated SV parser. In Table A.1 a function overview can be seen, while A.2 shows the helper functions of the parser.

The Table A.3 gives a complete overview of the elaborated SV parser classes, class variables and procedures can be seen.



Figure A.1: The function hierarchy of the elaborated SV parser. Functions at the same level are called from left to right.

| Function                                                                                | Description                                                                                                                              |  |
|-----------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------|--|
| parse_file(filename)                                                                    | Parses the elaborated SystemVerilog file object by object                                                                                |  |
| <pre>module. set_connection_points()</pre>                                              | reads the description on how to connect the inputs and<br>outputs of a module from the module declaration stored<br>in the module object |  |
| process_dependencies()                                                                  | goes through the dependencies of a module and connects<br>them to the rest of the module                                                 |  |
| connect_structures<br>(top_module)                                                      | Goes through all inputs of the top module and all registers,<br>creating their structure trees                                           |  |
| connect_assigns ()                                                                      | Sets the two signals assigned to each other equal each other                                                                             |  |
| make_object (line,<br>object_type)                                                      | processes the object line sent in from parse_file                                                                                        |  |
| create_object<br>(object_type)                                                          | calls the init function of the type specified by make_object                                                                             |  |
| parse_line (line,<br>object_handle)                                                     | goes through the object internals and set their connection points                                                                        |  |
| <pre>process_match (match,<br/>object_handle,<br/>connection_type,<br/>port_name)</pre> | connects one connection point to an input, output or wire                                                                                |  |
| connect_children (parent,<br>i1, i2)                                                    | make a node tree including everything connected to a spec-<br>ified parent (input signal or register output)                             |  |

 Table A.1: Overview of the functions in the elaborated SV parser.

 Table A.2: Helper functions for the elaborated SV parser

| Function                       | Description                                              |
|--------------------------------|----------------------------------------------------------|
| search_list(list, object_name) | looks for object with object_name in the list of objects |
| find_module(line)              | looks for regex matching start of a module in line       |
| find_endmodule(line)           | looks for regex matching end of module in line           |
| empty_global_lists()           | empties global lists removing a module environment       |
| set_global_lists(module)       | setting global lists to match environment of module      |
| connect_nodes(n1, n2)          | set two nodes to equal each other                        |
| hline find_indexes(string)     | looks for an index declaration in the string             |

| Class          | Variables       | Procedures                               |
|----------------|-----------------|------------------------------------------|
| register       | id              | init(self)                               |
|                | name            | initialize object, returns object handle |
|                | Q               |                                          |
|                | QN              |                                          |
|                | clear           |                                          |
|                | preset          |                                          |
|                | next_state      |                                          |
|                | clocked_on      |                                          |
|                | data_in         |                                          |
|                | enable          |                                          |
|                | synch_clear     |                                          |
|                | sync_preset     |                                          |
|                | synch_toggle    |                                          |
|                | synch_enable    |                                          |
|                | output_nodes_q  |                                          |
|                | output_nodes_qn |                                          |
| gtech_or2      | id              | init(self)                               |
| $gtech\_xor2$  | name            | initialize object, returns object handle |
| $gtech_and2$   | A               |                                          |
|                | В               |                                          |
|                | Z               |                                          |
|                | output_nodes    |                                          |
| $gtech_not$    | id              | $\init\(self)$                           |
| $gtech_buf$    | name            | initialize object, returns object handle |
|                | A               |                                          |
|                | Z               |                                          |
|                | output_nodes    |                                          |
| $shift_op$     | id              | init(self)                               |
| $b\_shift\_op$ | name            | initialize object, returns object handle |
|                | SH              |                                          |
|                | A               |                                          |

**Table A.3:** Overview of classes in the elaborated SV parser and their variables and pro-cedures.

|              | output nodes     |                                          |
|--------------|------------------|------------------------------------------|
| shift_add_op |                  | init(self)                               |
| r            | name             | initialize object, returns object handle |
|              | SH               |                                          |
|              | $output\_nodes$  |                                          |
| comp_op      | id               | init(self)                               |
|              | name             | initialize object, returns object handle |
|              | А                |                                          |
|              | В                |                                          |
|              | $output\_nodes$  |                                          |
| add_op       | id               | init(self)                               |
| sub_op       | name             | initialize object, returns object handle |
| mult_op      | А                |                                          |
| div_op       | В                |                                          |
|              | Z                |                                          |
|              | $output\_nodes$  |                                          |
| mux_op       | id               | init(self)                               |
|              | name             | initialize object, returns object handle |
|              | D                |                                          |
|              | S                |                                          |
|              | Ζ                |                                          |
|              | datawidth        |                                          |
|              | output_nodes     |                                          |
| select_op    | id               | $\_\_init\_\_(self)$                     |
|              | name             | initialize object, returns object handle |
|              | D                |                                          |
|              | S                |                                          |
|              | Z                |                                          |
|              | selectwidth      |                                          |
|              | datawidth        |                                          |
|              | output_nodes     |                                          |
| input_obj    | id               | init(self)                               |
|              | name             | initialize object, returns object handle |
|              | connection_nodes |                                          |

|            | width             | $add\_node\_input\_connection(self, i1, i2, l, index\_type)$                                                               |
|------------|-------------------|----------------------------------------------------------------------------------------------------------------------------|
|            | depth             | add a connection to the node specified by indexes and                                                                      |
|            | widthoffset       | index_type                                                                                                                 |
| output_obj | id                | init(self)                                                                                                                 |
|            | name              | initialize object, returns object handle                                                                                   |
|            | connection_nodes  |                                                                                                                            |
|            | width             | $add\_node\_input\_connection(self, i1, i2, l, index\_type)$                                                               |
|            | depth             | add a connection to the node specified by indexes and                                                                      |
|            | widthoffset       | index_type                                                                                                                 |
|            |                   | <pre>add_node_output_connection(self,i1,i2,l,index_type,k) add a connection to the node specified by indexes and in-</pre> |
|            |                   | dex_type                                                                                                                   |
| connection | id                | $\init\(self)$                                                                                                             |
|            | name              | initialize object, returns object handle                                                                                   |
|            | connection nodes  |                                                                                                                            |
|            | width             | init connection $nodes(self)$                                                                                              |
|            | depth             | initialize an array of node objects corresponding to the                                                                   |
|            | widthoffset       | size of the connection                                                                                                     |
|            |                   | $add\_node\_input\_connection(self, i1, i2, l, index\_type)$                                                               |
|            |                   | add a connection to the node specified by indexes and in-                                                                  |
|            |                   | dex_type                                                                                                                   |
|            |                   | $add\_node\_output\_connection(self, i1, i2, l, index\_type, k)$                                                           |
|            |                   | add a connection to the node specified by indexes and in-                                                                  |
|            |                   | dex_type                                                                                                                   |
| node       | id                | init(self)                                                                                                                 |
|            | connected_inputs  | initialize object, returns object handle                                                                                   |
|            | connected_outputs |                                                                                                                            |
|            | i1                | $add\_input\_connection(self,l)$                                                                                           |
|            | i2                | add connection to connected_inputs                                                                                         |
|            |                   | $add\_output\_connection(self,l,J)$                                                                                        |

|            |                   | add connection to connected_outputs and adds node to |
|------------|-------------------|------------------------------------------------------|
|            |                   | output_nodes of connected object                     |
| dependency | id                | init(self)                                           |
|            | name              | initialize object, returns object handle             |
|            | modulename        |                                                      |
|            | module_handle     | $add\_connections(self, list)$                       |
|            | connections       | add a connection to connections                      |
| assign     | id                | init(self)                                           |
|            | lhs               | initialize object, returns object handle             |
|            | rhs               |                                                      |
|            | lhs_i1            |                                                      |
|            | lhs_i2            |                                                      |
|            | rhs_i1            |                                                      |
|            | rhs_i2            |                                                      |
| module     | name              | init(self)                                           |
|            | connection_string | initialize object, returns object handle             |
|            | connection_points |                                                      |
|            | regs              | $set\_lists(self)$                                   |
|            | nots              | set module environment lists                         |
|            | bufs              |                                                      |
|            | and2s             | $set\_connection\_points(self)$                      |
|            | or2s              | set the connection_points variable from the          |
|            | muxes             | connection_string                                    |
|            | selects           |                                                      |
|            | connects          |                                                      |
|            | inputs            |                                                      |
|            | outputs           |                                                      |
|            | dependencies      |                                                      |
|            | shifters          |                                                      |
|            | comparators       |                                                      |
|            | xor2s             |                                                      |
|            | multipliers       |                                                      |
|            | subtractors       |                                                      |
|            | b_shifters        |                                                      |
|            | adders            |                                                      |

|           | shift_adders<br>divisors<br>assigns |                                          |
|-----------|-------------------------------------|------------------------------------------|
| structure | id                                  | init(self)                               |
|           | children                            | initialize object, returns object handle |
|           | represented-                        |                                          |
|           | $\_object\_handle$                  | $add\_child(self, child)$                |
|           |                                     | add child structure to children          |
|           |                                     |                                          |
|           |                                     | print(self)                              |
|           |                                     | prints node structure                    |

# B Technical implementation of the liberty parser

In Figure B.1 the function hierarchy of parsing the Liberty file and storing the power data in JSON objects can be seen. In figure B.2 the function hierarchy of reading out the cells from the JSON lines and putting them in a cell library can be seen.

Table B.1 shows the functions of the system processing the data from the liberty file, extracting the relevant power information. Table B.2 shows a class overview of the liberty data retrieving system.



Figure B.1: The function hierarchy of the liberty parser



Figure B.2: The function hierarchy of the liberty power data retrieving

 Table B.1: functions for processing the liberty file information

| Function                    | Description                                                |  |
|-----------------------------|------------------------------------------------------------|--|
| set_cells_and_environment() | parses the liberty file and retrieves the cell library in- |  |
|                             | formation. Stores it in a JSON line. Also counts the       |  |
|                             | occurrence of each cell in a calibration netlist.          |  |
| get_cells(filename)         | Reads cell JSON lines from a file and returns a list of    |  |
|                             | these lines                                                |  |
| sort_cells(path)            | makes cell object from list of cells, makes a library ob-  |  |
|                             | ject containing the cells. Calibrates the library.         |  |
| $get_dict_N(N)$             | get dictionary with regexes and synthetic gate se-         |  |
|                             | quences corresponding to cells with N inputs               |  |
| count_ocurrence(word)       | count occurrence of word in calibration netlist            |  |

|                       | Procedures                                             | Variables           | Class        |
|-----------------------|--------------------------------------------------------|---------------------|--------------|
|                       | init(self, def_list, cell_lib)                         | footprint           | cell         |
| ıp to                 | initialise object adds it to cell group. Adds group to | name                |              |
|                       | cell library if new group is made                      | leakage_power       |              |
|                       |                                                        | synthetic_gate_list |              |
|                       |                                                        | def_list            |              |
|                       |                                                        | input_pins          |              |
|                       |                                                        | output_pins         |              |
|                       | init(self, sequence, matching_key)                     | matching_key        | cell_group   |
|                       | initialise object                                      | synthetic_gate_list |              |
|                       |                                                        | cells               |              |
|                       | append_cell(self, cell)                                | cellcounts          |              |
| $\operatorname{ints}$ | add cell to cells, and calibration count to cellcounts | weights             |              |
|                       |                                                        |                     |              |
|                       | get_weights()                                          |                     |              |
|                       | set weights based on cellcounts                        |                     |              |
|                       | init(self)                                             | cells_6             | cell_library |
|                       | initialise object                                      | cells_5             |              |
|                       |                                                        | cells_4             |              |
|                       | $get_list(self, N)$                                    | cells_3             |              |
|                       | get list corresponding to cells with N inputs          | cells_2             |              |
|                       |                                                        | cells_1             |              |
|                       | find_cell_group(self, list, group_key)                 | muxes               |              |
|                       | look for cell group with matching_key equal to         | regs                |              |
|                       | group_key                                              | group_lists         |              |
|                       |                                                        |                     |              |
|                       | set_group_weights(self)                                |                     |              |
|                       | call set_weights for all cell_groups in library        |                     |              |
|                       | print available cells(self)                            |                     |              |
|                       |                                                        |                     |              |
|                       | group_key<br>set_group_weights(self)                   | _                   |              |

 Table B.2: Class overview for processing the liberty file information

## C Technical implementation of the power model

Figure C.1 describes the function hierarchy of the power model generator and Table C.1 describes the functions in greater detail. Table C.2 describes the two classes in the power modeling system. The value class is used to make a mutable number, while the power\_structure class make up the nodes of a node tree.



Figure C.1: Function hierarchy for the power model implementation

| Function                           | Description                                         |
|------------------------------------|-----------------------------------------------------|
| go_through_structures()            | Iterate through structures from the parsed elab-    |
|                                    | orated SystemVerilog and create power structure     |
|                                    | equivalents                                         |
| run_parse_elab(filename)           | Called after importing the elaborated SystemVer-    |
|                                    | ilog parser. Runs the parser and returns the struc- |
|                                    | ture trees.                                         |
| lists_from_top(s, power_s)         | recursively goes through all nodes in structure     |
|                                    | tree from parse_elab(). Makes a parallell power     |
|                                    | structure tree.                                     |
| transform_list(cellLib, l)         | Transforms list of synthetic cells to list of cells |
|                                    | from cell library                                   |
| find_sequence(to_find, cell_group) | see if cell group's synthetic gate sequence has     |
|                                    | any match(es) in list. Return positions of oc-      |
|                                    | currence(s) if that is the case.                    |

Table C.1: functions for making the power model  $\$ 

Table C.2: Class overview for the power model

| Class           | Variables               | Procedures                     |
|-----------------|-------------------------|--------------------------------|
| power_structure | name                    | init(self, def_list, cell_lib) |
|                 | cell_lib_list           | initialise object              |
|                 | $structural\_rep\_list$ |                                |
|                 | children                |                                |
|                 | parent                  |                                |
| value           | i                       |                                |

## D Code implemented in Chapter 6

| 1 $\#$ File for parsing .elab files made by synopsys compiler | $138 \text{ not}_n = 0$                                     |
|---------------------------------------------------------------|-------------------------------------------------------------|
| 2                                                             | 39 $buf_n = 0$                                              |
| 3 import numpy as np                                          | 40 and 2_n = 0                                              |
| 4 import re                                                   | 41 or2_n = 0                                                |
| 5 import pandas as pd                                         | $42 \text{ mux_n} = 0$                                      |
| 6 import sys                                                  | 43 select $n = 0$                                           |
| 7                                                             | 44 $shift_n = 0$                                            |
| 8 #list of module objects                                     | 45 comp_n = 0                                               |
| 9 modules = []                                                | $46 \operatorname{xor2} n = 0$                              |
| 10 #list of structure trees                                   | 47 mult_n = 0                                               |
| 11 top_level_parents = []                                     | 48 $sub_n = 0$                                              |
| 12                                                            | 49 $b_shift_n = 0$                                          |
| 13 #lists setting module environment                          | 50 add_n = 0                                                |
| 14 regs $= np.array([])$                                      | 51 $shift_add_n = 0$                                        |
| 15 nots $= np. array([])$                                     | 52 div_n = 0                                                |
| 16 bufs = np.array([])                                        | 53                                                          |
| 17 and 2s $=$ np. array ([])                                  | 54 $\#$ empty global lists relating to a module environment |
| 18 or 2s = np. array ([])                                     | 55 def empty_global_lists():                                |
| 19 muxes $= np. array([])$                                    | 56 global regs                                              |
| 20 selects $= np. array([])$                                  | 57 global nots                                              |
| 21 connects $= np. array([])$                                 | 58 global bufs                                              |
| 22 inputs $= np. array([])$                                   | 59 global and2s                                             |
| 23 outputs $= np. array([])$                                  | 60 global or2s                                              |
| 24 dependencies = np.array([])                                | 61 global muxes                                             |
| 25 shifters = np.array([])                                    | 62 global selects                                           |
| 26 comparators $= np. array([])$                              | 63 global connects                                          |
| 27  xor 2s = np.array([])                                     | 64 global inputs                                            |
| 28 multipliers = np.array([])                                 | 65 global outputs                                           |
| 29 subtractors = np.array([])                                 | 66 global dependencies                                      |
| 30 b_shifters = np.array([])                                  | 67 global shifters                                          |
| 31 adders $= np. array([])$                                   | 68 global comparators                                       |
| 32 shift_adders = np.array([])                                | 69 global xor2s                                             |
| 33 divisors = np.array([])                                    | 70 global multipliers                                       |
| 34  assigns = np.array([])                                    | 71 global subtractors                                       |
| 35                                                            | 72 global b_shifters                                        |
| 36 #gate counts                                               | 73 global adders                                            |
| 37 reg_n = 0                                                  | 74 global shift_adders                                      |
|                                                               |                                                             |

| 75       | global divisors                                    |
|----------|----------------------------------------------------|
| 76       | global assigns                                     |
| 77       |                                                    |
| 78       | regs = np.array([])                                |
| 79       | nots = np.array([])                                |
| 80       | bufs $= np.array([])$                              |
| 81       | and2s = np.array([])                               |
| 82       | or2s = np.array([])                                |
| 83       | muxes = np.array([])                               |
| 84       | selects = np.array([])                             |
| 85       | connects = np.array([])                            |
| 86       | inputs = np.array([])                              |
| 87       | outputs = np.array([])                             |
| 88       | dependencies = np.array([])                        |
| 89       | shifters = np. array([])                           |
| 90       | comparators = np. array([])                        |
| 91       | xor2s = np. array([])                              |
| 92       | multipliers = np. array([])                        |
| 93       | subtractors = np.array([])                         |
| 94       | $b_{shifters} = np. array([])$                     |
| 95       | adders = np. array ([])                            |
| 96       | shift_adders = np.array([])                        |
| 97       | divisors = np.array([])<br>assigns = np.array([])  |
| 98<br>99 | assigns – np. anay ([])                            |
|          | #set global lists relating to a module environment |
| 101      | def set_global_lists(module):                      |
| 102      | global regs                                        |
| 103      | global nots                                        |
| 104      | global bufs                                        |
| 105      | global and2s                                       |
| 106      | global or2s                                        |
| 107      | global muxes                                       |
| 108      | global selects                                     |
| 109      | global connects                                    |
| 110      | global inputs                                      |
| 111      | global outputs                                     |
| 112      | global dependencies                                |
| 113      | global shifters                                    |
| 114      | global comparators                                 |
| 115      | global xor2s                                       |
| 116      | global multipliers                                 |
| 117      | global subtractors                                 |
| 118      | global b_shifters                                  |
| 119      | global adders                                      |
|          |                                                    |

| 120        | <pre>global shift_adders</pre>                                          |                                     |  |  |  |  |
|------------|-------------------------------------------------------------------------|-------------------------------------|--|--|--|--|
| 121        | global divisors                                                         |                                     |  |  |  |  |
| 122        | global assigns                                                          |                                     |  |  |  |  |
| 123        |                                                                         |                                     |  |  |  |  |
| 124        | regs                                                                    | = module.regs                       |  |  |  |  |
| 125        | nots                                                                    | = module.nots                       |  |  |  |  |
| 126        | bufs                                                                    | = module.bufs                       |  |  |  |  |
| 127        | and 2s                                                                  | = module.and2s                      |  |  |  |  |
| 128        | or2s                                                                    | = module.or2s                       |  |  |  |  |
| 129        | muxes                                                                   | = module.muxes                      |  |  |  |  |
| 130        | selects                                                                 | = module.selects                    |  |  |  |  |
| 131        | connects                                                                | = module.connects                   |  |  |  |  |
| 132        | inputs                                                                  | = module.inputs                     |  |  |  |  |
| 133        | outputs                                                                 | = module.outputs                    |  |  |  |  |
| 134        | dependencies                                                            | = module.dependencies               |  |  |  |  |
| 135        | shifters                                                                | = module.shifters                   |  |  |  |  |
| 136        | comparators                                                             | = module.comparators                |  |  |  |  |
| 137        | xor2s                                                                   | = module.xor2s                      |  |  |  |  |
| 138        | multipliers                                                             | = module.multipliers                |  |  |  |  |
| 139        | subtractors                                                             | = module.subtractors                |  |  |  |  |
| 140        | b_shifters                                                              | = module.b_shifters                 |  |  |  |  |
| 141        | adders                                                                  | = module.adders                     |  |  |  |  |
| 142        | $shift\_adders$                                                         | $=$ module.shift_adders             |  |  |  |  |
| 143        | divisors                                                                | = module.divisors                   |  |  |  |  |
| 144        | assigns                                                                 | = module.assigns                    |  |  |  |  |
| 145        |                                                                         |                                     |  |  |  |  |
|            |                                                                         | ted systemverilog file line by line |  |  |  |  |
| 147        | def parse_file(path                                                     |                                     |  |  |  |  |
| 148        |                                                                         | treat lists to make objs later      |  |  |  |  |
| 149        | print ("Parsing                                                         | file:" + path )                     |  |  |  |  |
| 150        |                                                                         |                                     |  |  |  |  |
| 151        | #SEARCH FOR STAL                                                        |                                     |  |  |  |  |
| 152        | object_handle                                                           |                                     |  |  |  |  |
| 153        | objectstring<br>bitwidth = 0                                            | = ""                                |  |  |  |  |
| 154        | modulename =                                                            |                                     |  |  |  |  |
| 155        | $in_module = Fal$                                                       |                                     |  |  |  |  |
| 156        | m_moduleline = ''                                                       | se                                  |  |  |  |  |
| 157        |                                                                         | (n) no cufilo.                      |  |  |  |  |
| 158<br>159 | <pre>with open(path, 'r') as svfile:<br/>line = svfile.readline()</pre> |                                     |  |  |  |  |
| 160        | line $=$ $3 \times 11$                                                  | le l'eadline ()                     |  |  |  |  |
| 161        | while line:                                                             |                                     |  |  |  |  |
| 161        |                                                                         | for start of module                 |  |  |  |  |
| 162        |                                                                         | module):                            |  |  |  |  |
| 164        | · _                                                                     | ndle end of module                  |  |  |  |  |
| 104        | #11a                                                                    | naro onu or mouuro                  |  |  |  |  |

| 165     | if (find_endmodule(line)):                        | 204 | if match.group $(1) = 'input' or match.group(1)$   |
|---------|---------------------------------------------------|-----|----------------------------------------------------|
| 166     | global modules                                    |     | = 'output' or match.group(1) = 'wire':             |
| 167     | $in\_module = False$                              | 205 |                                                    |
| 168     | $module_handle = module(modulename)$              | 206 | if match.group $(2)$ != None:                      |
| 169     | <pre>module_handle.set_lists()</pre>              | 207 | bitwidth = int(match.group(2)) - int(              |
| 170     | $module\_handle.connection\_point\_string =$      |     | $\mathrm{match}.\mathrm{group}(3))~+~1$            |
|         | moduleline                                        | 208 | line = " ".join(line.split()[2:])                  |
| 171     | <pre>connect_assigns()</pre>                      | 209 | offset = int(match.group(3))                       |
| 172     | <pre>empty_global_lists()</pre>                   | 210 | else:                                              |
| 173     | modules.append(module_handle)                     | 211 | bitwidth = None                                    |
| 174     | #search for start of object                       | 212 | line = " ".join(line.split()[1:])                  |
| 175     | else:                                             | 213 | #else prepare for finding attributes of single     |
| 176     | <pre>key, match = parse_line(line, 'false')</pre> |     | object                                             |
| 177     | else:                                             | 214 | else:                                              |
| 178     | #search for start of module                       | 215 | # object handle = make object (match.group(1))     |
| 179     | $modulename$ , $in_module = find_module(line)$    |     | key)                                               |
| 180     | #print("Started module: "+modulename)             | 216 | if (key == 'dep'):                                 |
| 181     | match = False                                     | 217 | object_handle = make_object(match.group            |
| 182     | $module\_declaration\_ongoing = True$             |     | (2), key)                                          |
| 183     | #get lines until end of module declaration and    | 218 | object_handle.modulename = match.group             |
|         | catch things in regex                             |     | (1)                                                |
| 184     | while module_declaration_ongoing:                 | 219 | #print("made dep: "+match.group(2)+" of            |
| D-3 185 | <pre>for k, rx in rx_dict_end.items():</pre>      |     | module "+match.group(1))                           |
| 186     | objectstring = objectstring+line                  | 220 | else:                                              |
| 187     | <pre>#print("objectstring to search for</pre>     | 221 | object_handle = make_object(match.group            |
|         | end: $\n\t"+objectstring$ )                       |     | (1), key)                                          |
| 188     | match = rx.search(line)                           | 222 | # while taking in list of equally attributed       |
| 189     | if match:                                         |     | connections or single item                         |
| 190     | # found end, leave while loop                     | 223 | # get new lines until all of object(s) is in one   |
| 191     | $module\_declaration\_ongoing =$                  |     | string                                             |
|         | False                                             | 224 | $in\_object = True$                                |
| 192     | moduleline = objectstring                         | 225 | while in object:                                   |
| 193     | #print(moduleline)                                | 226 | #merge lines until end of regster is found         |
| 194     | else:                                             | 227 | <pre>#print ("looking for end of object")</pre>    |
| 195     | # read new line until end is                      | 228 | <pre>for k, rx in rx_dict_end.items():</pre>       |
|         | found                                             | 229 | objectstring = objectstring+line                   |
| 196     | line = svfile.readline()                          | 230 | <pre>#print("objectstring to search for end:</pre> |
| 197     | linenum = linenum + 1                             |     | (n t''+objectstring)                               |
| 198     | objectstring = ''                                 | 231 | match = rx.search(line)                            |
| 199     | match = False                                     | 232 | if match:                                          |
| 200     |                                                   | 233 | # found end, leave while loop                      |
| 201     | if match and in_module:                           | 234 | $in\_object = False$                               |
| 202     | #if connection split items by comma and make      | 235 | else:                                              |
|         | objects of all                                    | 236 | # read new line until end is found                 |
| 203     | offset = 0                                        | 237 | line = svfile.readline()                           |

| 238                 | linenum = linenum + 1                                                 | 276 | $object\_handle$ , foundbool = $search\_list(inputs,$         |
|---------------------|-----------------------------------------------------------------------|-----|---------------------------------------------------------------|
| 239                 |                                                                       |     | object_name)                                                  |
| 240                 | #make single object or make several connect                           | 277 | <pre>elif name == 'output':</pre>                             |
|                     | objects                                                               | 278 | $object_handle$ , foundbool = search_list(outputs,            |
| 241                 | objectstring = "".join(objectstring.split())                          |     | object_name)                                                  |
| 242                 | if (key == 'input' or key == 'output' or key ==                       | 279 | elif name == 'wire':                                          |
|                     | <pre>'wire '):</pre>                                                  | 280 | object_handle, foundbool = search_list(connects,              |
| 243                 | <pre>objectstring = objectstring.strip(";")</pre>                     |     | object_name)                                                  |
| 244                 |                                                                       | 281 | if (foundbool):                                               |
| 245                 | objectstring = objectstring.translate({ord(i                          | 282 | <pre>#print("Found object in already existing connetion</pre> |
|                     | ): None for i in '}{ '})                                              |     | object")                                                      |
| 246                 |                                                                       | 283 | if (object_handle.width <= i1): object_handle.width           |
| 247                 | <pre>objectlist = objectstring.split(',')</pre>                       |     | = i1+1                                                        |
| 248                 | for name in objectlist:                                               | 284 | if (object_handle.depth <= i2): object_handle.depth           |
| 249                 | object_handle = make_object(name, key)                                |     | = i2+1                                                        |
| 250                 | #if (object_handle != None):                                          | 285 |                                                               |
| 251                 | if (bitwidth != None):                                                | 286 | else:                                                         |
| 252                 | object handle.width = bitwidth                                        | 287 | <pre>object_handle = create_object(name)</pre>                |
| 253                 | object_handle.widthoffset = offset                                    | 288 | object handle.name = $object$ name                            |
| 254                 | object_handle.init_connection_nodes                                   | 289 | if i1 $!= -1$ : object handle.width = i1+1                    |
|                     | ()                                                                    | 290 | if i2 $!= -1$ : object_handle.depth = i2+1                    |
| 255                 | #print ("made new connection object with                              | 291 | object_handle.init_connection_nodes()                         |
|                     | name "+name+"\nand width "+ str(bitwidth))                            | 292 | else:                                                         |
| 256                 | else:                                                                 | 293 | <pre>object_handle = create_object(name)</pre>                |
| 257                 | # add connection node info to object                                  | 294 | object_handle.name = line                                     |
| 258                 | <pre>parse_line(objectstring, object_handle)</pre>                    | 295 | if (name == 'input' or name == 'output' or name == 'wire      |
| 259                 | # when done with an object, empty object string                       |     | ):                                                            |
| 260                 | objectstring = ""                                                     | 296 | object_handle.init_connection_nodes()                         |
| 261                 |                                                                       | 297 | if(name = 'assign'):                                          |
| 262                 | line = svfile.readline()                                              | 298 | <pre>#print("making fancy assign")</pre>                      |
| 263                 | linenum = linenum + 1                                                 | 299 | object_handle.i1 = i1                                         |
| 264                 |                                                                       | 300 | if i2 $!= -1$ : object handle.i2 = i2                         |
| 265 # create object |                                                                       | 301 | foundbool = False                                             |
| 266                 | def make_object(line, name):                                          | 302 | connected handle, foundbool = search list(outputs,            |
| 267                 | <pre>#print("Making object: "+name)</pre>                             |     | object name)                                                  |
| 268                 |                                                                       | 303 | if foundbool != True:                                         |
| 269                 | <pre>line = line.translate({ord(i): None for i in '}{\ '})</pre>      | 304 | connected_handle, foundbool = search_list(input               |
| 270                 |                                                                       |     | , object name)                                                |
| 271                 | i1, i2, object_name, index_type = find_indexes(line)                  | 305 | if foundbool != True:                                         |
| 272                 | if $((i1 != -1) \text{ or } (i2 != -1))$ and name $!= 'register'$ and | 306 | connected_handle, foundbool = search_list(                    |
|                     | name != 'assign':                                                     |     | connects, object name)                                        |
| 273                 | object handle = None                                                  | 307 | if foundbool:                                                 |
| 274                 | foundbool = False                                                     | 308 | object handle.lhs = connected handle                          |
| 275                 | if name == 'input':                                                   | 309 |                                                               |
|                     |                                                                       | 310 | return object handle                                          |
|                     |                                                                       |     |                                                               |

```
311
312  #looks for indexes at end of string, returns i1, i2, str(w/o)
        indexes
313 def find_indexes(string):
        i1 = -1
314
        i2 = -1
        index type = ''
        indexes = re.compile(r"(?: \setminus [(\setminus d\{1,4\}) \setminus ])(?: \setminus [(\setminus d\{1,4\}) \setminus ])?$"
317
        slices = re.compile(r"(?: \setminus [(\setminus d\{1,4\}) \setminus :(\setminus d\{1,4\}) \setminus ])$")
318
        indexfind = indexes.search(string)
319
        newline = indexes.sub("", string)
        if indexfind != None:
             index_type = 'index'
322
             if indexfind.group(1) != None:
                 i1 = int(indexfind.group(1))
                 if indexfind.group (2) != None:
                      i2 = int(indexfind.group(2))
326
        else:
             slicefind = slices.search(string)
             if slicefind != None:
329
                 index type = 'slice'
                 i1 = int(slicefind.group(1))
331
                 i2 = int(slicefind.group(2))
332
                 newline = slices.sub("", string)
333
                 #print("found a slice")
334
        return i1, i2, newline, index type
335
336
337 #create an object of a class specified by objectname
   def create object(objectname):
338
        objectselect = \{
339
             'wire'
340
                           : connection,
             'input'
                           : input_obj,
341
             'output'
                           : output obj,
342
             'SELECT OP' : select op,
343
             'MUX OP'
                           : mux_op,
344
             'GTECH NOT' : gtech not,
             'GTECH BUF' : gtech buf,
             'GTECH AND2': gtech and2,
347
             'GTECH OR2' : gtech or2,
348
             'GTECH_XOR2': gtech_xor2,
             'register'
                          : register,
             'dep'
                           : dependency,
351
             'COMP OP'
                           : comp op,
             'SHIFT OP'
                           : shift op,
```

```
'SUB OP'
354
                         : sub_op,
            'ADD OP'
                         : add op,
355
            'MULT OP'
356
                         : mult op,
            'DIV_OP'
                         : div_op,
357
            'B SHIFT OP'
                            : b shift op,
358
            'SHIFT ADD OP': shift add op,
359
360
            'DIV OP'
                         : div op,
            'assign'
                         : assign
361
362
363
       #get function
364
       #print("making object: "+objectname)
365
        func = objectselect.get(objectname, lambda: None)
366
        if func == None:
367
            print ("found no object with objectname: "+str (objectname
368
       ))
            return None
369
        else:
370
            \#print("lookup successful, func = "+ str(func))
371
            retval = func()
372
373
            return retval
374
375 #return true if module declaration is on line
376 def find module(line):
        module_start = re.compile(r"module \s+(\S+) \s?\(")
377
        match = module_start.search(line)
378
        if (match):
379
380
            return match.group(1), True
        else:
381
            return None, False
382
383
384 #return true if line contains endmodule
   def find_endmodule(line):
385
       module end = re.compile(r"endmodule")
386
       match = module end.search(line)
387
        if (match):
388
            return True
389
390
        else:
            return False
391
392
393 #look for object with name objectname in a list of objects.
        return handle if match, None otherwise
394 def find object(objectlist, objectname):
        for i in range (0, len(objectlist)-1):
```

if objectlist [i].name == objectname:

```
Ч
Ч
```

```
return objectlist[i]
397
       return None
398
399
     parse one line, looking for start of object or internal
400 #
       parameters of object
401 def parse line(line, object handle):
       \# print ("parsing line: n"+ line + "n in object:n" + str(
402
       object handle))
       kev = ""
403
       match = ""
404
       if (object handle = 'false'):
405
           for key, rx in rx dict start.items():
406
                #look for start of object to determine object type
407
                match = rx.search(line)
408
                if match:
409
                    return key, match
410
       #look for connect objects in non-connect objects?
411
       elif (object handle.id = 'reg'):
412
           #look for objects inside register dict and end
413
           for key, rx in rx dict reg.items():
414
                match = rx.search(line)
415
               \#print("line: " + line)
416
               \#print ("found match for: " + key +" group captured:
417
        " + match.group(1))
                if (key = 'clear'):
418
                    process match ([match.group(1)], object handle, '
419
       control', key)
                    object handle.clear = match.group(1)
420
                elif (key = 'preset'):
421
                    process match ([match.group(1)], object handle, '
422
       control', key)
423
                    object handle.preset = match.group(1)
                elif (key == 'next state'):
424
                    process match ([match.group(1)], object handle,
425
       control', key)
                    object handle.next state = match.group(1)
426
                elif (key == 'clocked on'):
427
                    process match ([match.group(1)], object handle,
428
       control', key)
                    object handle.clocked on = match.group(1)
429
                elif (key == 'data in'):
430
                    process match ([match.group(1)], object handle,
431
       input', key)
                    object handle.data in = match.group(1)
432
                elif (key == 'enable'):
433
```

```
process_match([match.group(1)], object_handle, '
control', key)
            object handle.enable = match.group(1)
        elif (key = 'Q'):
            process match ([match.group(1)], object handle,
output', key)
            object handle.Q = match.group(1)
        elif (key = 'QN'):
            process match ([match.group(1)], object handle,
output', key)
            object handle.QN = match.group(1)
        elif (key = 'synch clear'):
            process match ([match.group(1)], object handle,
control', key)
            object handle.synch clear = match.group(1)
        elif (key == 'synch preset'):
            process match ([match.group(1)], object handle,
control', key)
            object handle.synch preset = match.group(1)
        elif (key = 'synch toggle'):
            process match ([match.group(1)], object handle,
control', key)
            object handle.synch toggle = match.group(1)
        elif key == 'synch enable':
            process match ([match.group(1)], object handle,
control', key)
            object handle.synch enable = match.group(1)
        else:
            print ("no matching key: " + key + " in register
 " + object handle)
    #look for end
    for key, rx in rx dict end.items():
        match = rx.search(line)
elif object handle.id == 'gtech or2' or object handle.id ==
'gtech and2' or object handle.id == 'gtech xor2':
    for key, rx in rx_dict_AND2.items():
        match = rx.search(line)
        if (key = 'A' and match):
            process match ([match.group(1)], object handle,
input', key)
            object handle.A = match.group(1)
        elif (key == 'B' and match):
            process match ([match.group(1)], object handle, '
input', key)
            object handle.B = match.group(1)
```

435

436

437

438

439

440

441

442

443

444

445

446

447

448

449

450

451

452

453

454

455

456

457

458

459

460

461

462

463

464

465

466

467

D-C

```
elif (key = 'Z' and match):
468
                    process match ([match.group(1)], object handle,
469
       output', key)
                    object handle.Z = match.group(1)
470
                else:
471
                    print ("\nNo attribute of object " + str(
472
       object handle) + " matches key " + key)
                    print("from line: "+line+"\n")
473
           for key, rx in rx_dict_end.items():
474
                match = rx.search(line)
475
                if (match == False):
476
                    print ("looked for end in "+ object handle + "
477
       could not find it ... " )
       elif object handle.id == 'gtech not' or object handle.id ==
478
       'gtech buf':
           for key, rx in rx_dict_BUF.items():
479
                match = rx.search(line)
480
                if (key == 'A' and match):
481
                    process_match([match.group(1)], object_handle, '
482
       input', key)
                    object handle A = match.group(1)
483
                elif (key = 'Z' and match):
484
                    process_match([match.group(1)], object_handle,
485
       output', key)
486
                    object handle.Z = match.group(1)
                else:
487
                    print ("No attribute of object " + str(
488
       object handle) + " matches key " + key)
               #look for end
489
           for key, rx in rx_dict_end.items():
490
                match = rx.search(line)
491
                if (match == False):
492
                    print ("looked for end in "+ str(object_handle)
493
       + " could not find it...")
                    return key, False
494
495
           return key, match
       elif object handle.id == 'mux op':
496
            for key, rx in rx_dict_MUX.items():
497
                match = rx.findall(line)
498
                if (key = 'D' and match):
499
                    dataN, datawidth, matchlist = process_match(
500
       match, object handle, 'input', key)
501
                    object handle.datawidth = datawidth
                    object handle.D = matchlist
```

| <pre>for i in range(0,len(match)):</pre>                               |  |
|------------------------------------------------------------------------|--|
| object_handle.D[i] = matchlist                                         |  |
| elif $(key = 'S' and match):$                                          |  |
| process_match(match, object_handle, 'control',                         |  |
| key)                                                                   |  |
| #number of selects is length of match, only one                        |  |
| bit widths.                                                            |  |
| object_handle.S = np.append(object_handle.S,                           |  |
|                                                                        |  |
| match)                                                                 |  |
| #if match is list of matches to key                                    |  |
| elif (key $=$ 'Z' and match):                                          |  |
| <pre>#print( "Added Z to mux")</pre>                                   |  |
| <pre>process_match(match, object_handle, 'output',</pre>               |  |
| key)                                                                   |  |
| #will have same datawidth as D.                                        |  |
| $object_handle.Z = match$                                              |  |
|                                                                        |  |
| else:                                                                  |  |
| <pre>print("did not find attributes of object: "+</pre>                |  |
| object_handle.name)                                                    |  |
| <pre>elif object_handle.id == 'select_op':</pre>                       |  |
| for key, rx in rx_dict_SELECT.items():                                 |  |
| #returning everything matching given key in a list                     |  |
| match = rx.findall(line)                                               |  |
| if (key == 'DATA' and match):                                          |  |
| $dataN, datawidth, matchlist = process_match($                         |  |
| match, object_handle, 'input', key)                                    |  |
| object handle.datawidth = datawidth                                    |  |
| object handle. $D = matchlist$                                         |  |
| elif (key = 'CONTROL' and match):                                      |  |
| dataN, datawidth, matchlist = process_match(                           |  |
| match, object_handle, 'control', key)                                  |  |
| object handle. datawidth = datawidth                                   |  |
| elif (key = $ Z $ and match):                                          |  |
|                                                                        |  |
| dataN, datawidth, matchlist = process_match(                           |  |
| match, object_handle, 'output', key)                                   |  |
| $object_handle.Z = matchlist$                                          |  |
| else:                                                                  |  |
| <pre>print("did not find attributes of object: "+</pre>                |  |
| object_handle.name)                                                    |  |
| <pre>elif object_handle.id == 'comp_op' or object_handle.id == '</pre> |  |
| add_op' or object_handle.id == 'sub_op' or object_handle.id            |  |
| = "mult_op" or object_handle.id == "div_op":                           |  |
| <pre>for key, rx in rx_dict_SUB_ADD_MULT.items():</pre>                |  |
| match = rx.findall(line)                                               |  |
|                                                                        |  |

505

506

507

508

509

510

511

512

514

516

517

518

519

520

521

523

526

527

528

530

531

534

535

536

537

| 538        | if $(key = 'A' and match)$ :                               |
|------------|------------------------------------------------------------|
| 539        | $dataN, datawidth, matchlist = process_match($             |
|            | <pre>match, object_handle, 'input', key)</pre>             |
| 540        | $object_handle.a_width = datawidth$                        |
| 541        | $object_handle.A = matchlist$                              |
| 542        | elif (key == 'B' and match):                               |
| 543        | $dataN,\;\;datawidth,\;\;matchlist\;=\;process\_match($    |
|            | <pre>match, object_handle, 'input', key)</pre>             |
| 544        | $object\_handle.b\_width = datawidth$                      |
| 545        | $object_handle.B = matchlist$                              |
| 546        | elif (key = $'Z'$ and match):                              |
| 547        | $dataN, datawidth, matchlist = process_match($             |
|            | <pre>match, object_handle, 'output', key)</pre>            |
| 548        | $object\_handle.z\_width = datawidth$                      |
| 549        | $object_handle.Z = matchlist$                              |
| 550        | else:                                                      |
| 551        | <pre>print("did not find attributes of object: "+</pre>    |
|            | object_handle.name)                                        |
| 552        | elif object_handle.id = 'shift_op' or object_handle.id = ' |
|            | b_shift_op':                                               |
| 553        | <pre>for key, rx in rx_dict_shift.items():</pre>           |
| 554        | match = rx.findall(line)                                   |
| 555        | if $(key = 'A' and match)$ :                               |
| 556        | $dataN, datawidth, matchlist = process_match($             |
|            | match, object_handle, 'input', key)                        |
| 557        | $object_handle.a_width = datawidth$                        |
| 558        | object_handle.A = matchlist                                |
| 559        | elif (key == 'SH' and match):                              |
| 560        | $dataN$ , $datawidth$ , $matchlist = process_match($       |
|            | match, object_handle, 'control', key)                      |
| 561        | object_handle.sh_width = datawidth                         |
| 562        | object_handle.SH = matchlist                               |
| 563        | elif (key = $'Z'$ and match):                              |
| 564        | $dataN$ , $datawidth$ , $matchlist = process_match($       |
| 505        | match, object_handle, 'output', key)                       |
| 565        | object_handle.z_width = datawidth                          |
| 566        | object_handle.Z = matchlist                                |
| 567        | else:<br>print("did not find attributes of object: "+      |
| 568        |                                                            |
| 560        | object_handle.name)<br>elif(object_handle.id == 'dep'):    |
| 569<br>570 | #look for dep objects and add them to list                 |
| 570        | for key, rx in rx_dict_dep_internals.items():              |
| 572        | match = rx.findall(line)                                   |
| 573        | if match:                                                  |
|            |                                                            |

```
object_handle.add_connections(match)
       elif(object handle.id == 'assign'):
           #print("Assign statement")
           for key, rx in rx_dict_assign.items():
               match = rx.search(line)
               if key == 'rhs':
                    rhsline = match.group(1)
                    rhsline = rhsline.translate({ord(i): None for i
       in '}{\ '})
                    if rhsline = "1'b0" or rhsline = "1'b1":
                        object handle.rhs = "constant"
                   else:
                        i1 , i2 , new_rhsline , indextype =
       find indexes (rhsline)
                       object handle.rhs i1 = i1
                       #look for new_rhsline in connections.
                       if (i2 != -1): object handle.rhs i2 = i2
                       element, foundbool = search_list(outputs,
       new rhsline)
                        if foundbool == False:
                            element, foundbool = search_list(inputs,
        new_rhsline)
                            if foundbool == False:
                                element, foundbool = search_list(
       connects, new_rhsline)
                        if foundbool:
                            object handle.rhs = element
                       else:
                            print ("Did not find match of rhs in
       assign")
                            print(new_rhsline)
600
       else:
           print (" No match found for object handle id: "+
       object _ handle.id)
       return key, match
603
605 #if signal is not constant, find out what it is connected to and
        the width and register connection
606 def process_match(match, object_handle, connection_type,
       port name):
       #print("Running process match for object " + str(
       object handle.name))
       dataN = len(match)
608
```

575

576

578

580

581

582

583

584

585

586

587

588 589

590

591

592

593

594

596

597

598599

601

602

604

607

```
processed matchlist = []
609
       if match[0] == "":
            return 0, 0, []
611
       for i in range(0, len(match)):
612
           match[i] = match[i].translate({ord(i): None for i in '
613
       \{ \ '\}
            datawidth = match[i].count(',')+1
614
            matchlist = match[i].split(',')
615
            processed matchlist.append(matchlist)
           datawidth set = { 'sub op', 'select op', 'mux op', '
617
       shift op', 'add op', 'mult op', 'comp op', 'div op', '
       b shift op', 'shift add op'}
            if (connection type == 'output'):
618
                if (object handle.id in datawidth set):
619
                    if (object handle.output nodes ==[]):
                        #declare output nodes
                        if object handle.id == 'mux op' or
       object handle.id == 'select op':
                             object handle.output nodes = [None]*
623
       datawidth
624
                        else:
                             object handle.output nodes = [None]*
       datawidth
           j increment = 0
626
            for j in range(0, len(matchlist)): #j is i1
627
                if matchlist [j] = (1 \setminus b) or matchlist [j] = (1 \setminus b)
628
       1
                    datawidth = 1
                else:
630
                    found = False
                    if (port name = 'dep'):
632
                        to append = [object handle, port name,
633
       object_handle.id, connection_type, j+j_increment, i] #0]
                    else:
634
                        to append = [object handle, port name,
635
       object_handle.id , connection_type , j+j_increment]
                    \#print("j = "+str(j))
636
                    if (connection type == 'input' or
637
       connection type == 'control'): #or connection type == '
       control'):
                        i1, i2, matchobj, index type = find indexes(
638
       matchlist [j])
                        element, found = search list (inputs,
639
       matchobj)
```

```
if found:
                    element.add node input connection(i1, i2
, to append, index type)
                else:
                    element, found = search list (connects,
matchobj)
                    if found:
                         element.add node input connection(i1
, i2, to_append, index_type)
                    else:
                         element, found = search list(outputs
, matchobj)
                         if found:
                             element.
add node input connection(i1, i2, to append, index type)
                         else:
                             print("Did not find: "+matchobj)
                             print("["+str(object handle.name
+", "+str(port name)+", "+str(object handle.id)+"]")
                if found:
                    if index type = '':
                        #whole signal width-1 added to j
                        j increment = j increment + element.
width -1
                    elif index type == 'slice':
                        #add width of slice to j'
                        j increment = j increment + i1-i2
                    #print("j increment: "+str(j increment))
            elif (connection type == 'output'):
                i1, i2, matchobj, index type = find indexes(
matchlist [j])
                element, found = search list(outputs,
matchobj)
                if found:
                    if (i1 = -1 \text{ and } i2 = -1):
                        #print("Element width: "+str(element
.width))
                         if element.width > 1 and (element.
width != len(object handle.output nodes)):
                        #redefine output
                             if (object handle.id in
datawidth set):
```

642

643

644

645

646

647

648

649

650

651

652

654

655

656

657

658

659

660

661

662

663

664

665

666

667

668

669

670

671

672

```
Ū-
```

| 673 | for i in range(element.width                                      | 702 | <pre>#print("Modified j_increment")</pre>                      |
|-----|-------------------------------------------------------------------|-----|----------------------------------------------------------------|
|     | -1):                                                              | 703 | $j\_increment = j\_increment + element$                        |
| 674 | $object_handle$ .                                                 |     | width -1                                                       |
|     | output_nodes.append(None)                                         | 704 | <pre>elif index_type == 'slice':</pre>                         |
| 675 | $\# print(len(object_handle.$                                     | 705 | #add width of slice to j'                                      |
|     | <pre>output_nodes))</pre>                                         | 706 | <pre>#print("Modified j_increment")</pre>                      |
| 676 | if element.width == 1 and index_type ==                           | 707 | $j\_increment = j\_increment+ i1-i2$                           |
|     | 11:<br>                                                           | 708 | <pre>#print("j_increment: "+str(j_increment)</pre>             |
| 677 | <pre>index_type = 'bit'</pre>                                     | 709 | else:                                                          |
| 678 | element.add_node_output_connection(i1,                            | 710 | $print("Did not find \ t"+str(matchlist[j])+")$                |
|     | $i2$ , to_append, index_type, j+j_increment)                      |     | tANYWHERE")                                                    |
| 679 | else:                                                             | 711 |                                                                |
| 680 | $element$ , found = $search_list$ (connects,                      | 712 | $\#print("\nFinished matchlist for: "+object_handle.name+" \$  |
|     | matchobj)                                                         |     | nwidth: "+str(datawidth)+"(nN:"+str(dataN)+"(nMatchlist:"))    |
| 681 | if found:                                                         |     | $str(processed\_matchlist)+"\n")$                              |
| 682 | if $(i1 = -1 \text{ and } i2 = -1)$ :                             | 713 | return dataN, datawidth, processed_matchlist                   |
| 683 | <pre>#print("Element width: "+str(</pre>                          | 714 |                                                                |
|     | element.width))                                                   | 715 | #return object in list with name matching searchstring or None |
| 684 | ${ m if}~{ m element.width}>1~{ m and}$ (                         |     | False                                                          |
|     | element.width != len(object_handle.output_nodes)):                | 716 | def search_list(list, searchstring):                           |
| 685 | #redefine output                                                  | 717 | element = None                                                 |
| 686 | if (object_handle.id in                                           | 718 | for element in list:                                           |
|     | datawidth_set):                                                   | 719 | if (element.name == searchstring):                             |
| 687 | for i in range(element.                                           | 720 | <pre>#if searchstring[0] == 'e': print("FOUND: "+</pre>        |
|     | $\operatorname{width} -1):$                                       |     | searchstring)                                                  |
| 688 | $object\_handle$ .                                                | 721 | return element, True                                           |
|     | $output\_nodes.append(None)$                                      | 722 | return element, False                                          |
| 689 |                                                                   | 723 |                                                                |
| 690 | if element.width == 1 and index_type                              | 724 | #set nodes involved in assign statements equal to each other   |
|     | = '':                                                             | 725 | <pre>def connect_assigns():</pre>                              |
| 691 | $index_type = 'bit'$                                              | 726 | # print("RUNNING CONNECT ASSIGNS")                             |
| 692 | # print("calling)                                                 | 727 | <pre>#print(assigns)</pre>                                     |
|     | $add\_node\_output\_connection i1="+str(i1)+" i2: "+str(i2)+" j:$ | 728 | for a in assigns:                                              |
|     | "+str(j))                                                         | 729 | # print(a.name)                                                |
| 693 | $element.add\_node\_output\_connection($                          | 730 | #print(a.lhs.name+""+str(a.lhs))                               |
|     | $i1$ , $i2$ , to_append, index_type, $j+j\_increment$ )           | 731 | <pre>#print(a.rhs)</pre>                                       |
| 694 |                                                                   | 732 | #print(a.i1)                                                   |
| 695 | else:                                                             | 733 | # print(a.i2)                                                  |
| 696 | <pre>print("Did not find: "+matchobj)</pre>                       | 734 | if(a.i1 != -1):                                                |
| 697 | $print(str(object_handle)+""+str($                                | 735 | $\#print(a.lhs.connection_nodes)$                              |
|     | <pre>port_name)+" "+str(object_handle.id))</pre>                  | 736 | $lhs_node = a.lhs.connection_nodes[a.i1-a.lhs.$                |
| 698 | if found:                                                         |     | widthoffset][a.i2]                                             |
| 699 | # print("j = "+str(j))                                            | 737 | else:                                                          |
| 700 | if index_type == '':                                              | 738 | # print("should connect whole signal")                         |
| 701 | #whole signal width-1 added to j                                  | 739 | pass                                                           |

| 740 | if a.rhs != None and a.rhs != "constant":               | 780              |                                                                 |
|-----|---------------------------------------------------------|------------------|-----------------------------------------------------------------|
| 741 | <pre>#print(a.rhs.connection_nodes)</pre>               | 781              |                                                                 |
| 742 | rhs_node = a.rhs.connection_nodes[a.rhs_i1-a.rhs.       | 782 <del>/</del> | ¢classes                                                        |
|     | widthoffset][a.rhs_i2]                                  | 783              | class register:                                                 |
| 743 | <pre>#print("Defined two connection nodes")</pre>       | 784              | id = 'reg'                                                      |
| 744 | if $(a.i1 != -1 and a.rhs != None)$ :                   | 785              | name = ""                                                       |
| 745 | #Connect bits of single node                            | 786              | clear = 0                                                       |
| 746 | if a.rhs == "constant":                                 | 787              | preset = 0                                                      |
| 747 | a.lhs.connection_nodes[a.i1][a.i2].constant =           | 788              | $next_state = 0$                                                |
|     | True                                                    | 789              | $clocked_on = 0$                                                |
| 748 | else:                                                   | 790              | $data_in = 0$                                                   |
| 749 | $n1 = a. lhs. connection_nodes [a.i1-a.lhs.$            | 791              | enable = 0                                                      |
|     | widthoffset ][a.i2]                                     | 792              | $\mathrm{Q}=~0$                                                 |
| 750 | $n2 = a.rhs.connection_nodes[a.rhs_i1-a.rhs.$           | 793              | QN = 0                                                          |
|     | widthoffset][a.rhs_i2]                                  | 794              | $synch_clear = 0$                                               |
| 751 | $connect_nodes(n1, n2)$                                 | 795              | $synch_preset = 0$                                              |
| 752 | else:                                                   | 796              | $synch_toggle = 0$                                              |
| 753 | #connect whole node if a.rhs exists                     | 797              | $synch_enable = 0$                                              |
| 754 | if a.rhs != None and a.rhs != "constant":               | 798              | output_structure_taken = False                                  |
| 755 | <pre>#print("Connect whole signal")</pre>               | 799              | structurecount $= 0$                                            |
| 756 | <pre>for i in range(len(a.lhs.connection_nodes)):</pre> | 800              | defstr(self):                                                   |
| 757 | <pre>for j in range(len(a.lhs.connection_nodes[i</pre>  | 801              | #return "Register: $n clear = " + str(self.clear) + " n$        |
|     | ])):                                                    |                  | $preset = " + str(self.preset) + " n next_state = " + str($     |
| 758 | n1 = a.lhs.connection_nodes[i][j]                       |                  | $self.next_state) + "\n clocked_on = " + str(self.clocked_on)$  |
| 759 | n2 = a.rhs.connection_nodes[i][j]                       |                  | +" $\n data_in = " + str(self.data_in) + "\n enable = " + str($ |
| 760 | $connect\_nodes(n1,n2)$                                 |                  | $self.enable) + "\n Q = " + str(self.Q)$                        |
| 761 |                                                         | 802              | return "Register: " + self.name                                 |
| 762 | #set nodes equal to each other                          | 803              | <pre>definit(self):</pre>                                       |
| 763 | def connect_nodes(n1, n2):                              | 804              | # print("made reg!")                                            |
| 764 | if $n1 = n2$ :                                          | 805              | global regs                                                     |
| 765 | return                                                  | 806              | regs = np.append(regs, self)                                    |
| 766 | else:                                                   | 807              | $self.output_nodes_q = [None]*1$                                |
| 767 | <pre>for i in range(len(n1.connected_inputs)):</pre>    | 808              | $self.output_nodes_qn = [None]*1$                               |
| 768 | n1_con = n1.connected_inputs[i]                         | 809              | self.has_parent = False                                         |
| 769 | found = False                                           | 810              | class gtech_or2:                                                |
| 770 | <pre>for j in range(len(n2.connected_inputs)):</pre>    | 811              | $id = 'gtech_or2'$                                              |
| 771 | $n2\_con = n2.connected\_inputs[j]$                     | 812              | name = ""                                                       |
| 772 | $n1.connected\_inputs.append(n2\_con)$                  | 813              | A = 0                                                           |
| 773 | $n2.connected\_inputs.append(n1\_con)$                  | 814              | $\mathrm{B} = 0$                                                |
| 774 | if n1.connected_outputs == []:                          | 815              | $\mathrm{Z}~=~0$                                                |
| 775 | $n1.connected_outputs = n2.connected_outputs$           | 816              | structurecount = 0                                              |
| 776 | <pre>elif n2.connected_outputs == []:</pre>             | 817              | defstr(self):                                                   |
| 777 | $n2.connected_outputs = n1.connected_outputs$           | 818              | return "OR2: " + self.name                                      |
| 778 | if $n1.constant = True: n2.constant = True$             | 819              | <pre>definit(self):</pre>                                       |
| 779 | if $n2.constant = True: n1.constant = True$             | 820              | #print("made or2!")                                             |

```
global or2s
821
           or2s = np.append(or2s, self)
822
            self.output nodes = [None]*1
823
   class gtech_xor2:
824
       id = 'gtech xor2'
825
       name = ""
826
       A = 0
827
       B = 0
828
       Z = 0
829
830
       structurecount = 0
       def str (self):
831
            return "XOR2: " + self.name
832
       def init (self):
833
           #print("made or2!")
834
            global xor2s
835
            xor2s = np.append(xor2s, self)
836
            self.output nodes = [None]*1
837
   class gtech and2:
838
       id = 'gtech and2'
839
       name = ""
840
841
       A = 0
       B = 0
842
       Z = 0
843
       structurecount = 0
844
845
       def str (self):
            return "AND2: " + self.name + " A: "+str(self.A)+" B: "+
846
       str(self.B)+"Z: "+str(self.Z)
       def init (self):
847
            global and2s
848
            and2s = np.append(and2s, self)
849
            self.output nodes = [None]*1
850
   class gtech not:
851
       id = 'gtech not'
852
       name = ""
853
       A = 0
854
       Z = 0
855
       structurecount = 0
856
       def str (self):
857
            return "NOT: " + self.name
858
       def init (self):
859
            global nots
860
            self.output nodes = [None]*1
861
           nots = np.append(nots, self)
862
   class gtech buf:
863
       id = 'gtech buf'
864
```

```
name = ""
865
       A = 0
866
       Z = 0
867
       structurecount = 0
868
       def str (self):
869
           return "BUF: " + self.name
870
       def init (self):
871
            self.output nodes = [None]*1
872
            global bufs
873
           bufs = np.append(bufs, self)
874
   class mux op:
875
       id = 'mux op'
876
       name = ""
877
       D = np.array([])
878
       S = np. array([])
879
       Z = np.array([])
880
       datawidth = 0
881
       structurecount = 0
882
       def init (self):
883
            global muxes
884
885
            self.output nodes = []
           muxes = np.append(muxes, self)
886
       def str (self):
887
           return "mux: " + self.name + "# inputs: " + str(self.
888
       d size()) + " datawidth: " + str(self.datawidth)
       def d size(self):
889
           D = self.D
890
891
           print (str(D))
           \#print("number of Ds " + str(len(D)))
892
           return len(D) #D.size()
893
       def s size(self):
894
           S = self.S
895
           return len(S)
896
       #def datawidth(self):
897
            D = self.D
898
       #
       #
            return len(D.item(0))
899
       def print(self):
900
            print("mux: " + self.name + "# inputs: " + str(self.
901
       d size()) + " datawidth: " + str(self.datawidth))
902 class select op:
       #NB select also has width of select to take into account
903
       id = 'select op'
904
       name = ""
905
       D
                 = np.array([])
906
       CONTROL
                    = np.array([])
907
```

```
\mathbf{Z}
                    = np.array([])
908
        datawidth
                    = 0
909
        selectwidth = 0
910
        structurecount = 0
911
        def init (self):
912
            global selects
            self.output nodes = []
914
            selects = np.append(selects, self)
915
        def __str__(self):
916
            return "select: " + self.name + " inputs: \n" + str((
917
        self.DATA)) + " \land nselect : \land n" + str((self.CONTROL)) + " \land n
       datawidth = " +str(self.datawidth)+ " selectwidth = "+ str(
        self.selectwidth)
918 class connection:
        id = 'connection'
919
                     _ !!
        name
        width
                    = 1
        depth
922
                     = 1
        widthoffset = 0
923
        def init (self):
924
925
            global connects
            connects = np.append(connects, self)
926
            self.connection_nodes = []
927
            self.init_connection_nodes()
928
        def init_connection_nodes(self):
929
            self.connection_nodes = [[node(j,i) for i in range(self.
930
       depth) | for j in range(self.width) |
931
        def add node input connection(self, i1, i2, l, index type):
            #to increment 1[4]
932
            add = 0
933
            if index type = '':
934
935
                for i in range(self.width):
                     li = l[:]
936
                     li[4] = l[4] + add
937
                     add = add+1
938
                     for j in range(self.depth):
939
                         self.connection nodes[i][j].
940
        add_input_connection(li)
941
            elif index type == 'index':
942
                 self.connection nodes [i1-self.widthoffset][i2].
943
        add input connection(1)
            elif index type == 'slice':
944
                for i in range(i2, i1):
945
                     li = 1[:]
946
```

```
li[4] = l[4] + add
                    self.connection nodes [i-self.widthoffset][0].
       add input connection(li)
                    add = add+1
       def add_node_output_connection(self, i1, i2, l, index_type, k)
           add = 0
           if index type = '':
               for i in range(self.width):
                    li = 1[:]
                    li[4] = l[4] + add
                   k = li[4]
                    for j in range(self.depth):
                        self.connection nodes[i][j].
       add output connection(li,k)
                    add = add+1
           elif index type == 'bit':
                self.connection_nodes[i1-self.widthoffset][i2].
       add output connection(1,k)
            elif index type == 'index':
                self.connection_nodes[i1-self.widthoffset][i2].
       add output connection(1,k)
           elif index type == 'slice':
               for i in range(i2, i1):
                    li = l[:]
                    li[4] = l[4] + add
                    k = li[4]
                    self.connection nodes[i][0].
       add output connection(li,k)
                    add = add+1
973 class assign:
       id = 'assign'
       lhs = ''
       rhs = ''
       i1 = 0
       i2 = 0
       lhs = None
       rhs = None
       rhs i1 = 0
       rhs i2 = 0
       def init (self):
           global assigns
           assigns = np.append(assigns, self)
```

948

949

950

951

952

953

954

955

956

957

958

959

960

961

962

963

964

965

966

967

968

969

970

971

972

974

975

976

977

978 979

980

981

982

983

984

985

```
self.lhs = lhs
        #
986
        #
              self.rhs = rhs
987
    class node:
988
        id = 'node'
989
        def init (self, i1, i2):
990
             self.connected inputs
                                         = []
991
             self.connected outputs
                                         = []
992
             self.i1 = i1
993
             self.i2 = i2
994
             self.constant = False
995
        def add input connection(self, l):
996
             self.connected inputs.append(1)
997
        def add output connection(self, l, j):
998
999
             self.connected outputs.append(1)
            connected object handle = l[0]
1000
             if 1[2] != 'reg':
                 #add output node to connected object
                 connected object handle.output nodes [j] = self
1003
             else:
1004
                 #add output node to register
1006
                 connected object handle.output nodes q[0] = self
1007
                 connected object handle.output nodes qn[0] = self
1008
        def print(self):
1009
1010
             print("Node:")
             print("\tinputs")
             print(self.connected inputs)
1012
             print("\toutputs:")
             print(self.connected outputs)
1014
             print("Endnode")
1015
    class input obj():
1016
1017
        id
                 = 'input'
                 = ""
        name
1018
        width = 1
1019
        depth = 1
        width offset = 0
        def init (self):
            global inputs
            inputs = np.append(inputs, self)
             self.connection nodes = []
        def init_connection_nodes(self):
            #print("\ninitializing connection nodes")
             self.connection nodes = [[node(j,i) for i in range(self.
1028
        depth)] for j in range(self.width)]
            #print(self.connection nodes)
1029
```

```
def add_node_input_connection(self, i1, i2, l, index_type):
1030
            add = 0
            if index type = '':
                 if (i1 = -1 \text{ and } i2 = -1):
                     for i in range(self.width):
                         1i = 1[:]
                         li[4] = l[4] + add
                         for j in range(self.depth):
                              #print("added input connection with 1
1038
        [4]: "+str(li[4]))
                              self.connection nodes[i][j].
1039
        add input connection(li)
                         add = add+1
1040
1041
             elif index type == 'index':
                 self.connection_nodes[i1-self.widthoffset][i2].
1043
        add input connection(1)
             elif index type == 'slice':
1044
                 for i in range(i2, i1):
1045
                     li = l[:]
1046
                     li[4] = l[4] + add
                     self.connection nodes [i][0].add input connection
        (li)
1049
                     add = add+1
1050 class output obj():
        id
                 = 'output'
                 = ""
        name
1052
        width
                = 1
        depth = 1
1054
         width offset = 0
        def init (self):
1056
            global outputs
            outputs = np.append(outputs, self)
1058
             self.connection_nodes = []
1059
        def init connection nodes(self):
1060
            #print("\ninitializing connection nodes")
            self.connection nodes = [[node(j,i) for i in range(self.
1062
        depth) | for j in range(self.width) |
1063
            #print(self.connection nodes)
        def add node output connection(self, i1, i2, l, index type, k)
1064
        :
            add = 0
1065
            if index type = '':
1066
                 for i in range(self.width):
                     li = 1[:]
1068
```

1061

1067

```
li[4] = l[4] + add
                                                                                      name = ""
                                                                             1107
1069
1070
                      k = li [4]
                                                                             1108
                                                                                       connection point string = ""
                                                                                      def init (self, name):
                      for j in range(self.depth):
1071
                                                                             1109
                          self.connection_nodes[i][j].
                                                                                           self.regs
        add output connection(li,k)
                                                                                           self.nots
                      add = add+1
                                                                                           self.bufs
1073
                                                                                           self.and2s
                     #print(add)
                                                                             1113
1074
             elif index type == 'bit':
                                                                                           self.or2s
                                                                             1114
1075
                                                                                           self.muxes
                 self.connection nodes [i1-self.widthoffset][i2].
                                                                                           self.selects
        add output connection(1,k)
                                                                             1116
                                                                                           self.connects
                                                                             1117
             elif index type == 'index':
                                                                                           self.inputs
1078
                                                                             1118
                 self.connection nodes [i1-self.widthoffset][i2].
                                                                                           self.outputs
1079
                                                                             1119
                                                                                           self.dependencies
        add output connection(1,k)
                                                                             1120
             elif index type == 'slice':
                                                                                           self.shifters
1080
                                                                             1121
                 for i in range(i2, i1):
                                                                                           self.comparators
1081
                                                                                           self.xor2s
                      li = l[:]
                                                                             1123
                      li[4] = l[4] + add
                                                                                           self.multipliers
1083
                                                                             1124
                      k = li [4]
                                                                                           self.subtractors
1084
                      self.connection nodes[i][0].
                                                                                           self.b shifters
1085
                                                                             1126
                                                                                           self.adders
        add output connection(li,k)
                      add = add+1
                                                                                           self.shift adders
                                                                             1128
1086
                                                                                           self.divisors
        def add_node_input_connection(self, i1, i2, l, index_type):
1087
             add = 0
                                                                             1130
                                                                                           self.assigns
1088
1089
             if index type = '':
                                                                                           self.name = name
                 if (i1 = -1 \text{ and } i2 = -1):
                                                                                           self.connection points = []
1090
                      for i in range(self.width):
                                                                                      def set lists(self):
1091
1092
                          li = l[:]
                                                                                           global regs
                          li[4] = l[4] + add
                                                                                           global nots
1093
                                                                             1135
                          for j in range(self.depth):
                                                                                           global bufs
1094
                                                                             1136
1095
                               self.connection_nodes[i][j].
                                                                             1137
                                                                                           global and2s
        add input connection(li)
                                                                             1138
                                                                                           global or2s
                          add = add+1
                                                                                           global muxes
1096
                                                                             1139
                                                                                           global selects
                                                                             1140
1097
             elif index type == 'index':
                                                                                           global connects
1098
                                                                             1141
                 self.connection_nodes[i1-self.widthoffset][i2].
                                                                                           global inputs
1099
                                                                             1142
        add input connection(1)
                                                                                           global outputs
                                                                             1143
             elif index type == 'slice':
                                                                                           global dependencies
1100
                                                                             1144
                                                                                           global shifters
                 for i in range(i2, i1):
                                                                             1145
1101
                      li = l[:]
                                                                                           global comparators
                                                                             1146
1102
                      li[4] = l[4] + add
                                                                             1147
                                                                                           global xor2s
                                                                                           global multipliers
                      self.connection nodes[i][0].add input connection
1104
                                                                             1148
        (li)
                                                                                           global subtractors
                                                                             1149
                      add = add+1
                                                                                           global b shifters
                                                                             1150
1105
1106 class module:
                                                                                           global adders
```

= [1]

= [1]

= []

= []

= []

= [1]

= []

= [1]

= []

= []

= [1]

= []

= []

= []

= []

= []

= []

= ||

= []

= [1]

= []

| 1150         | global shift adders                                           | 1101 | connectionpoints.append(tuple([m[0], 1])        |
|--------------|---------------------------------------------------------------|------|-------------------------------------------------|
| 1152<br>1153 |                                                               | 1191 | )                                               |
| 1155         |                                                               | 1192 | / #connectionpoints.append(match)               |
| 1154         |                                                               | 1192 | <pre>#print(connectionpoints)</pre>             |
| 1155         |                                                               | 1193 | self.connection_points = connectionpoints       |
| 1150         |                                                               |      | class dependency:                               |
| 1157         |                                                               | 1195 | #name of instantiation                          |
| 1158         |                                                               | 1190 | id = 'dep'                                      |
| 1160         |                                                               | 1198 | name = ""                                       |
| 1161         |                                                               | 1199 | #modulename                                     |
| 1162         |                                                               | 1200 | modulename = ""                                 |
| 1163         |                                                               | 1200 | module_handle = None                            |
| 1164         |                                                               | 1201 | possible_HINST = False                          |
| 1165         |                                                               | 1202 | definit(self):                                  |
| 1166         |                                                               | 1200 | global dependencies                             |
| 1167         |                                                               | 1204 | dependencies = np.append(dependencies, self)    |
| 1168         |                                                               | 1206 | self.connections = []                           |
| 1169         |                                                               | 1200 | def add_connections(self, list):                |
| 1170         |                                                               | 1201 | self.connections.append(list)                   |
| 1171         |                                                               |      | <pre>class shift_op:</pre>                      |
| 1172         |                                                               | 1210 | $id = 'shift_op'$                               |
| 1173         |                                                               | 1211 | name = $1$                                      |
| 1174         |                                                               | 1212 | A = 0                                           |
| 1175         |                                                               | 1213 | SH = 0                                          |
| 1176         |                                                               | 1214 | $\mathbf{Z} = 0$                                |
| 1177         |                                                               | 1215 | $a_{width} = 0$                                 |
| 1178         |                                                               | 1216 | $sh_width = 0$                                  |
|              | connection_point_string.translate({ord(i): None for i in '\   | 1217 | $z_{\rm width} = 0$                             |
|              | \n'})                                                         | 1218 | -structurecount = 0                             |
| 1179         |                                                               | 1219 | <pre>definit(self):</pre>                       |
| 1180         | connection points = $[]$                                      | 1220 | global shifters                                 |
| 1181         | <pre>for key, rx in rx_dict_module_connections.items():</pre> | 1221 | <pre>shifters = np.append(shifters, self)</pre> |
| 1182         | match = rx.findall(self.connection_point_string) #rx          | 1222 | <pre>self.output_nodes = []</pre>               |
|              | .findall(line)                                                | 1223 | <pre>class comp_op:</pre>                       |
| 1183         | #print("found "+str(len(match))+" matches in                  | 1224 | $id = "comp_op"$                                |
|              | modulestring")                                                | 1225 | name = ""                                       |
| 1184         | if match:                                                     | 1226 | $\mathrm{A}~=~0$                                |
| 1185         | #print(str(match))                                            | 1227 | $\mathrm{B}~=~0$                                |
| 1186         | for m in match:                                               | 1228 | $\mathrm{Z}~=~0$                                |
| 1187         | <pre>if (m[0] == ''): connectionpoints.append(</pre>          | 1229 | $a_width = 0$                                   |
|              | tuple([m[2]]))                                                | 1230 | $b_width = 0$                                   |
| 1188         | else:                                                         | 1231 | $z_width = 0$                                   |
| 1189         | $l = m[1].translate({ord(i): None for i})$                    | 1232 | structurecount = 0                              |
|              | in '}{'})                                                     | 1233 | <pre>definit(self):</pre>                       |
| 1190         | l = l.split(', ')                                             | 1234 | global comparators                              |
|              |                                                               |      |                                                 |

```
comparators = np.append(comparators, self)
             self.output nodes = []
1236
             #print("Made comparator")
1237
    class sub_op:
1238
         id = "sub op"
        name = ""
        A = 0
        B = 0
1242
        Z = 0
1243
         structurecount = 0
1244
         def init (self):
1245
             global subtractors
1246
             subtractors = np.append(subtractors, self)
1247
             self.output nodes = []
1248
             #print("made subtractor")
    class add op:
1250
         id = "add op"
1251
        name = ""
        A = 0
1254
        B = 0
        Z = 0
         a width = 0
1256
        b \text{ width} = 0
1257
1258
         z \text{ width} = 0
1259
         structurecount = 0
         def __init__(self):
1260
             global adders
1261
1262
             adders = np.append(adders, self)
             self.output nodes = []
1263
             #print("made adder")
1264
    class mult_op:
1265
1266
        id = "mult op"
        name = ""
1267
        A = 0
1268
        B = 0
1269
        Z = 0
         a width = 0
         b width = 0
        z width = 0
1273
         structurecount = 0
1274
         def __init__ (self):
             global multipliers
             multipliers = np.append(multipliers, self)
             self.output nodes = []
1278
             #print("made multiplicator")
1279
```

```
1280 class div_op:
        id = "div op"
1281
        name = ""
1282
1283
        A = 0
        B = 0
1284
        Z = 0
1285
1286
         a width = 0
         b \text{ width} = 0
         z \text{ width} = 0
1288
         structurecount = 0
         def init (self):
1290
             global divisors
1291
             divisors = np.append(divisors, self)
1293
             self.output nodes = []
             #print("Made divisor")
1294
    class b shift op:
1295
         id = "b shift op"
1296
        name = ""
1297
        A = 0
1298
        SH = 0
1299
        Z = 0
1300
         a width = 0
1301
        sh_width = 0
1302
         z \text{ width} = 0
1303
1304
         structurecount = 0
         def init (self):
1305
             global b shifters
1306
1307
             b_shifters = np.append(b_shifters, self)
             self.output nodes = []
1308
             #print("made barrelshift")
1309
1310 class shift_add_op:
1311
        #dont know what content should be here, may need to
        implement as I go if I encounter it during testing
        id = "shift add op"
1312
        name = ""
1313
         structurecount = 0
1314
        def init (self):
1315
             global shift adders
1317
             shift adders = np.append(shift adders, self)
             self.output nodes = []
1318
             #print("Made shift adder")
1319
1322 #dictionaries containing regular expressions to handle different
          constructs from the elaborated systemverilog
```

```
1323 rx_dict_module_start = \{
                    'begin'
                                                   : re.compile(rmodule \setminus s + (\setminus S +))
1324
1325
1326 rx_dict_objectconnection = {
                     'bit'
                                                   : re.compile(r''(1'b d)''),
                     'connect' : re.compile(r"(\S+)(?:\left( \left( d \{1,4\}\right) \right))?(?:\left( \left( d \{1,4\}\right) \right))
                    \{1,4\}) \setminus ])?")
1329 }
1330 rx dict start = {
                     'register'
                                                             : re.compile(r'' \setminus \times \times (S+) \setminus s'')
                     'GTECH OR2'
                                                             : re.compile (r "GTECH OR2s+(S+),"),
                     'GTECH NOT'
                                                             : re.compile (r "GTECH NOT s + (S+) s"),
                     'GTECH BUF'
                                                             : re.compile(r"GTECH BUFs+(S+),
1334
                     'GTECH AND2'
                                                             : re.compile(r"GTECH AND2s+(S+),
                     'GTECH XOR2'
                                                             : re.compile(r"GTECH XOR2s+(S+),
                     'MUX OP'
                                                             : re.compile(r"MUX OP(s+(S+))s"),
1337
                     'SELECT OP'
                                                             : re.compile(r"SELECT OPs+(S+),"),
1338
                    #TODO: mux add, sub, mult, shifts and compares remain at
                    least ...
                    #all of these can be single bit or multibit. if square
1340
                    brackets before name
                    #multi, else single (group capture?)
1341
                                                             : re.compile(r"(input)s+(?:\{(d \{1,3\}):(d \{1,
                    'input'
1342
                    \{1,3\}) \])?"),
                    'output'
                                                             : re.compile(r"(output)s+(?:[(d{1,3}):(d
1343
                    \{1,3\}) \setminus ])?"),
                     'wire'
                                                             : re.compile(r"(wire)s+(?:[(d{1,3}):(d
1344
                    \{1,3\}) \setminus ])?"),
                    'dep'
                                                             : re.compile(r"(\S+)\s+(\S+u \S+)\s+("),
1345
                    'COMP OP'
                                                             : re.compile(r "^ \times s * (?:EQ UNS OP | NE UNS OP |
1346
                   EQ TC OP|NE TC OP|GEQ UNS OP|GEQ TC OP|LEQ UNS OP|LEQ TC OP|
                   GT_UNS_OP|GT_TC_OP|LT_UNS_OP|LT_TC_OP) \setminus s + (\setminus S + ) \setminus s"),
                                                             : re.compile(r"SUB (?:UNS OP|UNS CI OP|TC OP
1347
                    'SUB OP'
                    |\text{TC CI OP} \setminus s + (\setminus S +) \setminus s"),
1348
                     'ADD OP'
                                                            : re.compile(r"ADD (?:UNS OP|UNS CI OP|TC OP
                    |\text{TC CI OP} \setminus s + (\setminus S +) \setminus s"),
                     'MULT OP'
                                                            : re.compile(r"MULT (?:UNS OP|TC OP)s+(S+)
1349
                    \langle \mathbf{s}'' \rangle,
                    'DIV OP'
                                                            : re.compile(r"(?:DIV|MOD|REM|DIVREM|DIVMOD)
                     (?:UNS|TC) OP(s+(S+))s"), #only div in Yoda
                                                             : re.compile(r"(?:ASH|ASHR|SRA) (?:UNS|TC)
                    'SHIFT OP'
1351
                    (?:UNS|TC|OP)(?:OP)?(s+(S+))s"),
                    'B SHIFT OP' : re.compile(r"BSH(?: UNS OP| TC OP|L TC OP|
1352
                   R UNS OP|R TC OP(s+(S+))s", #not in Yoda
```

```
'SHIFT ADD OP' : re.compile(r"(?:SLA UNS OP|SLA TC OP)\s+()
1353
           S+) \setminus s"), #not in Yoda
           'assign'
                                 : re.compile(r^{"assign} \setminus s([^{-}]+)^{"})
1354
          \#'SRA'
1356 }
1357 rx dict dep internals = {
           #'dep' : re.compile(r" \setminus ( \ s?(?: \ ( \ \land , ]+) \setminus (([^ \setminus ( \ ); \ s]*)
1358
           (),?) + (s?);")
           'dep' : re.compile(r"\s?(?:\.(?P<connection_point>[^\(\s
1359
           |+\rangle \setminus ((?P < connected to > [^ ((); s]*)), ?) \land s?")
1360 }
1361 rx dict assign = {
           'rhs': re.compile(r''=([^=]+); ")
1362
1363
1364 rx dict shift = {
           'A'
                      : re.compile(r"\A\(([^{\]})]*))),
1365
           'SH'
                      : re.compile(r'' \setminus SH \setminus (([^ \setminus)]*) \setminus )''),
1366
           'Z'
                      : re.compile(r " \setminus Z \setminus (([^ \setminus )]*) \setminus )")
1367
1368
1369 rx dict module connections = {
           #'reconnect' : re.compile(r"\s?((?:\.(?P<connection point</pre>
           > [^ (\langle s, ]+) ((?P < connected_to > [^ (\langle ); \langle s] *) )) [, ] s?")
           \# | ( [^, \backslash . \backslash ( \backslash ) \backslash \{ \backslash \} \backslash [ \backslash ]] + ) [, \backslash ) ] \backslash s?" ),
           #'plain'
                               : re.compile(r"\s*([(\langle s, ]+) \rangle?[,\)]")
1371
           'connection' : re.compile(r"(?:[,\(]\.([^\(\),]+)\(([^\(\))
1372
           |+) \setminus ) ) | (?:[, \setminus (]([^ \setminus .][^ \setminus ) \setminus (,;]*))")
1373 }
1374 rx dict end = {
           \# ' end '
                       : re.compile(r " \setminus);"),
1375
           'semi' : re.compile(r";") #hopefully this does not ruin
1376
           anything and all semicolons are ends
1377 }
1378 rx_dict_in_out_wire = {
           'varname' : re.compile(r''[\land \ s, ]+'') #one or more char not
1379
           whitespace comma
1380 }
1381 rx dict SELECT = {
                            : re.compile(r"\.DATA\d{1,2}\(([^ \])),
           'DATA'
1382
                            : re.compile(r"\.CONTROL\d{1,2}\(([^\)]*)\)"),
           'CONTROL'
1383
1384
           'Z'
                            : re.compile(r " \setminus Z \setminus (([^ \setminus )]*) \setminus )")
1385 }
1386 rx dict_comp = {
1387
           'A'
                                 : re.compile(r " \setminus A \setminus (([^ \setminus )]*) \setminus )"),
           'B'
                                  : re.compile(r " \setminus B \setminus (([^ \setminus )]*) \setminus )"),
1388
                                  : re.compile(r"\.QUOTIENT((([^{)})]*)))
           'QUOTIENT '
1389
```

```
D-18
```

| 1390  | }                                                                                                         |
|-------|-----------------------------------------------------------------------------------------------------------|
| 1391  | $rx\_dict\_SUB\_ADD\_MULT = \{$                                                                           |
| 1392  | $A' : re.compile(r" \land A \land (([^ \land)]*) \land)"),$                                               |
| 1393  | $"B" : re.compile(r" \.B \ (([^ \] *) \)"),$                                                              |
| 1394  | $'Z'$ : re.compile(r"\.Z\(([^\)]*)\)")                                                                    |
| 1395  | }                                                                                                         |
| 1396  | $rx_dict_MUX = {$                                                                                         |
| 1397  | 'D' : re.compile(r"\.D\d{1,2}\(([^\)]*)\)"), # SEEMS TO                                                   |
|       | BE SOME THAT HAS UP TO D31 AS D INPUTS. HOW DO i HANDLE                                                   |
|       | THese VARYING THINgS                                                                                      |
| 1398  | # ALSO SOME                                                                                               |
|       | ONLY GOING TO D3 BUT ATTACHING 5 BIT TO EACH D data width of                                              |
|       | d varies, number of D inputs varies                                                                       |
| 1399  | 'S' : re.compile(r"\.S\d{1,2}\(([^\)]*)\)"), # make                                                       |
|       | arrays for the mux                                                                                        |
| 1400  | $Z' : re.compile(r" \setminus Z \setminus (([^ \setminus )]*) \setminus)")$                               |
| 1401  |                                                                                                           |
|       | $rx$ dict BUF = {                                                                                         |
| 1403  | $A' : re.compile(r" \land A \land (([^ \land)]*) \land)"),$                                               |
| 1404  | $Z' : re.compile(r" \. Z \ (([^ \] *) ))")$                                                               |
| 1405  |                                                                                                           |
|       | $rx$ dict NOT = {                                                                                         |
| 1407  | $A' : re.compile(r" \land A \land (([^ \land)]*) \land)"),$                                               |
| 1408  | $Z' : re.compile(r" \. Z \ (([^ \] *) \)")$                                                               |
| 1409  |                                                                                                           |
|       | $rx_dict_AND2 = {$                                                                                        |
| 1411  | $A' : re.compile(r" \land A \land (([^ \land)]*) \land)"),$                                               |
| 1412  | $B' : re.compile(r" \.B \ (([^ \] *) \)"),$                                                               |
| 1413  | $Z' : re.compile(r" \. Z \ (([^ \] *) \)")$                                                               |
| 1414  |                                                                                                           |
|       | $rx dict OR2 = {$                                                                                         |
| 1416  | $A' : re.compile(r" \land A \land (([^ \land)]*) \land)"),$                                               |
| 1417  | $B' : re.compile(r" \.B \ (([^ \] *) \)"),$                                                               |
| 1418  | $Z' : re.compile(r" \. Z \ (([^ \] *) \)")$                                                               |
| 1419  |                                                                                                           |
|       | $rx dict reg = {$                                                                                         |
| 1421  | $clear' : re.compile(r" \.clear \(([^\]*) \)"),$                                                          |
| 1422  | 'preset' : re.compile(r"\.preset\((([^\)]*)\)"),                                                          |
| 1423  | $ \text{'next state'} : \text{re.compile}(r"\setminus.next state\setminus(([^ \setminus]*)\setminus)"), $ |
| 1424  | $"clocked_on" : re.compile(r" \.clocked_on \(([^\]*) \)"),$                                               |
| 1425  | 'data in' : re.compile(r"\.data in\(([^\)]*)\)"),                                                         |
| 1426  | $= \operatorname{re.compile}(r" \ enable \ (((( \ )) \ )))),$                                             |
| 1427  | $'Q' : re.compile(r"\.Q\(([^\)]*)\)"),$                                                                   |
| 1428  | $(QN') : re.compile(r'',QN((([^{)}]*)))),$                                                                |
| - 120 |                                                                                                           |

| 1429         | $"synch_clear" : re.compile(r" \.synch_clear \(([^\)]*) \)")$      |
|--------------|--------------------------------------------------------------------|
| 1430         | ,<br>'synch_preset' : re.compile(r"\.synch_preset\(([^\)]*)\)"     |
| 1 100        | ),                                                                 |
| 1431         | 'synch_toggle' : re.compile(r"\.synch_toggle\(([^\)]*)\)"          |
|              | ),                                                                 |
| 1432         | 'synch_enable' : re.compile(r"\.synch_enable\(([^\)]*)\)"          |
|              | )                                                                  |
| 1433         | }                                                                  |
| 1434         |                                                                    |
| 1435         | #process module instantiations                                     |
| 1436         | <pre>def process_dependencies():</pre>                             |
| 1437         | for top_module in modules:                                         |
| 1438         | $set\_global\_lists(top\_module)$                                  |
| 1439         |                                                                    |
| 1440         | for dep in top_module.dependencies:                                |
| 1441         | $found\_dep = False$                                               |
| 1442         | <pre>#print("Looking for "+dep.modulename+" in</pre>               |
|              | dependencies")                                                     |
| 1443         | modulename = dep.modulename                                        |
| 1444         | #find modulename in module list                                    |
| 1445         | for the module in modules.                                         |
| 1446<br>1447 | for dep_module in modules:                                         |
| 1447         | if dep module.name == modulename:                                  |
| 1449         | found dep = $True$                                                 |
| 1450         | $dep.module _handle = dep_module$                                  |
| 1451         | $module\_connection\_list = dep\_module.$                          |
|              | connection points                                                  |
| 1452         |                                                                    |
| 1453         | for dependency_connection_tuple in dep.                            |
|              | connections [0]:                                                   |
| 1454         | $cleaned\_dependency\_connection\_point =$                         |
|              | $dependency\_connection\_tuple[0].translate({ord(i): None for i})$ |
|              | in '\ '})                                                          |
| 1455         | found = False                                                      |
| 1456         |                                                                    |
| 1457         | for module_connection_tuple in                                     |
|              | module_connection_list:                                            |
| 1458         |                                                                    |
| 1459         | if                                                                 |
|              | cleaned_dependency_connection_point ==                             |
|              | module_connection_tuple[0]:                                        |
| 1460         | found = True                                                       |

```
cleaned\_dep\_connectionlist =
1461
        dependency connection tuple [1]. translate ({ord(i): None for i
         in '}{\ '})
                                       cleaned\_dep\_connectionlist =
1462
        cleaned dep connectionlist.split(',')
1463
1464
                                       for i in range(len(
        cleaned dep connectionlist)):
                                           if len(
1465
        module connection tuple) > 1:
                                               i1, i2,
1466
        module connection, typeindex = find indexes(
        module_connection_tuple[1][i])
                                           else:
1467
1468
                                               i1, i2,
1469
        module connection, typeindex = find indexes (
        module_connection_tuple[0])
1470
                                           dep handle, found dep =
1471
        search list(dep module.inputs, module connection)
                                           if found dep:
1472
1473
                                               print("sending "+str(
1474
        cleaned dep connectionlist [i])+" into process match")
                                               dataN, datawidth,
1475
        processed matchlist = process match ([
        cleaned dep connectionlist [i]], dep handle, 'input', 'dep')
1476
1477
1478
1479
                                           else:
                                               dep_handle, found_dep =
1480
        search list(dep module.outputs, module connection)
                                               #print(dep connection)
1481
                                               if found_dep:
1482
1483
                                                   dataN, datawidth,
1484
        processed matchlist = process match ([
        cleaned dep connectionlist [i]], dep handle, 'input', 'dep')
                                               else:
1485
1486
                                                    print("Did not find
1487
        dep "+dep module.name)
1488
```

```
1489
1490
                              #DEP NOT FOUND IN MODULES, MAYBE HINST
                              if found == False:
1491
                                  dep.possible_HINST = True
1492
1493
                 if found dep = False:
1494
                     print("Did not find dep: "+dep.modulename)
1495
             top module.set lists()
1496
             empty global lists()
1497
                     #find dep ports in inputs or outputs
1499
1500 #print name of all objects in a list
    def print list names(1):
        for e in 1:
             print(" \setminus t" + e.name)
1503
1505
1506 #go structure heads and make structure trees
    def connect structure(module):
1507
        global top_level_parents
1508
1509
        for inp in module.inputs:
1510
             for nl in range (len (inp. connection nodes)):
                 for n in range(len(inp.connection nodes[nl])):
1513
                     \#print(str(nl)+""+str(n))
1514
                     structure handle = structure(None, inp, nl)
1516
                     top level parents.append(structure handle)
                     connect children(structure handle, nl, n)
1517
1518
1519
        for m in modules:
             for reg in m. regs:
                 #print(reg.name)
                 structure handle = structure (None, reg, 0)
                 top level parents.append(structure handle)
1523
1524
                 connect_children(structure_handle, 0, 0)
1525
1526
1527 #recursively connects all children to a parent and expand
        structure tree
1528 def connect children (parent, i1, i2):
         object handle = parent.represented object handle
1529
1530
        i1 = parent.i1
1532
```

1498

```
if object_handle.id != 'input' and object_handle.id != '
        output ':
1534
            datawidth_set = { 'sub_op', 'select_op', 'mux_op', '
        shift op', 'add op', 'mult op', 'comp op', 'div op', '
        b shift op', 'shift add op'}
1536
            object handle.structurecount = object handle.
        structurecount+1
        global top level parents
1539
        #print("Current object: "+object handle.name)
1540
        output nodes = []
        output nodes q = []
1542
        output nodes qn = []
1543
        #print("parent: "+object handle.name)
        if object handle.id == 'output':
1545
            output nodes = []
1546
        elif object handle.id = 'reg':
1547
1548
            if object handle.output structure taken == False:
                output nodes q = object handle.output nodes q
                output nodes_qn = object_handle.output_nodes_qn
                object handle.output structure taken = True
1553
            if parent != None:# and parent.connected inputs [0] !=
1554
        None:
                if parent.parent != None:
                     object handle.has parent = True
1557
1558
        elif object handle.id == 'input':
1559
            output_nodes.append(object_handle.connection_nodes[i1][
1560
        i2])
1561
        elif object_handle.id == 'comp_op':
1562
            output nodes = object handle.output nodes
        else:
            output nodes = [object handle.output nodes[i1]]
            #ADDED to shorten recursion
            object handle.output nodes[i1] = parent
1569
        #print(output nodes)
        for node in output_nodes:
```

```
if node != None:
         if node.id == 'structure':
             parent = node
             return
         elif(node != None):
             for con in node.connected inputs:
                  child handle = con[0]
                  structure handle = structure (parent,
child handle, con[4])
                  added = parent.add child(structure handle)
                  if added:
                      structure handle.structure_type =
child handle.id
                      structure_handle.
structure connection characteristic = con[3]
                      if \operatorname{con}[2] = \operatorname{'reg'} and \operatorname{con}[2] \mathrel{!=} \operatorname{'}
control':
                           con[0]. has parent = True
                      if structure handle.
represented _object _handle.id != 'reg' and structure _handle.
represented object handle.id != 'input':
                           structure handle.
represented object handle.output nodes [0] = structure handle
                      if (con [3] != 'control' and con [2] != '
reg' and con[2] \mathrel{!=} \mathrel{'input'}:
                           connect children (structure handle,
con[4], node.i2)
                      elif(con[2] = 'input'):
                           #if input i2 needs to be set
correctly
                           connect_children(structure_handle,
con[4], con[5]) #node.i2)
    else:
         pass
for node in output_nodes_q:
    if (node != None):
         for con in node.connected inputs:
```

1573

1574

1578

1581

1582

1584

1586

1587

1588

1589

1590

1594

1597

1598

1599

1600

1602

1603

1605

| 1606 | $child_handle = con[0]$                                            | 1639 | $con[0]$ .has_parent = True                                        |
|------|--------------------------------------------------------------------|------|--------------------------------------------------------------------|
| 1607 |                                                                    | 1640 | if structure_handle.                                               |
| 1608 | $structure_handle = structure(parent,$                             |      | represented_object_handle.id != 'reg' and structure_handle.        |
|      | child_handle, con[4])                                              |      | represented_object_handle.id != 'input':                           |
| 1609 | $added = parent.add_child(structure_handle)$                       | 1641 | $structure\_handle$ .                                              |
| 1610 | if added:                                                          |      | $represented\_object\_handle.output\_nodes[0] = structure\_handle$ |
| 1611 | $structure\_handle.structure\_type =$                              | 1642 | if $(con[3] != 'control' and con[2] != 'reg'$                      |
|      | child_handle.id                                                    |      | and $con[2] \mathrel{!=} \mathrel{'input'}:$                       |
| 1612 | structure_handle.                                                  | 1643 | <pre>connect_children(structure_handle, con</pre>                  |
|      | $structure\_connection\_characteristic = con[3]$                   |      | [4], node.i2)                                                      |
| 1613 | if $con[2] = 'reg' and con[2] != 'control':$                       | 1644 | elif(con[2] = 'input'):                                            |
| 1614 | $con [0].has_parent = True$                                        | 1645 | connect_children(structure_handle, con                             |
| 1615 | if structure_handle.                                               |      | [4], con[5])                                                       |
|      | represented_object_handle.id != 'reg' and structure_handle.        | 1646 |                                                                    |
|      | represented_object_handle.id != 'input':                           | 1647 |                                                                    |
| 1616 | structure_handle.                                                  | 1648 |                                                                    |
|      | $represented\_object\_handle.output\_nodes[0] = structure\_handle$ | 1649 | #structure tree class                                              |
| 1617 | if $(con[3] != 'control' and con[2] != 'reg'$                      | 1650 | class structure:                                                   |
|      | and $con[2] = 'input'$ :                                           | 1651 | structure_type = ''                                                |
| 1618 | $connect\_children(structure\_handle, con$                         | 1652 | $structure\_connection\_characteristic = $                         |
|      | [4], node.i2)                                                      | 1653 | id = 'structure'                                                   |
| 1619 | elif(con[2] = 'input'):                                            | 1654 | <pre>definit(self, parent, represented_object_handle, i1):</pre>   |
| 1620 | $connect\_children(structure\_handle,con$                          | 1655 | self.parent = parent                                               |
|      | [4], con[5])                                                       | 1656 | self.children = []                                                 |
| 1621 |                                                                    | 1657 | $self.represented_object_handle =$                                 |
| 1622 | for node in output_nodes_qn:                                       |      | represented_object_handle                                          |
| 1623 | if (node != None):                                                 | 1658 | self.i1 = i1                                                       |
| 1624 |                                                                    | 1659 | self.powerStructure = None                                         |
| 1625 | if node.id == 'structure':                                         | 1660 | <pre>def add_child(self, child):</pre>                             |
| 1626 | parent = node                                                      | 1661 | if child.represented_object_handle == self.                        |
| 1627 |                                                                    |      | represented_object_handle:                                         |
| 1628 | return                                                             | 1662 | return False                                                       |
| 1629 |                                                                    | 1663 | for c in self.children:                                            |
| 1630 | for con in node.connected_inputs:                                  | 1664 | if c.represented_object_handle == child.                           |
| 1631 | $child_handle = con[0]$                                            |      | represented_object_handle:                                         |
| 1632 | # print(con)                                                       | 1665 | return False                                                       |
| 1633 | $structure_handle = structure(parent,$                             | 1666 | self.children.append(child)                                        |
|      | child_handle, con[4])                                              | 1667 | return True                                                        |
| 1634 | added = parent.add_child(structure_handle)                         | 1668 | def print(self):                                                   |
| 1635 | if added:                                                          | 1669 | if self.children != []: print("{", end = '')                       |
| 1636 | $structure\_handle.structure\_type =$                              | 1670 | for child in self.children:                                        |
|      | child_handle.id                                                    | 1671 | $print(child.represented_object_handle.id+",",end =$               |
| 1637 | structure_handle.                                                  |      | '')                                                                |
|      | $structure\_connection\_characteristic = con[3]$                   | 1672 | child.print()                                                      |
| 1638 | if $con[2] = 'reg' and con[2] != 'control':$                       | 1673 |                                                                    |

```
if self.children != []: print("}", end = '')
                                                                              1716
1674
                                                                              1717
1675
         def repr (self, level=0):
1676
                                                                              1718
             ret = "\t"*level+repr(self.represented_object_handle.id)
1677
        +"\n"
                                                                              1719
             if level < 11:
1678
                                                                              1720
                 for child in self.children:
                                                                              1721
1679
                      ret += child. repr (level+1)
1680
             return ret
                                                                              1724
1683 #calls all the functions in the right order to create the
         structural representation
1684 def run parse elab(filename):
         parse file(filename)
1685
                                                                              1728
         for m in modules:
1686
                                                                              1729
             m.set connection points()
1687
                                                                              1730
         process dependencies()
1688
                                                                              1731
         connect_structure(modules[0])
1690
1691
                                                                              1734
        #need to also count module instantiations with no content-
         assume hinst
                                                                              1736
         count gates (modules [0])
         print gates()
1694
                                                                              1738
1695
         return modules, top level parents
                                                                              1739
1696
                                                                              1740
1697 #count gates in representation
                                                                              1741
1698 def count gates (module):
                                                                              1742
         global reg n
1699
         global not n
1700
         global buf n
                                                                              1745
1702
         global and 2 n
                                                                              1746
         global or2_n
                                                                              1747
1703
         global mux n
                                                                              1748
1704
         global select n
                                                                              1749
         global shift_n
1706
         global comp n
                                                                              1750
         global xor2 n
         global mult n
                                                                              1751
         global sub n
                                                                              1752
1710
         global b_shift_n
                                                                              1753
         global add n
1712
                                                                              1754
         global shift add n
1713
         global div n
1714
        m = module
1715
```

```
if m != None:
                                for r in m. regs:
                                           if r.output nodes q[0] != None or r.output nodes qn
                      [0] != None:
                                                      if r.has parent:
                                                      #print(r.name)
                                                                 reg_n
                                                                                                = reg n +1
                                                     #else:
                                                              \# \operatorname{reg}_n = \operatorname{reg}_n + 1
                                                                                                                    + len (m. nots
                                                                = not n
                                not n
                                buf n
                                                                = buf n
                                                                                                                    + len (m. bufs
                                                                                                                    + len (m. and 2s
                                and2 n
                                                                = and 2 n
                                                                                                                    + len (m. or 2s
                                or2 n
                                                                = or2 n
                                                                                                                    + len (m. muxes
                                mux n
                                                                = mux n
                                select n
                                                                = select n
                                                                                                                    + len (m. selects
                                                                                                                    + len (m. shifters
                                shift n
                                                                = shift n
                                                                                                                    + len (m. comparators )
                                comp n
                                                                = comp n
                                xor2 n
                                                                = xor2 n
                                                                                                                    + len(m.xor2s)
                                                                                                                    + len (m. multipliers )
                                mult n
                                                                = mult n
                                                                                                                    + len (m. subtractors )
                                sub n
                                                                = sub n
                                                                = b shift n
                                                                                                                    + len (m. b shifters )
                                b shift n
                                add n
                                                                = add n
                                                                                                                    + len (m. adders
                                shift add n = shift add n
                                                                                                                    + len (m. shift adders)
                                                                                                                    + len (m. divisors
                                div n
                                                                = div n
                                for d in module.dependencies:
                                           m = d.module handle
                                           count gates (m)
1743 #print gate counts
1744 def print gates():
                      print("regs \ t \ t "+ str(reg_n)
                                                                                                               ))
                      print ("muxest t"+ str (mux n
                                                                                                                ))
                      print ("nots t t + str(not_n)
                                                                                                               ))
                      print ("bufs t t"+ str (buf n
                                                                                                               ))
                      print("arithmetic:\t"+str(mult n+sub n+add n+shift add n+
                     div_n))
                      print("logic: t t"+str(and2 n+or2 n+shift n+comp n+xor2 n+
                      b shift n))
                      print ("selects t \in t"+ str (select n
                                                                                                                  ))
                      print()
                      print("Total: t t "+str(reg n+not n+buf n+and2 n+or2 n+mux n)
                     +select n+shift n+comp n+xor2 n+mult n+sub n+b shift n+add n
                     +shift add n+div n))
```

## E Code implemented in Chapter 7

```
1 from liberty.parser import parse liberty
2 #spec = liberty.parser("liberty.parser", )
3 from pathlib import Path
4 import argparse
5 import numpy as np
6 import os, sys
7 import random
8 from datetime import datetime
9 import json
10 import re
11
12 path to libfile = sys.argv[1]
13 \text{ starttime} = \text{datetime.now}()
14 path to calibration file = sys.argv[2]
15 \text{ cells} = 11
16 processed library path = "powerlib.txt"
17
18 input set = {"A1", "I", "S", "I0", "TE", "CI", "CO", "A", "CDN",
       "D", "SDN"
19 output set = { "Z", "ZN", "Q", "QN", "ZN" }
20
21 def set_cells_and_environment(libfile):
       library = parse liberty(open(libfile).read())
22
      #print("done parsing liberty after "+str((datetime.now()-
23
      starttime)/60)+" minutes")
24
                               = str(library.get("voltage unit")
       voltage unit
25
           ).replace("\"", "")
                               = str(library.get("current unit")
26
      current unit
           ).replace("\"", "")
      leakage power unit
                               = str(library.get("
27
      leakage power unit") ).replace("\"", "")
       capacitive load unit = str(library.get("
28
       capacitive load unit")).replace("\"", "")
      #what is unit of power in power templates?
29
30
      #also unit of leakage power
31
```

```
cells = library.get groups("cell")
32
      jsonstring = json.dumps([voltage unit, current unit,
33
      leakage power unit, capacitive load unit])
34
       fp = open(processed library path, "w")
       fp.write(jsonstring+"n")
35
       for cell in cells:
36
37
          dynamic current = cell.get groups("dynamic current")
38
          leakage power = cell.get groups("leakage power")
39
                           = cell.get groups("pin")
           pins
40
          cellName = str(cell.args[0])
41
           occurences in calibration file = count occurence(
42
      cellName)
          leakagePower = str(leakage power[-1].get("value"))
43
44
           footprint = cell.get("cell footprint")
45
46
47
          \# pinLists = []
          input pins =[]
48
          output pins = []
49
          for pin in pins:
50
               \# pinList = []
51
               pinName = pin.args[0]
52
               direction = pin.get("direction")
53
               pinfuction = None
54
               intPwr lists = []
55
               pin_cap = 0
56
               pwrPin = pin.get("related power pin")
57
58
               gndPin = pin.get("related ground pin")
               if (direction == "output"):
59
                   pinfunction = pin.get("function")
60
61
               else:
                   pin cap = pin.get("capacitance")
62
                   #get internal power groups:
63
               internal power groups = pin.get groups("
64
      internal power")
               for intPwr in internal power groups:
65
```

| 66  |             | $related_pin = intPwr.get("related_pin")$        |
|-----|-------------|--------------------------------------------------|
| 67  |             | when $= intPwr.get("when")$                      |
| 68  |             | risePwrGroup = intPwr.get_group("rise_power")    |
| 69  |             | fallPwrGroup = intPwr.get_group("fall_power")    |
| 70  |             |                                                  |
| 71  |             | rise_arg = risePwrGroup.args[0]                  |
| 72  |             | fall_arg = fallPwrGroup.args[0]                  |
| 73  |             |                                                  |
| 74  |             | <pre>if rise_arg != "scalar":</pre>              |
| 75  |             | rise_i1 = risePwrGroup.get_array("index_1")      |
| 76  |             | rise_values = risePwrGroup.get_array("values     |
|     | ")          |                                                  |
| 77  |             | <pre>if fall_arg != "scalar":</pre>              |
| 78  |             | fall_i1 = fallPwrGroup.get_array("               |
|     | index_1")   |                                                  |
| 79  |             | fall_values = fallPwrGroup.get_array("values     |
|     | ")          |                                                  |
| 80  |             | fall_values_i = []                               |
| 81  |             | rise_values_i = []                               |
| 82  |             | $fall_cap = []$                                  |
| 83  |             | rise cap = []                                    |
| 84  |             | powersum_list = []                               |
| 85  |             | if direction == "output":                        |
| 86  |             | if fall_arg != "scalar":                         |
| 87  |             | fall_cap_np = fallPwrGroup.get_array("           |
|     | $index_2")$ |                                                  |
| 88  |             | $fall_cap = fall_cap_np.tolist()[0]$             |
| 89  |             | <pre>for timeIndex in range(0, len(fall_i1</pre> |
|     | [0])):      |                                                  |
| 90  |             | <pre>#print(fall_i1[0][timeIndex])</pre>         |
| 91  |             | if str(fall_i1[0][timeIndex]) == "               |
|     | 0.46 ":     |                                                  |
| 92  |             | $fall_values_i = fall_values[$                   |
|     | timeIndex]  |                                                  |
| 93  |             | break                                            |
| 94  |             |                                                  |
| 95  |             | fall_values_i = fall_values_i.tolist()           |
| 96  |             | if rise_arg != "scalar":                         |
| 97  |             | rise_cap_np = risePwrGroup.get_array("           |
|     | $index_2")$ |                                                  |
| 98  |             | rise_cap = rise_cap_np.tolist()[0]               |
| 99  |             | for timeIndex in range(0, len(rise_i1            |
|     | [0])):      | _                                                |
| 100 |             | <pre>#print(rise_i1[0][timeIndex])</pre>         |
|     |             |                                                  |

|                | <pre>if str(rise_i1[0][timeIndex]) == "</pre>                             |
|----------------|---------------------------------------------------------------------------|
| 0.46":         |                                                                           |
|                | rise_values_i = rise_values[                                              |
| timeIndex]     |                                                                           |
|                | break                                                                     |
|                |                                                                           |
|                | rise_values_i = rise_values_i.tolist()                                    |
|                | if rise_arg != 'scalar' and fall_arg != '                                 |
| scalar ':      |                                                                           |
|                | powersum_list = sum_list(rise_values_i,                                   |
| fall_values_i) |                                                                           |
| /              | <pre>elif rise_arg == 'scalar':</pre>                                     |
|                | $powersum\_list = fall\_values\_i$                                        |
|                | elif fall_arg == 'scalar':                                                |
|                | powersum_list = rise_values_i                                             |
|                | else:                                                                     |
|                | #both scalar                                                              |
|                | $powersum\_list = [float(0)]$                                             |
| eli            | f direction == "input":                                                   |
|                | if rise_arg != "scalar":                                                  |
|                | <pre>for timeIndex in range(0, len(rise_i1</pre>                          |
| [0])):         |                                                                           |
|                | <pre>#print(rise_i1[0][timeIndex])</pre>                                  |
|                | $\frac{1}{\text{if str}(\text{rise}_\text{i1}[0][\text{timeIndex}])} = "$ |
| ).46":         |                                                                           |
|                | rise_values_i = rise_values[0][                                           |
| imeIndex]      |                                                                           |
| 1              | break                                                                     |
|                | rise_values_i = [float(rise_values_i)]                                    |
|                | if fall_arg != "scalar":                                                  |
|                | <pre>for timeIndex in range(0, len(fall_i1</pre>                          |
| [0])):         |                                                                           |
|                | <pre>#print(fall_i1[0][timeIndex])</pre>                                  |
|                | $\inf_{if \text{ str}(fall_i1[0][timeIndex])} = "$                        |
| 0.46":         |                                                                           |
|                | fall_values_i = fall_values[0][                                           |
| timeIndex]     |                                                                           |
| ,              | break                                                                     |
|                | fall_values_i = [float(fall_values_i)]                                    |
|                | if rise_arg != 'scalar' and fall_arg != '                                 |
| scalar ':      |                                                                           |
|                | powersum list = sum list(                                                 |
| rise values i, |                                                                           |
| ,              | elif rise_arg == 'scalar':                                                |
|                | powersum list = fall values i                                             |

E-2

```
elif fall arg == 'scalar':
                                                                           163
134
                            powersum list = rise values i
                        else:
136
                                                                           165
                            #both scalar
                                                                           166
                            powersum list = [float(0)]
                                                                           167
138
                                                                           168
                                                                           169
140
                    related pin = str(related_pin).replace("\"","")
                                                                           170
141
                    when = str(when).replace("\"","")
142
                    #make one for scalar as well so not that many
143
                                                                           172
       empty lists?
                                                                           173
                    if (direction == "output" and (related pin in
144
                                                                           174
       input set)):
                        intPwr list = [related pin, str(when).
145
                                                                           176
       replace (" \ "", ""), [rise cap, powersum list]] #[rise cap,
       rise values i], [fall cap, fall values i]]
                                                                           178
                        intPwr lists.append(intPwr list)
146
                                                                           179
                    #inputs do not have related pins, remove them
147
       from list?
                    elif (direction == "input"):
148
                                                                           182
                        intPwr list = powersum list #[related pin,
149
                                                                           183
       str(when).replace("\"",""), powersum list]#rise values i,
                                                                           184
       fall values i]
                                                                           185
150
                        intPwr lists.append(intPwr list)
                                                                           186
                                                                           187
                if direction == "output": #and (str(pinName) in
                                                                           188
       output set):
                                                                           189
                    output pins.append([str(pinName), str(direction)
153
       .replace("\"",""), str(pinfunction).replace("\"",""), str(
       pwrPin).replace("\"",""), str(gndPin).replace("\"",""),
                                                                           192
       intPwr lists ])
                                                                           193
154
                else:# (str(pinName) in input set):
                                                                           194
                    input_pins.append([str(pinName), str(direction).
                                                                           195
       replace("\"",""), str(pin cap), str(pwrPin).replace("\"","")
                                                                           196
       , str(gndPin).replace("\"",""), intPwr lists ])
                                                                           197
                #print(pinLists)
156
           jsonstring = json.dumps([str(cellName), str(footprint).
                                                                           199
       replace ("\"",""), leakagePower, occurences in calibration file
                                                                           200
       , [input pins, output pins]], separators=(',', ':'))
                                                                           201
           fp.write(jsonstring+"n")
158
                                                                           202
                                                                           203
159
           #make json line with dumps
160
                                                                           204
       fp.close()
161
                                                                           205
       print("done extracting data after "+str((datetime.now()-
162
                                                                           206
       starttime)/60)+" minutes")
                                                                           207
```

```
164 def get cells(filename):
       \#cellLib = cell library()
       cell list = []
       #open file
       #read file line for line
       with open(filename, 'r') as svfile:
           line = svfile.readline()
           linenum = 1
           while line:
               #json loads on line to get all variables
               decoded cell line = json.loads(line)
               cell list.append(decoded cell line)
               line = svfile.readline()
       return cell list
180 \# set cells()
181 def sum list(l1, l2):
       returnlist = []
       if len(11) = len(12):
           for i in range (0, len(l1)):
                returnlist.append(float(l2[i])+float(l1[i]))
       else:
           print("trying to sum lists of different length")
       return returnlist
190 #sort cells in regs, mux and logic?
191 def sort cells (processed library path):
       cells = get cells (processed library path)
       cell environment = cells.pop(0)
       cell list = []
       cellLib = cell \ library()
       for c in cells:
           one cell = cell(c, cellLib)
            cell list.append(one cell)
       cellLib.set group weights()
       #make groups handling select op, add op and comp op
       adder = cell group (['add op'], 'adder')
       cellLib.combination cells.append(adder)
       comp = cell group(['comp_op'], 'comp')
       cellLib.combination cells.append(comp)
```

```
Ψ
ω
```

```
mult = cell_group( ['mult_op'], 'mult')
208
        cellLib.combination_cells.append(mult)
209
       #cellLib.print available cells()
210
        return cellLib
211
213 class cell:
                         = '''
        footprint
214
                        = 11
       name
215
       leakage_power = 0
216
       #synthetic gate list = [] # litst containing equivalent
217
        synthetic gate list
        def init (self, def list, cell lib):
218
            self.synthetic_gate_list
                                          = []
219
            self.footprint
                                          = def list[1]
220
            self.def list
                                          = \ \mathrm{def} \ \mathrm{list}
221
                                          = def list[0]
            self.name
222
                                          = def list[2]
            self.leakage power
223
            self.calibration count
                                          = def list[3]
224
                                          = def_{list}[4]
            self.pin_list
225
                                          = def list[4][0]
            self.input pins
226
                                          = def list[4][1]
            self.output pins
            self.N_inputs
                                          = len(self.input pins)
228
            self.N_outputs
                                          = len(self.output pins)
229
230
           #look through dict and set syn gate sequence and name of
        cell.
           #dict corresponds to # inputs
231
232
           #need to append to correct list in cell lib
233
           #check if mux, check if reg, else, check N inputs
234
            match = False
235
            for key, l in rx_dict_reg_cells.items():
236
                #look for name match among registers
237
                match = l[0].search(self.name)
238
                if match:
239
                     self.synthetic gate list = l[1]
240
                    #look through list in cell lib for cell_group
241
        with matching key,
                    group = cell_lib.find_cell_group(cell_lib.regs,
242
       key)
                     if group == None:
243
                        \# make new cell group
244
                         group = cell group (self.synthetic gate list,
245
        key)
                         #append cell to list in group
246
                         group.append cell(self)
247
```

| #append cell group to list in library                                                 |
|---------------------------------------------------------------------------------------|
| cell_lib.regs.append(group)                                                           |
| else:                                                                                 |
| #else append cell to list in cell_group                                               |
| group.append_cell(self)                                                               |
| break                                                                                 |
| if not match:                                                                         |
| <pre>for key, l in rx_dict_mux_cells.items():</pre>                                   |
| #look for name match among multiplexers                                               |
| match = 1 [0]. search(self.name)                                                      |
| if match:                                                                             |
| self.synthetic_gate_list = l[1]                                                       |
| group = cell lib.find cell group(cell lib.                                            |
| muxes, key)                                                                           |
| if group == None:                                                                     |
| # make new cell group                                                                 |
| group = cell group (self.)                                                            |
| synthetic gate list, key)                                                             |
| #append cell to list in group                                                         |
| group.append cell(self)                                                               |
| #append cell group to list in library                                                 |
| cell lib.muxes.append(group)                                                          |
| else:                                                                                 |
| #else append cell to list in cell group                                               |
| group.append_cell(self)                                                               |
| break                                                                                 |
| if not match:                                                                         |
| for key, l in rx_dict_sel_cells.items():                                              |
| #look for name match among multiplexers                                               |
| match = $1[0]$ . search (self.name)                                                   |
| if match:                                                                             |
|                                                                                       |
| <pre>self.synthetic_gate_list = l[1] group = coll_lib_find_coll_group(coll_lib_</pre> |
| group = cell_lib.find_cell_group(cell_lib.                                            |
| selects, key)<br>if group == None:                                                    |
|                                                                                       |
| # make new cell group                                                                 |
| group = cell_group(self.                                                              |
| synthetic_gate_list, key)                                                             |
| #append cell to list in group                                                         |
| group.append_cell(self)                                                               |
| #append cell group to list in library                                                 |
| cell_lib.selects.append(group)                                                        |
| else:                                                                                 |
| <pre>#else append cell to list in cell_group</pre>                                    |
| group.append_cell(self)                                                               |

249

250

251 252

253

254

255

256

257

258

259

260

261

262

263

264

265

266

267

268

269

270

271

272

273

274

275

276

277

278

279

280

281

282

283

284

285

286

287

288

Ļ

| 289 | break                                                        | 331 |  |  |
|-----|--------------------------------------------------------------|-----|--|--|
| 290 | #first look through mux dict and reg dict for matches,       |     |  |  |
|     | if none:                                                     | 333 |  |  |
| 291 | if not match:                                                | 334 |  |  |
| 292 | dictionary = get_dict_N(self.N_inputs)                       | 335 |  |  |
| 293 | <pre>for key, l in dictionary.items():</pre>                 | 336 |  |  |
| 294 | match = l [0].search(self.name)                              | 337 |  |  |
| 295 | if match:                                                    | 338 |  |  |
| 296 | $\# { m found}$ regex, set list and break                    |     |  |  |
| 297 | $self.synthetic_gate_list = l[1]$                            | 339 |  |  |
| 298 | $l = cell_lib.get_list(self.N_inputs)$                       | 340 |  |  |
| 299 | $group = cell_lib.find_cell_group(l, key)$                   | 341 |  |  |
| 300 | if group == None:                                            | 342 |  |  |
| 301 | # make new cell group                                        | 343 |  |  |
| 302 | $group = cell\_group(self.$                                  | 344 |  |  |
|     | <pre>synthetic_gate_list , key)</pre>                        | 345 |  |  |
| 303 | #append cell to list in group                                | 346 |  |  |
| 304 | group.append_cell(self)                                      | 347 |  |  |
| 305 | #append cell group to list in library                        | 348 |  |  |
| 306 | l.append(group)                                              | 349 |  |  |
| 307 | else:                                                        | 350 |  |  |
| 308 | $\#$ else append cell to list in cell_group                  | 351 |  |  |
| 309 | group.append_cell(self)                                      | 352 |  |  |
| 310 | break                                                        | 353 |  |  |
| 311 |                                                              | 354 |  |  |
| 312 | #group together all cells with equivalent functionality      | 355 |  |  |
| 313 | #if no cell_group matching "matching_key" is found, make new | 356 |  |  |
|     | match group                                                  | 357 |  |  |
| 314 | #put cell groups in library instead of cells                 | 358 |  |  |
| 315 | class cell_group:                                            | 359 |  |  |
| 316 | $matching_key = ''$                                          | 360 |  |  |
| 317 | <pre>definit(self , sequence , matching_key):</pre>          | 361 |  |  |
| 318 | $self.matching_key = matching_key$                           | 362 |  |  |
| 319 | $self.synthetic_gate_list = sequence$                        | 363 |  |  |
| 320 | self.cells = []                                              | 364 |  |  |
| 321 | self.cellcounts = []                                         | 365 |  |  |
| 322 | self.weights = []                                            |     |  |  |
| 323 | <pre>def append_cell(self , cell):</pre>                     | 366 |  |  |
| 324 | $\#N = count_occurence(cell.name)$                           |     |  |  |
| 325 | <pre>self.cellcounts.append(cell.calibration_count)</pre>    | 367 |  |  |
| 326 | self.cells.append(cell)                                      |     |  |  |
| 327 | <pre>#print("Appended cell "+cell.name)</pre>                | 368 |  |  |
| 328 | #print("Occurences: "+str(N))                                |     |  |  |
| 329 | <pre>def get_weights(self):</pre>                            | 369 |  |  |
| 330 | $total\_count = 0$                                           | 370 |  |  |

E-2

```
for i in self.cellcounts:
                total count = total count +i
           for i in range(0, len(self.cellcounts)):
               count = self.cellcounts[i]
                if total_count > 0:
                    self.weights.append(round(count/total_count,2))
                else:
                    self.weights.append(round(1/len(self.cellcounts)
       , 2))
           for c in self.cells:
                print(c.name+", ", end='')
           print()
           print(self.weights)
345 \operatorname{def} \operatorname{get} \operatorname{dict} N(N):
       if N == 1:
           return rx_dict_1_cells
       elif N == 2:
           return rx dict 2 cells
       elif N == 3:
           return rx dict 3 cells
       elif N == 4:
           return rx_dict_4_cells
       elif N = 5:
           return rx_dict_5_cells
       else:
           return rx_dict_6_cells
360 rx_dict_1_cells = \{
       'not' : [re.compile(r"INV"), ['gtech_not']],
362 }
363 rx_dict_2_cells = \{
       'and2' : [re.compile(r"AN2X?D"),
                                              ['gtech and2']],
       'ind2' : [re.compile(r"IND2D"),
                                              ['gtech_not', '
       gtech_and2', 'gtech_not']],
       'nand2' : [re.compile(r"ND2D")],
                                              ['gtech_and2', '
       gtech not']],
       'nor2' : [re.compile(r"^NR2X?D"), ['gtech or2', 'gtech not
       ']],
       'xnor2' : [re.compile(r"XNR2"),
                                              ['gtech xor2', '
       gtech not']],
       'or2' : [re.compile(r"^OR2X?D"), ['gtech or2']],
       'xor2' : [re.compile(r"^XOR2"), ['gtech xor2']],
```

| 371 | <pre>'inor2' : [re.compile(r"INR2X?D"), ['gtech_not','gtech_or2'</pre> | 395  | 'orano                 |
|-----|------------------------------------------------------------------------|------|------------------------|
|     | , 'gtech_not']],                                                       |      | $\operatorname{gtech}$ |
| 372 | 'andor22' : [re.compile(r"AO22D"), ['select_op']], #                   | 396  | 'or4 '                 |
|     | duplicated here to be found when list are short                        |      | gtech                  |
| 373 | }                                                                      | 397  | 'iinoi                 |
| 374 | $rx_dict_3_cells = {$                                                  |      | $\operatorname{gtech}$ |
| 375 | 'and3' : [re.compile(r"AN3X?D"), ['gtech_and2','                       | 398  | 'ando                  |
|     | $gtech_and2']],$                                                       |      | gtech                  |
| 376 | 'nand3' : [re.compile(r"^G?ND3D"), ['gtech_and2','                     | 399  | # ide                  |
|     | $gtech_and2', 'gtech_not']],$                                          |      | gtech_                 |
| 377 | 'inand3' : [re.compile(r"^IND3D"), ['gtech_not', '                     | 400  | # ide                  |
|     | gtech_and2','gtech_and2', 'gtech_not']],                               |      | gtech_                 |
| 378 | 'nor3' : [re.compile(r"^G?NR3"), ['gtech_or2', '                       | 401  |                        |
|     | <pre>gtech_or2','gtech_not']],</pre>                                   | 402  | # ' and                |
| 379 | <pre>'inor3' : [re.compile(r"^INR3"), ['gtech_not', '</pre>            | 403  | 'ando                  |
|     | gtech_or2', 'gtech_or2','gtech_not']],                                 |      | gtech_                 |
| 380 | 'iao21' : [re.compile(r"^IAO21"), ['gtech_or2', '                      | 404  | 'ando                  |
|     | gtech not', 'gtech or2', 'gtech not']],                                |      | gtech                  |
| 381 | 'or3' : [re.compile(r"^OR3X?D"), ['gtech or2', '                       | 405  | 'ind4                  |
|     | gtech_or2']],                                                          |      | gtech                  |
| 382 | $'xor3'$ : [re.compile(r"^XOR3"), ['gtech xor2', '                     | 406  | }                      |
|     | gtech_xor2']],                                                         | 407  | rx_dict_5              |
| 383 | 'andori21' : [re.compile(r"^G?AOI21D"),['gtech_and2', '                | 408  | <br>'and5              |
|     | gtech_or2', 'gtech_not']],                                             |      | gtech                  |
| 384 | 'orand21' : [re.compile(r"OA21"), ['gtech_or2', '                      | 409  | 'nor5                  |
|     | gtech_and2']],                                                         |      | gtech                  |
| 385 | 'xnor3' : [re.compile(r"XNR3"), ['gtech_xor2', '                       | 410  | 'or5 '                 |
|     | gtech_xor2', 'gtech_not']],                                            |      | gtech                  |
| 386 | 'iorand21' : [re.compile(r"IOA21D"), ['gtech and', '                   | 411  | 'xor5                  |
|     | gtech_not', 'gtech_and', 'gtech_not']],                                |      | gtech                  |
| 387 | 'andori222': [re.compile(r"MAOI222"), ['gtech_and2', '                 | 412  | 'xnor                  |
|     | gtech_or2', 'gtech_or2', 'gtech_not']]                                 |      | gtech                  |
| 388 |                                                                        | 413  | 'nand                  |
| 389 | $rx dict 4 cells = {$                                                  |      | gtech_                 |
| 390 | 'and4' : [re.compile(r"AN4X?D"), ['gtech and2','                       | 414  | 'oa22                  |
|     | gtech_and2','gtech_and2']],                                            |      | gtech                  |
| 391 | 'nor4' : [re.compile(r"^NR4"), ['gtech or2','                          | 415  | 'ao22                  |
|     | gtech_or2','gtech_or2','gtech_not']],                                  |      | gtech                  |
| 392 |                                                                        | 416  |                        |
|     | gtech_and2', 'gtech_and2', 'gtech_not']],                              |      | ,<br>rx_dict_6         |
| 393 | 'xnor4' : [re.compile(r"XNR4"), ['gtech_xor2', '                       | 418  | 'and6                  |
| 500 | gtech_xor2', 'gtech_xor2', 'gtech_not']],                              | 110  | gtech                  |
| 394 | 'xor4' : [re.compile(r"^XOR4"), ['gtech_xor2', '                       | 419  | 'nand                  |
| 004 | gtech_xor2', 'gtech_xor2']],                                           | -110 | gtech                  |
|     | Stoon_Abig , Stoon_Abig II,                                            |      | steel                  |

E-6

| 95  | <pre>'orand211' : [re.compile(r"OA211"), ['gtech_or2', '</pre>                                                   |   |
|-----|------------------------------------------------------------------------------------------------------------------|---|
| 96  | <pre>gtech_and2', 'gtech_and2']], 'or4' : [re.compile(r"^OR4X?D"), ['gtech_or2', '</pre>                         |   |
|     | gtech_or2', 'gtech_or2']],                                                                                       |   |
| 97  | <pre>'iinor4' : [re.compile(r"HNR4"), ['gtech_not', '</pre>                                                      |   |
|     | <pre>gtech_or2', 'gtech_or', 'gtech_or', 'gtech_not']],</pre>                                                    |   |
| 98  | <pre>'andor211' : [re.compile(r"AO211D"), ['gtech_and2', ' gtech_or2', 'gtech_or2']],</pre>                      |   |
| 99  | # identical to 21 'iandor22' : [re.compile(r"IAO22D"),                                                           |   |
| 00  | gtech_or2','gtech_not', 'gtech_or2', 'gtech_not']],                                                              |   |
| 00  | # identical to 21 'orandi22' : [re.compile(r"OAI22D"),                                                           | 1 |
| 00  | gtech_and2', 'gtech_not', 'gtech_and2', 'gtech_not']],                                                           |   |
| 01  | 8·····                                                                                                           |   |
| 02  | <pre>#'andor22' : [re.compile(r"AO22D"), ['select_op']],</pre>                                                   |   |
| 03  | 'andori31' : [re.compile(r"AOI31D"), ['gtech and2','                                                             |   |
|     | gtech_and2', 'gtech_or2', 'gtech_not']],                                                                         |   |
| 04  | 'andor31' : [re.compile(r"AO31D"), ['gtech_and2', '                                                              |   |
|     | gtech_and2', 'gtech_or2']],                                                                                      |   |
| 05  | <pre>'ind4' : [re.compile(r"IND4D"), ['gtech_not', '</pre>                                                       |   |
|     | <pre>gtech_and2', 'gtech_and2', 'gtech_not']]</pre>                                                              |   |
| 06  | }                                                                                                                |   |
| 07  | $rx_dict_5_cells = {$                                                                                            |   |
| 08  | 'and5' : [re.compile(r"^G?AN5D"), ['gtech_and2','                                                                |   |
|     | $gtech_and2', 'gtech_and2', 'gtech_and2']],$                                                                     |   |
| 09  | 'nor5' : [re.compile(r"^G?NR5"), ['gtech_or2','                                                                  |   |
|     | <pre>gtech_or2','gtech_or2', 'gtech_or2','gtech_not']],</pre>                                                    |   |
| 10  | 'or5' : [re.compile(r"^OR5"), ['gtech_or2','                                                                     |   |
|     | <pre>gtech_or2', 'gtech_or2', 'gtech_or2']],</pre>                                                               |   |
| 11  | 'xor5' : [re.compile(r"XOR5D"), ['gtech_xor2','                                                                  |   |
|     | gtech_xor2', 'gtech_xor2', 'gtech_xor2']],                                                                       |   |
| 12  | 'xnor5' : [re.compile(r"XNR5D"), ['gtech_xor2','                                                                 |   |
| 10  | <pre>gtech_xor2', 'gtech_xor2', 'gtech_xor2', 'gtech_not']], 'nond5', [ro.compile(r"ND5D")]</pre>                |   |
| 13  | <pre>'nand5' : [re.compile(r"ND5D"), ['gtech_and2',' gtech_and2','gtech_and2', 'gtech_and2','gtech_not']],</pre> |   |
| 14  | 'oa221' : [re.compile(r"OA221"), ['gtech_or2', '                                                                 |   |
| 1.4 | gtech_and2', 'gtech_and2']],                                                                                     |   |
| 15  | 'ao221' : [re.compile(r"AO221"), ['gtech_and2', '                                                                |   |
|     | gtech_or2', 'gtech_or']],                                                                                        |   |
| 16  |                                                                                                                  |   |
|     | $rx$ dict 6 cells = {                                                                                            |   |
| 18  | 'and6' : [re.compile(r"AN6D"), ['gtech_and2','                                                                   |   |
|     | gtech_and2', 'gtech_and2', 'gtech_and2', 'gtech_and2']],                                                         |   |
| 19  | 'nand6' : [re.compile(r"ND6D"), ['gtech_and2','                                                                  |   |
|     | gtech_and2', 'gtech_and2', 'gtech_and2', 'gtech_and2', '                                                         |   |
|     | <pre>gtech_not']],</pre>                                                                                         |   |
|     |                                                                                                                  |   |

11

E E

```
'xnor6'
                   : [re.compile(r"XNR6"), ['gtech xor2','
420
       gtech xor2', 'gtech xor2', 'gtech xor2', 'gtech xor2', '
       gtech not']],
       'nor6 '
                    : [re.compile(r"^NR6"), ['gtech or2', '
421
       gtech or2', 'gtech or2', 'gtech or2', 'gtech or2', 'gtech not'
       ]],
       'or6'
                    : [re.compile(r"^OR6")],
422
                                               ['gtech or2','
       gtech or2', 'gtech or2', 'gtech or2', 'gtech or2']],
                    : [re.compile(r"XOR6"), ['gtech xor2','
       'xor6'
423
       gtech xor2', 'gtech xor2', 'gtech xor2', 'gtech xor2']],
424
425 rx dict reg cells = {
       'reg' : [re.compile(r"DF[C|S|Q|N][N|D|C]"), ['reg']],
426
       'reg' : [re.compile(r"(L[H|N]Q)"), ['reg']]
427
428
429 rx dict mux cells = {
        'mux2n' : [re.compile(r"MUX2N"), ['mux op', 'gtech not']],
430
       'mux2' : [re.compile(r"MUX2"), ['mux op']]
431
432
433 rx dict sel cells = {
       'andor22' : [re.compile(r"AO22D"), ['select op']], #
434
       duplicated here to be found when list are short
435
436
437
   class cell_library:
438
       def init (self):
439
440
           self.cells 6 = []
           self.cells 5 = []
441
           self.cells 4 = []
442
           self.cells 3 = []
443
444
           self.cells 2 = []
           self.cells_1 = []
445
           self.muxes = []
446
           self.regs = []
447
           self.selects = []
448
           self.combination cells = []
449
           self.group lists = [self.cells 6, self.cells 5, self.
450
       cells 4, self.cells 3, self.cells 2, self.cells 1, self.
       muxes, self.regs]
       def get list(self, N):
451
           list dict = \{
452
               1 : self.cells 1,
453
                2 : self.cells 2,
454
                3 : self.cells 3,
455
```

```
4 : self.cells 4,
        5 : self.cells 5,
        6 : self.cells 6
    l = list dict.get(N, lambda: None)
    if 1 == None:
        print("could not get list, no corresponding N")
        return 1
    else:
        return 1
def find cell_group(self, l, group_key):
    for g in 1:
        if g.matching key == group key:
             #found, return group
             return g
    #not found, return none
    #if none make new group outside function
    return None
def set group weights (self):
    for l in self.group lists:
        for g in 1:
             g.get weights()
def print_available_cells(self):
    print("6 input")
    for c in self.cells 6:
        print(" \setminus t" + c. matching key)
    print("5 input")
    for c in self.cells_5:
        print(" \setminus t" + c.matching key)
    print("4 input")
    for c in self.cells 4:
        print(" \setminus t" + c.matching key)
    print("3 input")
    for c in self.cells 3:
        print(" \setminus t" + c.matching key)
    print("2 input")
    for c in self.cells 2:
        print(" \setminus t" + c. matching key)
    print("1 input")
    for c in self.cells 1:
        print(" \setminus t" + c. matching key)
    print("registers")
    for c in self.regs:
        print(" \setminus t" + c.matching key)
    print("muxes")
```

457

458

459

460

461

462

463

464

465

466

467

468

469

470

471

472

473

474

475

476

477

478

479

480

481

482

483

484

485

486

487

488

489

490

491

492

493

494

495

496

497

498

499

500

E-7

```
for c in self.muxes:
501
                print(" \setminus t" + c.matching key)
503
504 # find cell group in list of cell groups
   def find cell(key, l):
505
       for cell group in 1:
            if key == cell group.matching key:
                return cell group
508
       return None
     count occurence of a word in a file
511 #
512 def count occurence(word):
       path = path to calibration file
       fp = open(path, "r")
514
       f = fp.read()
       return f.count(word)
518
519 #make list of synthetic cells into list of cells from cell
       library
520 def transform list (cell lib, to transform):
       if len(to transform) < 6:
           i = len(to transform) + 1 \# 6
       else:
524
           i = 6
       #make list of index structs, sort list, then go through list
        and append
526
       ind = []
       templist = list(to transform)
       #look for regs
       regindexes = find sequence(templist, cell lib.regs[0])
529
530
       if regindexes != []:
           for r in regindexes:
                ind.append(r)
                templist[r[0]] = 0
       selindexes = find_sequence(templist, cell_lib.selects[0])
       if selindexes != []:
            for r in selindexes:
                ind.append(r)
                templist[r[0]] = 0
538
540
       muxindexes = find sequence(templist, cell lib.muxes[0])
541
       #for mux in muxindexes:
542
       if muxindexes != []:
543
```

```
for m in muxindexes:
        ind.append(m)
        templist[m[0]] = 0
l = cell \ lib.combination \ cells
for element in 1:
    indexes = find sequence(templist, element)
    if indexes != []:
            for k in indexes:
                ind.append(k)
            #print("Found "+str(element.synthetic gate list)
+" in "+str(to transform))
            for ii in indexes:
                for r in range (ii [0], ii [1]):
                    #print(r)
                    #templist.pop(r)
                     templist[r] = 0
#look through lists looking for matches to replace sequences
while i != 0:
    l = cell \ lib.get \ list(i)
    #print("called cell lib.get list "+str(i))
    #go through dict with that many inputs:
    #need to relate this list to cell group somehow as well
to have power info available
    for element in 1:
        #print("looking for "+str(element.
synthetic gate list)+" in "+str(to transform))
        indexes = find sequence(templist, element) #need
element as well in list, not only indexes
        if indexes != []:
            for k in indexes:
                ind.append(k)
            #print("Found "+str(element.synthetic gate list)
+" in "+str(to transform))
            for ii in indexes:
                 for r in range (ii [0], ii [1]):
                     templist[r] = 0
    i = i - 1
#replace found indexes when they are found so they cannot be
 found again
ind.sort()
elementlist = []
```

545

546

547

548

554

558

559

560

561

562

563

564

565

566

567

568

569

570

571

573

574

575

578

579

580

581

582

| 583 |                                                               | 593 | $l = element.synthetic_gate_list$      |
|-----|---------------------------------------------------------------|-----|----------------------------------------|
| 584 | for indexelement in ind:                                      | 594 | indexes = []                           |
| 585 |                                                               | 595 | $temp = list(to_find)$                 |
| 586 | elementlist.append(indexelement[2])                           | 596 |                                        |
| 587 |                                                               | 597 | <pre>for i in range(len(temp)):</pre>  |
| 588 | return elementlist                                            | 598 |                                        |
| 589 |                                                               | 599 | if temp[i:i+len(1)] == 1:              |
| 590 | # returns none if indexes not in list or list of (startindex, | 600 | temp $[i:i+len(1)] = [0]*len(1)$       |
|     | stopindex) for each occurence                                 | 601 |                                        |
| 591 | # also returns the index(es) it found                         | 602 | indexes.append((i, i+len(l), element)) |
| 592 | def find_sequence(to_find, element):                          | 603 | return indexes                         |

## F Code implemented in Chapter 8

```
1 import parse elab
2 import liberty_data
3 import sys
 4
5 filename = sys.argv[1]
6
7
 8
9
10 #list of all cells from the cell library
11 cellLib = liberty data.sort cells(liberty data.
       processed library path)
12
13
14 powerStructures = []
15
16 #iterate through structures
17 def go through structures():
       modules, top structures = parse elab.run parse elab(filename
18
       )
19
       for s in top_structures:
20
          p = power structure(s)
21
          p.parent = None
22
23
           p.name = s.represented object handle.name
24
           powerStructures.append(p)
25
           elementlist = liberty data.transform list(cellLib, [s.
26
       represented object handle.id])
          p.cell lib list = elementlist
27
          p.structural\_rep\_list = [s]
28
           srepr = repr(s)
30
           print(s.represented_object_handle.name)
31
           print(srepr)
32
           count powerStructure(p)
33
           lists from top(s,p)
34
```

뉟

```
for p in powerStructures:
35
36
37
           print(p.name)
           \#v = value()
38
           \#p.print(v, 0)
39
           #print()
40
41
           r = repr(p)
42
           print(r)
43
       print powercount()
44
45
46 def print stuff(top structures):
       for s in top structures:
47
48
           print("Top structure: ")
           print(s.represented object handle.name)
49
           print("Children:")
50
           for c in s.children:
51
                print("\t"+c.represented object handle.name)
52
                \#print("\tlvl3: ")
                for gc in c.children:
54
55
                    print("\t\t"+gc.represented_object_handle.name)
                    for ggc in gc.children:
56
                         print(" \setminus t \setminus t = gc.represented object handle
57
       . name)
58
59
60 def lists_from_top(s, power_s):
      #print(s.represented object handle.name)
61
62
       1 = []
63
       \#while s != None:
64
65
       st = None
       for c in s.children:
66
           \#s = c
67
68
           \#1 = []
69
           \#structure_list = []
           \#add s list = []
70
```

```
##goes through structure elements until fanout
71
                                                                            113
           \#st = add structure(l, structure list, c, add s list)
                                                                                        #
72
                                                                            114
           ##print("\t",end='')
73
                                                                            115
                                                                                        #
           ##print(1)
                                                                                        #
74
           \#\!\#\!go through cell lists compare to l
75
                                                                            117
           #elementlist = liberty data.transform list(cellLib, l)
76
                                                                            118
           #make power structure object with s as parent
77
           \#p = power structure(s, structure list, elementlist)
78
           #power s.children.append(p)
                                                                                     etc
79
            if c.powerStructure == None:
80
                                                                            121
                1 = []
81
                structure list = []
82
                                                                            123
                add s list = []
83
84
                #goes through structure elements until fanout
                st = add structure(l, structure list, c, add s list)
85
                                                                            126
                \#print ("\t",end='')
                                                                            127
86
                                                                                        #no child
87
                \#print(1)
                                                                            128
                #go through cell lists compare to l
                                                                                        return None
88
                elementlist = liberty_data.transform_list(cellLib, l)
                                                                                    else:
89
                                                                            130
90
91
                p = power structure(power s)
                                                                            132
                for s in structure list:
                                                                                        return s
92
                    s.powerStructure = p
93
                power s.children.append(p)
94
95
                p.structural rep list = structure list
                p.cell_lib_list = elementlist
                                                                                    objects in list
96
                #count powerStructure(p)
                                                                            137 class power structure:
97
                                                                                    name = 1
98
                \#if output nodes [0] == None or is reg
                                                                            138
                if c.structure connection characteristic != 'control
99
                                                                            139
       ۰.
                                                                            140
                    \#if c.represented object handle.output nodes [0]
100
                                                                            141
       != None:
                                                                            142
                    count_powerStructure(p)
                                                                            143
                for add 1 in add s list:
                                                                                         self.parent
                                                                            144
                    #after fanout, go through children until fanout
                                                                            145
                                                                                    #
                                                                            146
                    #for c in st.children:
104
                                                                            147
                    lists_from_top(add_l, p)
                                                                            148
            else:
106
                                                                            149
                p = c.powerStructure
                power_s.children.append(p)
           #for e in elementlist:
110
                                                                            153
                 print(e.matching key)
           #
                                                                            154
           \#if st != None:
112
```

```
#for add l in add s list:
                #after fanout, go through children until fanout...
                #for c in st.children:
                lists_from_top(add_l, p)
   def add_structure(l, structure_list, s, s_list):
       #look at what rep obj handle is and change it if select, add
       l.append(s.represented object handle.id)
       structure list.append(s)
       #print("\t"+s.represented object handle.name)
       if len(s.children) == 1 and s.children[0] != None:
           #print(s.represented object handle.name)
           add structure (1, structure list, s.children [0], s list)
       elif len(s.children) == 0:
           s list.append(s)
135 #save first elements parent, and last elements children,
136 #envelop with structure having old objects in list and new
       countedBool = False
       def init (self, parent):
           self.structural rep list
                                        = [1]
           self.cell lib list
                                        = []
           self.children
                                        = []
                                        = parent
       #def printstart(self):
            for c in self.cell
       def print(self, depth, startlvl):
           \#print(self.represented object handle.id+", ",end = '')
           #if self.children != []: print("{", end = '')
           \#print("{ ", end = '')
           if self.cell lib list != []:
               print("{", end = '')
               depth.i = depth.i+1
           #if self.children != []: print("{", end='')
```

F-2

```
for c in range(len(self.cell lib list)):
156
                if c+1 = len(self.cell lib list):
                                                                            199
                    print(self.cell lib list[c].matching key+" ",end
158
                                                                            200
        = '')
                                                                            201
                    if self.children != []:
                                                                            202
                         depth.i = depth.i+1
                                                                            203
                         print("{", end = ''})
                                                                            204
                    #else:
                                                                            205
                    #
                          if lastchild: print("", end = '')
                                                                            206
164
                else :
                                                                            207
                    print(self.cell lib list[c].matching key+" {",
165
                                                                            208
       end = '')
                                                                             209
                    depth.i = depth.i +1
166
                                                                            210
           #if self.cell lib list != []:
167
                                                                            211
                 print("), end = '')
           #
168
                                                                            212
                                                                            213
           #elif (len(self.children))
           #if self.children != []:
                                                                            215
                 print("{", end='')
           #
                                                                            216
           #childbracketcount = len()
                                                                             217
174
                                                                             218 nots
            for child in range(len(self.children)):
                                                                            219 logic
                \#print(self.represented object handle.id+", ",end =
176
                                                                             220 mux
       '')
177
                if child+1 = len(self.children):
                                                                             222 comp
                    self.children[child].print(depth, depth.i)
178
                                                                            223 regs
                else:
179
180
                    self.children[child].print(depth, depth.i)
                                                                            225
                #if self.children[child].children == []:
                                                                            226
181
                     print ("}", end = '')
                #
182
                    \#childbracketcount = childbracketcount -1
183
                #else:
184
                     print (", ", end= '')
                #
185
186
           #if lastchild: print("}",end='')
187
           #if self.children == []: print("}",end='')
188
                                                                            227
           #if self.children != []: print("}", end='')
                                                                            228
189
           #if self.cell lib list != [] and self.children == []:
                                                                            229
                 print("), end = '')
           #
                                                                            230
                 depth.i = depth.i -1
           #
                                                                            231
                                                                            232
            while depth.i != startlvl:
194
                                                                            233
                print("), end = '')
195
                                                                            234
                depth.i = depth.i-1
196
                                                                            235
           #if self.children != []: print("}", end='')
197
                                                                            236
```

#if self.cell lib list == []:  $print("\}", end = '')$ # #print( ) #print("}", end = '') #for c in child.cell lib list: print (c. matching key+", ", end = '') # def repr (self, level=0): value = ''for v in self.cell lib list: value = value+v.matching key+" "  $ret = " \setminus t " * level + repr(value) + " \setminus n"$ if level < 11: for child in self.children: ret += child. repr (level+1)return ret 214 class value: i = 0#if self.children == []: print("}", end = '') = 0= 0= 0221 arithm = 0= 0= 0224 def count powerStructure(s):  $notstruct = \{ 'not' \}$ logicstruct = { 'and5', 'nor5', 'or5', 'xor5', 'xnor5', ' nand5', 'oa221', 'ao221', 'andor22', 'andori31', 'andor31', 'ind4', 'and4', 'nor4', 'nand4', 'xnor4', 'xor4', 'orand211', 'or4 ', 'iinor4', 'andor211', 'and3', 'nand3', 'inand3', 'nor3', 'inor3' ', 'iao21', 'or3', 'xor3', 'andori21', 'orand21', 'xnor3', ' iorand21', 'andori222', 'and2', 'ind2', 'nand2', 'nor2', ' xnor2', 'or2' , 'xor2' , 'inor2' }  $regstruct = \{ 'reg' \}$  $muxstruct = \{ 'mux', 'mux2n' \}$ arithmstruct = { 'adder', 'mult'}  $compstruct = \{ comp' \}$ global nots global logic global mux global arithm global comp

global regs

F-3

| 237 | <pre>#print(s.cell_lib_list)</pre>                          | 266 | elif handle.id != 'input' and handle.id != 'output'       |
|-----|-------------------------------------------------------------|-----|-----------------------------------------------------------|
| 238 | $str\_rep\_offset = 0$                                      |     | and structure.structure_connection_characteristic $!=$ '  |
| 239 | strlist2 = []                                               |     | control':                                                 |
| 240 | <pre>removestruct = { 'input', 'output', 'gtech_buf'}</pre> | 267 | if handle.output_nodes[0] != None:                        |
| 241 | if s.countedBool == False:                                  | 268 | if cell.matching_key in notstruct:                        |
| 242 | <pre>for obj in s.structural_rep_list:</pre>                | 269 | nots = nots+1                                             |
| 243 | $h = obj.represented_object_handle$                         | 270 | <pre>print("added not")</pre>                             |
| 244 | if h.id in removestruct:                                    | 271 | <pre>elif cell.matching_key in logicstruct:</pre>         |
| 245 | pass                                                        | 272 | logic = logic+1                                           |
| 246 | else:                                                       | 273 | <pre>print("added logic")</pre>                           |
| 247 | strlist2.append(obj)                                        | 274 | <pre>elif cell.matching_key in muxstruct:</pre>           |
| 248 | s.countedBool = True                                        | 275 | $\max = \max + 1$                                         |
| 249 | <pre>for cellindex in range(0,len(s.cell_lib_list)):</pre>  | 276 | <pre>print("added mux")</pre>                             |
| 250 |                                                             | 277 | elif cell in arithmstruct:                                |
| 251 | cell = s.cell_lib_list[cellindex]                           | 278 | $\operatorname{arithm}$ = $\operatorname{arithm}$ +1      |
| 252 |                                                             | 279 | <pre>print("added arithm")</pre>                          |
| 253 | $structure = strlist2[cellindex+str_rep_offset]$            | 280 | <pre>elif cell.matching_key in compstruct:</pre>          |
| 254 | $handle = structure.represented_object_handle$              | 281 | $\mathrm{comp}\ =\ \mathrm{comp}{+1}$                     |
| 255 |                                                             | 282 | <pre>print("added comp")</pre>                            |
| 256 | $handle = structure.represented_object_handle$              | 283 |                                                           |
| 257 |                                                             | 284 | #need to find a good way to count so not multiplied       |
| 258 | $str\_rep\_offset = str\_rep\_offset + len(cell.$           |     | structure make objects?                                   |
|     | synthetic_gate_list)-1                                      | 285 | <pre>def print_powercount():</pre>                        |
| 259 |                                                             | 286 | $print("nots: \t"+str(nots))$                             |
| 260 | if handle.id $= 'reg'$ :                                    | 287 | print("regs: t"+str(regs))                                |
| 261 |                                                             | 288 | print("logic: t"+str(logic))                              |
| 262 | if handle.has_parent:                                       | 289 | print("mux: t"+str(mux))                                  |
| 263 | if cell.matching_key in regstruct and                       | 290 | $print("arithm: \t "+str(arithm))$                        |
|     | cellindex < 1:                                              | 291 | $print("comp: \times tr(comp)))$                          |
| 264 | m regs =  m regs +1                                         | 292 | $print("total: \t"+str(nots+logic+mux+arithm+comp+regs))$ |
| 265 | <pre>print("added reg")</pre>                               | 293 | go_through_structures()                                   |



