0% found this document useful (0 votes)

30 views14 pages

Compiler 8 (Intermediate Code Generation)

The document discusses intermediate code generation in compilers, explaining its role in translating source programs into machine-readable code while maintaining a common analysis phase across different machines. It covers concepts such as Directed Acyclic Graphs (DAG), three-address code, and various forms of representation like quadruples and triples. Additionally, it highlights the importance of Boolean expressions in controlling program flow and their translation in programming languages.

Uploaded by

Marzia Borno

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views14 pages

Compiler 8 (Intermediate Code Generation)

Uploaded by

Marzia Borno

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

12/5/2020

Intermediate Code
Generation

Zakia Zinat Choudhury

Lecturer
Department of Computer Science & Engineering
University of Rajshahi

Introduction

Figure: Logical structure of a compiler front end

In the analysis-synthesis model of a compiler, the front end of a compiler translates a source
program into an independent intermediate code, then the back end of the compiler uses this
intermediate code to generate the target code (which can be understood by the machine).

1
12/5/2020

Why Intermediate Code Generator is Used to

Translate Target Code?

❑ If a compiler translates the source language to its target machine language without having the
option for generating intermediate code, then for each new machine, a full native compiler is
required.
❑ Intermediate code eliminates the need of a new full compiler for every unique machine by keeping
the analysis portion same for all the compilers.
❑ The second part of compiler, synthesis, is changed according to the target machine.
❑ It becomes easier to apply the source code modifications to improve code performance by applying
code optimization techniques on the intermediate code.

Directed Acyclic Graph

A directed acyclic graph (hereafter called a DAG) for an expression identifies the common
subexpressions (subexpressions that occur more than once) of the expression.

Directed Acyclic Graph (DAG) is a tool that depicts the structure of basic blocks, helps to
see the flow of values flowing among the basic blocks, and offers optimization too. DAG
provides easy transformation on basic blocks.

DAG can be understood here:

❑ Leaf nodes represent identifiers, names or constants.
❑ Interior nodes represent operators.
❑ Interior nodes also represent the results of expressions or the identifiers/name where
the values are to be stored or assigned.

2
12/5/2020

Example of Directed Acyclic Graph

The constructed DAG for the Expression

Three-Address Code
A statement involving no more than three references(two for operands and one for result) is known as
three address statement. A sequence of three address statements is known as three address code. In
three-address code, there is at most one operator on the right side of an instruction; that is, no built-up
arithmetic expressions are permitted. Thus, a source-language expression like x+y*z might be
translated into the sequence of three-address instructions

where t1 and t2 are compiler-generated temporary names.

A three-address code has at most three address locations to calculate the expression. Three-address
code is built from two concepts: addresses and instructions. In object-oriented terms, these concepts
correspond to classes, and the various kinds of addresses and instructions correspond to appropriate
subclasses.

3
12/5/2020

Addresses

An address can be one of the following:

• A name: For convenience, we allow source-program names to appear as addresses in
three-address code. In an implementation, a source name is replaced by a pointer to its
symbol-table entry, where all information about the name is kept.
• A constant: In practice, a compiler must deal with many different types of constants and
variables.
• A compiler-generated temporary: It is useful, especially in optimizing compilers, to
create a distinct name each time a temporary is needed. These temporaries can be
combined, if possible, when registers are allocated to variables.

Instructions
Here is a list of the common three-address instruction forms:
1. Assignment instructions of the form x = y op z, where op is a binary arithmetic or
logical operation, and x, y, and z are addresses.
2. Assignments of the form x = op y, where op is a unary operation. Essential unary
operations include unary minus, logical negation, shift operators, and conversion
operators that, for example, convert an integer to a floating-point number.
3. Copy instructions of the form x = y, where x is assigned the value of y.
4. An unconditional jump goto L. The three-address instruction with label L is the next to
be executed.
5. Conditional jumps of the form if x goto L and ifFalse x goto L. These instructions
execute the instruction with label L next if x is true and false, respectively. Otherwise,
the following three-address instruction in sequence is executed next, as usual.

4
12/5/2020

Instructions
6. Conditional jumps such as if x relop y goto L, which apply a relational operator (<,
==, >=, etc.) to x and y, and execute the instruction with label L next if x stands in
relation relop to y. If not, the three-address instruction following if x relop y goto L is
executed next, in sequence.
7. Procedure calls and returns are implemented using the following instructions: param
x for parameters; callp, n and y = callp, n for procedure and function calls, respectively;
and return y, where y, representing a returned value, is optional. Their typical use is as
the sequence of three address instructions

generated as part of a call of the procedure p(xl,x2,. . . ,x,). The integer n, indicating the
number of actual parameters in "call p, n," is not redundant because calls can be nested.

Instructions
8. Indexed copy instructions of the form x = y [i] and x [i] = y. The instruction x = y[i]
sets x to the value in the location i memory units beyond location y . The instruction x[i]
= y sets the contents of the location I units beyond x to the value of y.
9. Address and pointer assignments of the form x = & y, x = * y, and * x = y. The
instruction x = & y sets the r-value of x to be the location (l-value) of y2. Presumably, y
is a name, perhaps a temporary, that denotes an expression with an expression with an l-
vale such as A [i] [j] , and x is a pointer name or temporary. In the instruction x = * y,
presumably y is a pointer or a temporary whose r-value is a location. The r-value of x is
made equal to the contents of that location. Finally, * x = y sets the r-value of the object
pointed to by x to the r-value of y.

5
12/5/2020

Format of Three-Address Code

A three-address code can be represented in two forms :
quadruples and triples.

Quadruples
A quadruple (or just "quad') has four fields, which we call op, arg1, arg2, and result. The op
field contains an internal code for the operator.
For instance, the three-address instruction x = y +z is represented by placing + in op, y in
arg1, z in arg2, and x in result.

The following are some exceptions to this rule:

1. Instructions with unary operators like x = minus y or x = y do not use arg2.
2. Operators like param use neither arg2 nor result.
3. Conditional and unconditional jumps put the target label in result.

6
12/5/2020

Example of Quadruples

Triples
A triple has only three fields, which we call op, arg1, and arg2. The results of respective sub-
expressions are denoted by the position of expression. Triples represent similarity with DAG
and syntax tree. They are equivalent to DAG while representing expressions Note that the
result field in is used primarily for temporary names. Using triples, we refer to the result of
an operation x op y by its position, rather than by an explicit temporary name.

7
12/5/2020

Example of Triples

Indirect Triples
This representation is an enhancement over triples representation. It uses pointers instead of
position to store results. This enables the optimizers to freely re-position the sub-expression
to produce an optimized code.
With indirect triples, an optimizing compiler can move an instruction by reordering the
instruction list, without affecting the triples themselves.

8
12/5/2020

Example of Indirect Triples

Types and Declarations

Read from the Textbook
Page 371-373

9
12/5/2020

Difference between syntax tree and Parse tree

Parse Tree Syntax Tree

Interior nodes are non- Interior nodes are

terminals, leaves are “operators”, leaves are
terminals. operands.
Rarely constructed as a When representing a
data structure. program in a tree structure
usually use a syntax tree.
Represents the concrete Represents the abstract
syntax of a program. syntax of a program (the
semantics).

Notation
The way to write arithmetic expression is known as a notation.
An arithmetic expression can be written in three different but equivalent notations, i.e.,
without changing the essence or output of an expression.
These notations are:
• Infix Notation
• Polish (Prefix) Notation
• Reverse-Polish (Postfix) Notation

10
12/5/2020

Infix Notation
The expression writes in infix notation where operators are used in-between operands.
It is easy for us humans to read, write, and speak in infix notation but the same does not go
well with computing devices. An algorithm to process infix notation could be difficult and
costly in terms of time and space consumption.
Example:

Infix notation with parenthesis: (3 + 2) * (5 – 1)

Polish Notation
Polish notation is a notation form for expressing arithmetic, logic and algebraic equations.
Its most basic distinguishing feature is that operators are placed on the left or written
ahead of their operands. If the operator has a defined fixed number of operands, the syntax
does not require brackets or parenthesis to lessen ambiguity.
Example:

Infix notation with parenthesis: (3 + 2) * (5 – 1)

Polish notation: * + 3 2 – 5 1

Polish notation is also known as prefix notation.

11
12/5/2020

Reserved Polish Notation

Reserved Polish notation most basic distinguishing feature is that operators are placed on
the right or written after of their operands.
The idea is simply to have a parenthesis-free notation that makes each equation shorter and
easier to parse in terms of defining the evaluation priority of the operators.
Example:

Infix notation with parenthesis: (3 + 2) * (5 – 1)

Reserved Polish notation: 3 2+ 5 1-*

Reserved Polish notation is also known as postfix notation.

Boolean Expression
Boolean expressions are composed of the Boolean operators (which we denote &&, ||, and !, using
the C convention for the operators AND, OR, and NOT, respectively) applied to elements that are
Boolean variables or relational expressions. Relational expressions are of the form El re1 E2, where
El and E2 are arithmetic expressions. In this section, we consider Boolean expressions generated by
the following grammar:

We use the attribute rel.op to indicate which of the six comparison operators <, <=, =, !=, >, or >= is
represented by rel. As is customary, we assume that || and && are left-associative, and that || has
lowest precedence, then &&, then !.

Given the expression B1 || B2, if we determine that B1 is true, then we can conclude that the entire
expression is true without having to evaluate B2. Similarly, given B1&&B2, if B1 is false, then the
entire expression is false.

12
12/5/2020

Why Boolean Expression is Used?

The translation of statements such as if-else-statements and while-statements is tied to the translation
of Boolean expressions. In programming languages, Boolean expressions are often used to

1. Alter the flow of control: Boolean expressions are used as conditional expressions in statements
that alter the flow of control. The value of such Boolean expressions is implicit in a position
reached in a program. For example, in if (E) S, the expression E must be true if statement S is
reached.

2. Compute logical values: A Boolean expression can represent true or false as values. Such
Boolean expressions can be evaluated in analogy to arithmetic expressions using three-address
instructions with logical operators.

Control-Flow Translation of Boolean Expression

Read from the Textbook
Page 403-405

13
12/5/2020

1. Construct the DAG for the expression:

((x + y)-((x + y)*(x - y)))+((x + y)*(x - y))

2. Translate the arithmetic expression a = b * c – b * d

Assignments into:
a) A syntax tree
b) Quadruples
c) Triples
d) Indirect Triples

Thank You

Chapter 6
No ratings yet
Chapter 6
28 pages
Intermediate Code Generation: CD: Compiler Design
No ratings yet
Intermediate Code Generation: CD: Compiler Design
41 pages
CSE-303 Chapter-06 Final
No ratings yet
CSE-303 Chapter-06 Final
97 pages
Mod 4
No ratings yet
Mod 4
39 pages
CD Unit3
No ratings yet
CD Unit3
17 pages
CS 346: Intermediate Code Generation: Resource
No ratings yet
CS 346: Intermediate Code Generation: Resource
60 pages
Intermediate Code Generation Lecture
No ratings yet
Intermediate Code Generation Lecture
29 pages
Unit-Iii
No ratings yet
Unit-Iii
19 pages
18 Unit-4
No ratings yet
18 Unit-4
16 pages
Compiler Intermediate Code Basics
No ratings yet
Compiler Intermediate Code Basics
62 pages
Unit-Iii: Intermediate Code Generation
No ratings yet
Unit-Iii: Intermediate Code Generation
47 pages
Unit 4
No ratings yet
Unit 4
51 pages
Intermediate Code for Compilers
No ratings yet
Intermediate Code for Compilers
38 pages
CD Module 8 Print
No ratings yet
CD Module 8 Print
58 pages
3 Intermediate Code Generation
No ratings yet
3 Intermediate Code Generation
20 pages
Compiler Design Lec-Six Intermediate Languages
No ratings yet
Compiler Design Lec-Six Intermediate Languages
21 pages
Lec05 Intermediate Code Generation
No ratings yet
Lec05 Intermediate Code Generation
40 pages
CD Unit-Iii
No ratings yet
CD Unit-Iii
20 pages
Cs 3007 Inter Code Gen
No ratings yet
Cs 3007 Inter Code Gen
42 pages
Compiler Design for B.Tech Students
No ratings yet
Compiler Design for B.Tech Students
21 pages
Lect 7
No ratings yet
Lect 7
8 pages
UNIT-4 Notes
No ratings yet
UNIT-4 Notes
27 pages
006chapter 6 - Intermediate Code Generation
No ratings yet
006chapter 6 - Intermediate Code Generation
23 pages
BCS 324 Topic 5
No ratings yet
BCS 324 Topic 5
35 pages
4th Phase New Intermediate Code Generator
No ratings yet
4th Phase New Intermediate Code Generator
24 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
23 pages
CD - CH5 - Intermediate Code Generation
No ratings yet
CD - CH5 - Intermediate Code Generation
54 pages
Lecture Notes Compiler Design Chapter-6
No ratings yet
Lecture Notes Compiler Design Chapter-6
55 pages
Compiler Intermediate Code Guide
No ratings yet
Compiler Intermediate Code Guide
26 pages
Unit-Iv: Intermediate Code Generation
No ratings yet
Unit-Iv: Intermediate Code Generation
19 pages
Intermediate Code Generation Guide
No ratings yet
Intermediate Code Generation Guide
5 pages
Intermediate Code in Compiler Construction
No ratings yet
Intermediate Code in Compiler Construction
34 pages
Compiler Design Chapter-6
No ratings yet
Compiler Design Chapter-6
83 pages
1 Unit 4 Complete
No ratings yet
1 Unit 4 Complete
92 pages
Intermediate Code Generation Guide
No ratings yet
Intermediate Code Generation Guide
27 pages
Intermediate Code Generation Guide
No ratings yet
Intermediate Code Generation Guide
47 pages
FALLSEM2023-24 BCSE307L TH VL2023240100900 2023-06-14 Reference-Material-I
No ratings yet
FALLSEM2023-24 BCSE307L TH VL2023240100900 2023-06-14 Reference-Material-I
33 pages
CD Unit 5
No ratings yet
CD Unit 5
49 pages
TSR - Class Cd-Unit 3
No ratings yet
TSR - Class Cd-Unit 3
111 pages
Intermediate Code Generation
No ratings yet
Intermediate Code Generation
13 pages
CS6109 Module 8
No ratings yet
CS6109 Module 8
42 pages
Lecture 08
No ratings yet
Lecture 08
36 pages
Module 4 - Intermediate Code Generation
No ratings yet
Module 4 - Intermediate Code Generation
61 pages
24-Module 4 - Variants of Syntax Trees - Three Address Code-10!09!2024
100% (1)
24-Module 4 - Variants of Syntax Trees - Three Address Code-10!09!2024
44 pages
1 Intermediate Code Generation VNM
No ratings yet
1 Intermediate Code Generation VNM
17 pages
UNIT-3 Odg
No ratings yet
UNIT-3 Odg
17 pages
Compiler Engineering
No ratings yet
Compiler Engineering
24 pages
CH-6 Intermediate Code Generator
No ratings yet
CH-6 Intermediate Code Generator
54 pages
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
No ratings yet
Lecture On Compiler Design: Chapter 8: Intermediate Code Generation
11 pages
Compiler - Three Addr Codes
No ratings yet
Compiler - Three Addr Codes
33 pages
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Compiler Design - WWW - Rgpvnotes.in
23 pages
Intermediate
No ratings yet
Intermediate
29 pages
Compiler Design Unit 3
No ratings yet
Compiler Design Unit 3
14 pages
Intermediate Code Generation Guide
100% (1)
Intermediate Code Generation Guide
14 pages
Module 5 Chapter 6 ICG
No ratings yet
Module 5 Chapter 6 ICG
44 pages
INTERMEDIATE CODE GENERATION & RUNTIME ENVIRNOMENTS
No ratings yet
INTERMEDIATE CODE GENERATION & RUNTIME ENVIRNOMENTS
35 pages
Three Address Code
No ratings yet
Three Address Code
4 pages
Intermediate Code Generator-20241219073843
No ratings yet
Intermediate Code Generator-20241219073843
40 pages
Computer Graphics
No ratings yet
Computer Graphics
81 pages
Practicle - Class 9-10 Biology
No ratings yet
Practicle - Class 9-10 Biology
28 pages
Distributeddatabase
No ratings yet
Distributeddatabase
27 pages
Compiler 7 (Semantic Analysis & Symbol Table)
No ratings yet
Compiler 7 (Semantic Analysis & Symbol Table)
6 pages
A Level Mathematics Paper 1 Equations Involving Indices Logarithms & Others
No ratings yet
A Level Mathematics Paper 1 Equations Involving Indices Logarithms & Others
10 pages
Aci sp-226-2005
No ratings yet
Aci sp-226-2005
158 pages
Benazepril
No ratings yet
Benazepril
2 pages
Residential Tenancy Agreement Guide
No ratings yet
Residential Tenancy Agreement Guide
6 pages
TOC1
No ratings yet
TOC1
19 pages
Marketing Info Systems Guide
100% (1)
Marketing Info Systems Guide
13 pages
Chapter Two Brain and Language
No ratings yet
Chapter Two Brain and Language
6 pages
Arson Investigation
No ratings yet
Arson Investigation
36 pages
Business Trust & Communication
No ratings yet
Business Trust & Communication
20 pages
Ujian Bulan Mac: Bahasa Inggeris Kertas 1 Tahun 4
No ratings yet
Ujian Bulan Mac: Bahasa Inggeris Kertas 1 Tahun 4
9 pages
Indian Railways: Challenges & Services
No ratings yet
Indian Railways: Challenges & Services
18 pages
Calculus in Basketball 1
100% (1)
Calculus in Basketball 1
3 pages
U600 ULTRASONIC SCALER - Woodpecker Medical
No ratings yet
U600 ULTRASONIC SCALER - Woodpecker Medical
18 pages
17MU5A0305 Project Report
No ratings yet
17MU5A0305 Project Report
107 pages
Aqa 2011 Past Paper
No ratings yet
Aqa 2011 Past Paper
14 pages
Inspection Punch List
No ratings yet
Inspection Punch List
2 pages
Ranjan Kumar Jaiswal Kotak Mahindra Bank
No ratings yet
Ranjan Kumar Jaiswal Kotak Mahindra Bank
6 pages
Disassembly and Assembly: Automatic Transmission
No ratings yet
Disassembly and Assembly: Automatic Transmission
1 page
Ciena Sample Aptitude Placement Paper Level1
No ratings yet
Ciena Sample Aptitude Placement Paper Level1
7 pages
Magnesium Alloys Containing Rare Earth Metals Structure and Properties 1st Edition L.L. Rokhlin (Author) Download PDF
100% (13)
Magnesium Alloys Containing Rare Earth Metals Structure and Properties 1st Edition L.L. Rokhlin (Author) Download PDF
84 pages
Solving Problems Involving Loans
No ratings yet
Solving Problems Involving Loans
13 pages
Skytrain Avia Services: Emergency Procedures Manual
No ratings yet
Skytrain Avia Services: Emergency Procedures Manual
32 pages
Ongoing Regular Recruit Intake Applications
No ratings yet
Ongoing Regular Recruit Intake Applications
3 pages
Insider Trading: India vs USA
100% (1)
Insider Trading: India vs USA
13 pages
Sri Jayacharajendra Polytechnic Internship Report
No ratings yet
Sri Jayacharajendra Polytechnic Internship Report
12 pages
Barun Yadav
No ratings yet
Barun Yadav
1 page
02 Second Law
No ratings yet
02 Second Law
61 pages
InfoSec Policy for Company LLC Staff
No ratings yet
InfoSec Policy for Company LLC Staff
5 pages
Winter 23 22619
100% (1)
Winter 23 22619
18 pages
5 Amazing Tech Projects To DO
No ratings yet
5 Amazing Tech Projects To DO
1 page

Compiler 8 (Intermediate Code Generation)

Uploaded by

Compiler 8 (Intermediate Code Generation)

Uploaded by

12/5/2020

Zakia Zinat Choudhury

Figure: Logical structure of a compiler front end

Why Intermediate Code Generator is Used to

Directed Acyclic Graph

DAG can be understood here:

Example of Directed Acyclic Graph

The constructed DAG for the Expression

where t1 and t2 are compiler-generated temporary names.

An address can be one of the following:

Format of Three-Address Code

The following are some exceptions to this rule:

Example of Indirect Triples

Types and Declarations

Difference between syntax tree and Parse tree

Parse Tree Syntax Tree

Interior nodes are non- Interior nodes are

Infix notation with parenthesis: (3 + 2) * (5 – 1)

Infix notation with parenthesis: (3 + 2) * (5 – 1)

Polish notation is also known as prefix notation.

Reserved Polish Notation

Infix notation with parenthesis: (3 + 2) * (5 – 1)

Reserved Polish notation is also known as postfix notation.

Why Boolean Expression is Used?

Control-Flow Translation of Boolean Expression

1. Construct the DAG for the expression:

2. Translate the arithmetic expression a = b * c – b * d

You might also like