1 / 39

Compiler construction in4020 – lecture 7

Compiler construction in4020 – lecture 7. Koen Langendoen Delft University of Technology The Netherlands. multi-visit grammars. L -attributed grammars. S -attributed grammars. Summary of lecture 6. attribute grammars formal attributes per symbol: inherited & synthesized

gwen
Download Presentation

Compiler construction in4020 – lecture 7

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Compiler constructionin4020 – lecture 7 Koen Langendoen Delft University of Technology The Netherlands

  2. multi-visit grammars L -attributed grammars S -attributed grammars Summary of lecture 6 • attribute grammars • formal attributes per symbol: inherited & synthesized • attribute evaluation rule per production • dependency graphs • evaluation order

  3. Quiz 3.1Which of the following items belong to a non-terminal, and which to a production? (a) inherited attribute (b) synthesized attribute (c) attribute evaluation rule (d) dependency graph (e) visiting routine

  4. program text lexical analysis tokens syntax analysis AST context handling annotated AST Overview • context handling • annotating the AST • attribute grammars • manual methods • symbolic interpretation • data-flow equations

  5. Manual methods for analyzing the AST • preparing the grounds for code generation • constant propagation • last-def analysis (reaching definitions) • live analysis • common subexpression elimination • dead-code elimination • ... • we need flow-of-control information

  6. - last first * * * ‘c’ ‘b’ ‘4’ ‘a’ Threading the AST • determine the control flow graph that records the successor(s) of AST nodes • intermediate code • PUSH b • PUSH b • MUL • PUSH 4 • PUSH a • MUL • PUSH c • MUL • SUB ‘b’

  7. * * ‘c’ ‘4’ ‘a’ Threading the AST • result is a post-order traversal of the AST • global variable: Last node pointer PROCEDURE Thread binary expression (Expr node pointer): Thread expression (Expr node pointer .left operand); Thread expression (Expr node pointer .right operand); // link this node to the dynamically last node SET Last node pointer .successor TO Expr node pointer; // make this node the new dynamically last node SET Last node pointer TO Expr node pointer;

  8. Multiple successors • problem: threading is built around single Last node pointer IF condition THEN ELSE first last

  9. Multiple successors • problem: threading is built around single Last node pointer • solution: introduce join node IF condition THEN ELSE first FI last

  10. Symbolic interpretation • behaviour of code is determined by (the values of) variables • simulate run-time execution at compile time • attach a stack representation to each arrow in the control flow graph • an entry summarizes all compile-time knowledge about the variable/constant

  11. condition THEN ELSE Symbolic interpretation IF FI

  12. y 5 x T THEN y dead code 5 x 5 0 y y 5 5 y x 5 x x 7 5 merge y y y y 5 5 5 5 x x x x Symbolic interpretation IF condition ELSE > x = 7; y 0 FI

  13. y 5 x Exercise (5 min.) • draw the control flow graph for while Cdo Sod • propagate initial stack when C represents y>x and S stands for x:=7

  14. Answers

  15. y 5 WHILE x condition BODY b y 5 x 7 y y y 5 5 5 x x x Answers

  16. Simple symbolic interpretation • used in narrow compiler • simple properties + simple control-flow • example: detecting the use of uninitialized variables • maintain property list • advance list through control flow graph int foo(int n) {int first; while (n-- > 0) { if (glob[n] == KEY) first = n; } return first; }

  17. Trackinguninitialized variables • parameter declaration: • variable declaration: • expression: • assignment: • control statement • fork nodes: • join nodes: add (ID:Initialized) tuple add (ID:Uninitialized) tuple check status of used variables set tuple to (ID:Initialized) int foo(int n) {int first; while (n-- > 0) { if (glob[n] == KEY) first = n; } return first; } copy list merge lists merge( Init, Init) = Init merge( Uninit, Uninit) = Uninit merge( x, x) = Maybe

  18. Simple symbolic interpretation • flow-of-control structures with one entry and one exit point (no GOTOs) • the values of the property are severely constrained (see book) • example: constant propagation

  19. Constant propagation • record the exact value iso Initialized int i = 0; while (condition) { if (i>0) printf("loop reentered\n"); … i++; } // i == 0 // i == 0 // i == 1 // i == {0,1} • simple symbolic interpretation fails

  20. Full symbolic interpretation • maintain a property list for each entry point (label), initially set to empty • traverse flow of control graph L: merge current-list into list-L continue with list-L jump L; merge current-list into list-L continue with the empty list • repeat until nothing changed guarantee termination – select suitable property

  21. // i == 0 // i == ANY // i == 0 // i == ANY Constant propagation • full symbolic interpretation • property: unknown < value < any int i = 0; while (condition) { if (i>0) printf("loop reentered\n"); … i++; } int i = 0; L: if (condition) { if (i>0) printf("loop reentered\n"); … i++; goto L; } // i == 0 // i == ANY // i == 1 // i == ANY // empty

  22. Break

  23. IN n = n-1; OUT Data flow equations • “automated” full symbolic interpretation • stack replaced by collection of sets • IN(N) • OUT(N) • semantics are expressed by constant sets • KILL(N) • GEN(N) • equations • IN(N) = M  predecessor[N] OUT(M) • OUT(N) = IN(N) \ KILL(N)  GEN(N)

  24. Data flow equations • solved through iteration (closure algorithm) • data-flow nodes may not change the stack • individual statements • basic blocks • example: tracking uninitialized variables • properties: I x, M x, U x • GEN(x = expr ;) = • KILL(x = expr ;) = int foo(int n) {int first; while (n-- > 0) { if (glob[n] == KEY) first = n; } return first; } int foo(int n) {int first; L: n = n-1; if (n >= 0) { if (glob[n] == KEY) first = n; goto L; } return first; } {I x} {U x, M x}

  25. {I x} {U x, M x} Data flow equations • solved through iteration (closure algorithm) • data-flow nodes may not change the stack • individual statements • basic blocks • example: tracking uninitialized variables • properties: I x, M x, U x • GEN(x = expr ;) = • KILL(x = expr ;) = 1 L: n = n-1; 2 if (n >= 0) F T 3 if (glob[n] == KEY) T F 4 first = n; 5 goto L; 6 return first;

  26. {} {} 0 1 int foo(int n) { int first; L: n = n-1; F {} T 2 if (n >= 0) T {} F 3 if (glob[n] == KEY) {} 4 first = n; 5 goto L; {} 6 return first; Iterative solution {I n, U f} • initialization: set all sets to empty • iterate from top to bottom {I n, U f} {I n, M f} {I n, U f} {I n, M f} {I n, U f} IN(N) = M  predecessor[N] OUT(M) OUT(N) = IN(N) \ KILL(N)  GEN(N) {I n, I f} {I n, M f}

  27. Iterative solution

  28. Efficient data-flow equations • limit to (set of) on/off properties • set union = bitwise OR • set difference = bitwise AND NOT • combine statements into basic blocks • KILL[S1;S2] = KILL[S1]  KILL[S2] • GEN[S1;S2] = GEN[S2]  (GEN[S1] \ KILL[S2]) • sort data-flow graph • depth-first traversal (break cycles!)

  29. Live analysis • a variable is live at node N if the value it holds is used on some path further down the control-flow graph; otherwise it is dead • useful information for register allocation • information must flow “backwards” (up) through the control-flow graph • difficult for symbolic interpretation • easy for data flow equations

  30. Solving data-flow equations • forwards • backwards IN(N) = M  predecessor[N] OUT(M) OUT(N) = IN(N) \ KILL(N)  GEN(N) IN if (cond) OUT OUT(N) = M  successor[N] IN(M) IN(N) = OUT(N) \ KILL(N)  GEN(N)

  31. Exercise (5 min.) • determine the KILL and GEN sets for the property “V is live here” for the following statements

  32. Answers • determine the KILL and GEN sets for the property “V is live here” for the following statements {v} {a,b} \ KILLS[S] {v} {i} \ KILLS[S] {} {i,v} {} {a1, ... ,an} {v} {a1, ... ,an} \ KILLS[S] {} {a,b} {} {} {} {}

  33. Exercise (7 min.) • draw the control-flow graph for the following code fragment • perform backwards live analysis using the KILL and GEN sets from the previous exercise double average(int n, double v[]) {int i; double sum = 0.0; for (i=0; i<n; i++) { sum += v[i]; } return sum/n; }

  34. Answers OUT(N) = M  successor[N] IN(M) IN(N) = OUT(N) \ KILL(N)  GEN(N) 1 2 3 4

  35. Answers OUT(N) = M  successor[N] IN(M) IN(N) = OUT(N) \ KILL(N)  GEN(N) 1 sum = 0.0; i = 0; if (i<n) 2 3 sum += v[i]; i++; 4 return sum/n;

  36. live: n live: n, i, sum live: n, i, sum live: n, i, sum live: n, sum Answers OUT(N) = M  successor[N] IN(M) IN(N) = OUT(N) \ KILL(N)  GEN(N) 1 sum = 0.0; i = 0; if (i<n) 2 3 sum += v[i]; i++; 4 return sum/n;

  37. Answers process nodes top to bottom (from 0 to 4) processing nodes in reverse (from 4 to 0) requires only two iterations

  38. Summary manual methods for annotating the AST • threading • symbolic interpretation • data-flow equations

  39. Homework • study sections: • 3.2.4 interprocedural data-flow analysis • 3.2.5.1 live analysis by symbolic interpretation • assignment 1: • replace yacc with LLgen • deadline April 9 08:59 • print handout for next week [blackboard]

More Related