Chapter 5

Chapter 5 Names, Bindings, Type Checking, and Scopes

Contents • Introduction • Names • Variables • The Concept of Binding • Type Checking • Strong Typing • Type Equivalence • Scope • Scope and Lifetime • Referencing Environments • Named Constants

Introduction • Imperative programming languages are abstractions of the von Neumann computer architecture • Memory: stores both instructions and data • Processor: provides operations for modifying the contents of memory • Variables are abstractions of memory cells • Variables are characterized by attributes • To design a type, one must consider scope, lifetime, type checking, initialization, and type compatibility

Names • A name is a string of characters used to identify some entity in a program (e.g., variables, formal parameters, methods). • Design issues: Length, Case Sensitivity, Relation between special characters and reserved words. Length • Meaning: the number of characters in the string. • Types of length: • (1) Limited, such as (a) COBOL: maximum 30 • (b) FORTRAN 90: maximum 31 • (2) Unlimited, such as Ada, C#, and Java

Names: Case Sensitivity • Meaning: whether the language distinguishes between capital and small letters in names. • Types: some languages are • (1) case sensitive, such as C and Java • (2) not case sensitive, such as Ada and Pascal. • Disadvantage of case sensitivity: • readability (names that look alike are different)

Names: Reserved Words and Special Characters • Reserved Word (RW) • Meaning: a reserved word is a special word that cannot be used as a user-defined name. • Example: RW in Java: if, for, while, return, ... • Disadvantage: if the language contains a large number of RWs, many collisions with user-defined names occur (e.g., COBOL has 300 reserved words!) • Special Characters (SC) • Meaning: characters other than letters and digits. • Example: some languages use an SC in the name. • (1) PHP: all variable names must begin with $. • (2) Perl: all variable names begin with special characters, which specify the variable’s type.

Names: Keywords • Keywords (KW) • Meaning: a keyword is a word that is special only in certain contexts. • Example: in Fortran, • Real VarName (Real is followed by a name, so here Real is a keyword denoting a data type). • Real = 3.4 (here Real is a variable).

Variables • Meaning: a variable is a named location in memory that holds a value that can be modified by the program. • Attributes: Name, Address, Value, Type, Lifetime, Scope • Name: not all variables have one. • Example: • The result of applying “my.username” is assigned to “p”. • Code: http://www.www.com/view.php?p=my.username • The result of applying “my.username” is not assigned to any variable. • Code: http://www.www.com/my.username

Variable Attributes: Address and Type. Address • Meaning: the memory address with which the variable is associated • A variable may have different addresses at • different times during execution (e.g., a local variable of a subprogram that is called more than once) • different places in a program. Type • Meaning: determines the range of values of variables and the set of operations that are defined for values of that type. • Example: integer type in Java (ex. int n;) • Range of values: -2,147,483,648 .. 2,147,483,647 • Set of operations: +, -, *, /, %

Variable Attributes: Value • Meaning: the contents of the location with which the variable is associated. • The l-value of a variable is its address • The r-value of a variable is its value

The Concept of Binding • Meaning: a binding is an association between an attribute and an entity (such as data or code with an identifier), or between an operation and a symbol • Binding time is the time at which a binding takes place. Time of Binding • 1. Language design time: bind operator symbols to operations (ex. * is bound to multiplication) • 2. Language implementation time: bind the floating-point type to a representation (ex. in Java, float is represented as 4 bytes according to IEEE-754) • 3. Compile time: bind a variable to a type in C or Java (ex. int n; n is bound to type int at compile time) • 4. Load time: bind a C or C++ static variable to a memory cell (i.e., allocate it in memory) • 5. Run time: bind a non-static local variable to a memory cell (ex. an object in Java; a value is bound to a variable at run time)

Example: • int count; • count = count + 5; • The type of count is bound at compile time. • The set of possible values of count is bound at compiler design time. • The meaning of the operator symbol + is bound at compile time, when the types of its operands have been determined. • The internal representation of the literal 5 is bound at compiler design time. • The value of count is bound at execution time with this statement.

Type of Binding • Static: a binding is static if it first occurs before run time and remains unchanged throughout program execution. • Dynamic: a binding is dynamic if it first occurs during execution or can change during execution of the program.

Type Binding • Before a variable can be referenced in a program, it must be bound to a type. • There are two key issues in binding a type to an identifier: • How is a type specified? • When does the binding take place? • Two kinds: static type binding and dynamic type binding

Static Type Binding • How is a type specified? Through explicit or implicit declaration. • An explicit declaration is a statement in a program that lists variable names and specifies their types. • Ex. int sum = 0; (most languages require explicit declarations) • An implicit declaration is a means of associating variables with types through default conventions rather than declaration statements; the first appearance of the variable name in the program constitutes its implicit declaration. • Ex. Sum = 0; (FORTRAN has implicit declarations) • Advantage: writability • Disadvantage: reliability (less trouble with Perl, where the special character that begins a name specifies its type)

Dynamic Type Binding • A variable is bound to a type when it is assigned a value in an assignment statement (at run time, it takes the type of the value on the right-hand side) • Ex. dynamic type binding in JavaScript: list = [2, 4.33, 6, 8]; list = 17.3; • Advantage: flexibility (generic program units) • Disadvantages: • High cost (dynamic type checking and interpretation); dynamically typed languages are often implemented with interpreters, where the overhead of type checking is not the bottleneck. • Type error detection by the compiler is difficult.
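The JavaScript example on this slide can be made runnable as a minimal sketch. The helper names describeBinding and dynamicTypeDemo are hypothetical, introduced only for illustration; the sketch records which run-time type the name list is bound to after each assignment.

```javascript
// Hypothetical helper: reports the run-time type currently bound to a value.
function describeBinding(value) {
  return Array.isArray(value) ? "array" : typeof value;
}

function dynamicTypeDemo() {
  const seen = [];
  let list = [2, 4.33, 6, 8];       // list is bound to an array here
  seen.push(describeBinding(list));
  list = 17.3;                      // the same name is now bound to a number
  seen.push(describeBinding(list));
  return seen;
}

console.log(dynamicTypeDemo());     // the type names, in assignment order
```

No declaration fixes the type of list, so a compiler could not reject the second assignment; any misuse only shows up when the code runs.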

Storage Bindings and Lifetime • When does the binding take place? (How are memory locations associated with variables?) • Allocation: getting a cell from some pool of available cells, i.e., the process of binding a variable to a memory cell taken from a pool of available memory. • Deallocation: putting a cell back into the pool, i.e., the process of unbinding a variable and placing its memory cell back in the pool of available memory. • The lifetime of a variable is the time during which it is bound to a particular memory cell.

Variable Attributes: Lifetime • Meaning: the lifetime of a variable is the time during which it is bound to a particular memory cell. Storage binding and lifetime • The logical organization of the memory used by a running program has 3 main areas for variables: - the global or static area - the stack (contains "activation records") - the heap

+-----------------+
|   static area   |
+-----------------+
|      stack      |
+-----------------+
|        |        |
|        v        |
|                 |
|        ^        |
|        |        |
+-----------------+
|      heap       |
+-----------------+

Categories of Variables by Lifetimes • static • stack-dynamic • explicit heap-dynamic variables • implicit heap-dynamic variables

Categories of Variables by Lifetimes (1) Static variables: bound to memory cells before execution begins and remain bound to the same memory cells throughout execution. • e.g., C and C++ static variables • Advantages: efficiency (direct addressing); history-sensitive subprogram support • Ex. void f(int x) { static int a = 0; a += 2; printf("%d\n", a); } The first call, f(2), prints 2; the second call, f(4), prints 4, because a keeps its value between calls. • Disadvantage: lack of flexibility (no recursion). • lifetime = entire program execution.

Categories of Variables by Lifetimes (2) Stack-dynamic variables: storage bindings are created for variables when their declaration statements are elaborated, i.e., when the executable code associated with the declaration is executed. • lifetime = while the subprogram is active • Advantages: allows recursion; conserves storage • Disadvantages: • Overhead of allocation and deallocation • Subprograms cannot be history sensitive • Inefficient references (indirect addressing) • Ex. int f(int x) { int a = 0; ... f(y+a); ... }
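A small JavaScript sketch of why per-activation storage allows recursion. The function depthTrace is hypothetical and not part of the slides; JavaScript is used only for consistency with the other examples. Each activation gets its own local a, and that value is still intact after the nested call returns.

```javascript
// Hypothetical recursive function: every activation has a fresh local 'a'.
function depthTrace(n) {
  let a = n * 10;                 // bound to new storage on each call
  if (n === 0) {
    return [a];
  }
  const deeper = depthTrace(n - 1);
  deeper.push(a);                 // this activation's 'a' was not overwritten
  return deeper;
}

console.log(depthTrace(2));       // values of 'a', innermost activation first
```

With a single statically allocated a, the recursive call would overwrite the caller's value, which is the "no recursion" limitation noted for static variables.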

Categories of Variables by Lifetimes (3) Explicit heap-dynamic variables: are nameless (abstract) memory cells that are allocated and deallocated by explicit instructions, specified by the programmer, which take effect during execution. • Referenced only through pointers or references, e.g. dynamic objects in C++ (via new and delete), all objects in Java. • C++:
    int *intnode;        // Create a pointer
    intnode = new int;   // Create the heap-dynamic variable
    . . .
    delete intnode;      // Deallocate the heap-dynamic variable
                         // to which intnode points
• Advantage: high flexibility (provides for dynamic storage management) • Disadvantage: inefficient and unreliable. • lifetime = from explicit allocation to explicit deallocation

Categories of Variables by Lifetimes (4) Implicit heap-dynamic variables: Allocation and deallocation caused by assignment statements. • all strings and arrays in Perl, JavaScript, and PHP • Advantage: highest degree of flexibility (generic code) • Disadvantages: • Inefficient, because all attributes are dynamic • Loss of error detection • lifetime = from implicit allocation to implicit deallocation
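A minimal JavaScript sketch of implicit heap-dynamic behaviour. The function name implicitHeapDemo and the sample values are assumptions made for illustration. Storage for the array and for each string is allocated as a side effect of assignment, and the programmer never frees it explicitly.

```javascript
function implicitHeapDemo() {
  let data = [1, 2, 3];             // array storage is allocated by the assignment
  const firstLength = data.length;
  data = "now a string";            // old array becomes unreachable; new storage is bound
  data = data + " that grew";       // concatenation produces a new, longer string
  return {
    firstLength: firstLength,
    finalType: typeof data,
    finalValue: data,
  };
}

console.log(implicitHeapDemo());
```

The unreachable array is reclaimed by the language implementation at some later point, which is the "implicit deallocation" end of the lifetime.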

Type Checking (section 6.12 from chapter 6) • Generalize the concept of operands and operators to include subprograms and assignments • Type checking is the activity of ensuring that the operands of an operator are of compatible types • A compatible type is one that is either legal for the operator, or is allowed under language rules to be implicitly converted, by compiler-generated code, to a legal type • This automatic conversion is called a coercion. • A type error is the application of an operator to an operand of an inappropriate type

If all type bindings are static, nearly all type checking can be static • If type bindings are dynamic, type checking must be dynamic
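A short JavaScript sketch of coercion and of dynamic type checking. The function names coercionDemo and dynamicTypeError are hypothetical. The first function shows operands being implicitly converted to a type that is legal for the operator; the second shows a type error that is only detected when the offending statement executes.

```javascript
function coercionDemo() {
  return {
    product: "5" * 2,   // the string operand is coerced to a number
    concat: "5" + 2,    // the number operand is coerced to a string
  };
}

function dynamicTypeError() {
  const notAFunction = 42;
  try {
    notAFunction();     // type error: a number cannot be called
    return "no error";
  } catch (err) {
    return err instanceof TypeError ? "TypeError at run time" : "other error";
  }
}

console.log(coercionDemo());
console.log(dynamicTypeError());
```

Because type bindings in JavaScript are dynamic, the bad call is accepted when the program is loaded and fails only when dynamicTypeError actually runs.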

Strong Typing (section 6.13 from chapter 6) • A programming language is strongly typed if type errors are always detected • Advantage of strong typing: allows the detection of the misuses of variables that result in type errors • Language examples: • C and C++ are not: parameter type checking can be avoided; unions are not type checked • Ada is, almost (UNCHECKED_CONVERSION is a loophole) • Java and C# are similar to Ada • ML is strongly typed, even though the types of some function parameters may not be known at compile time. • F# is strongly typed.

Coercion rules strongly affect strong typing--they can weaken it considerably (C++ versus Ada) • Although Java has just half the assignment coercions of C++, its strong typing is still far less effective than that of ML and F#

Variable Attributes: Scope • The scope of a variable is the range of statements in which it is visible. • A variable is visible in a statement if it can be referenced in that statement. • A variable is local in a program unit or block if it is declared there. • The nonlocal variables of a program unit or block are those that are visible within the program unit or block but are not declared there. • The scope rules of a language determine how references to names are associated with variables

Static Scope • Based on program text • To connect a name reference to a variable, you (or the compiler) must find the declaration • Search process: search declarations, first locally, then in increasingly larger enclosing scopes, until one is found for the given name • Enclosing static scopes (to a specific scope) are called its static ancestors; the nearest static ancestor is called a static parent • Some languages allow nested subprogram definitions, which create nested static scopes (e.g., Ada, JavaScript, and PHP)

Static Scope Example • Consider the following JavaScript function, big, in which the two functions sub1 and sub2 are nested: function big() { function sub1() { var x = 7; sub2(); } function sub2() { var y = x; } var x = 3; sub1(); }
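The function big from this slide can be run directly. In the sketch below, the variable observed and the return statement are additions made only so the result can be inspected; everything else follows the slide.

```javascript
function big() {
  let observed;          // added for illustration: captures what sub2 sees
  function sub1() {
    var x = 7;           // local to sub1; hides big's x only inside sub1
    sub2();
  }
  function sub2() {
    var y = x;           // x is not declared in sub2, so its static parent (big) is searched
    observed = y;
  }
  var x = 3;
  sub1();
  return observed;
}

console.log(big());      // the value of x that sub2 resolved
```

JavaScript is statically scoped, so the x in sub2 resolves to the x declared in big, even though sub2 was called from sub1.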

Scope (continued) • Variables can be hidden from a unit by having a "closer" variable with the same name • C++ and Ada allow access to these "hidden" variables • In Ada: unit.name • In C++: class_name::name

Blocks • A method of creating static scopes inside program units--from ALGOL 60 • Example: Consider the following skeletal C function: void sub() { int count; . . . while (. . .) { int count; count++; . . . } . . . }
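A JavaScript analogue of the skeletal C function, as a sketch only. The initial values and the loop counter iterations are assumptions added so the code can run; let gives JavaScript the same block-level scoping the C example relies on.

```javascript
function sub() {
  let count = 10;            // outer count
  let iterations = 0;        // assumed loop control, not in the original skeleton
  while (iterations < 3) {
    let count = 0;           // a new variable that hides the outer count in this block
    count++;                 // affects only the inner count
    iterations++;
  }
  return count;              // the outer count, untouched by the block
}

console.log(sub());
```

Inside the loop body the name count refers to the inner variable; outside it, the outer variable is visible again.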

Evaluation of Static Scoping • Assume MAIN calls A and B; A calls C and D; B calls A and E • [Figures: the structure of the program (A and B nested in MAIN; C and D nested in A; E nested in B) and the corresponding tree structure of the program]

Static Scope Example • [Figures: the potential call graph of the program; the graph of the desirable calls in the program]

Suppose the spec is changed so that D must now access some data in B • Solutions: • Put D in B (but then C can no longer call it and D cannot access A's variables) • Move the data from B that D needs to MAIN (but then all procedures can access them) • Same problem for procedure access • Overall: static scoping often encourages many nonlocals

Dynamic Scope • Based on the calling sequence of subprograms, not on their spatial relationship to each other • References to variables are connected to declarations by searching back through the chain of subprogram calls that forced execution to this point

Scope Example • Big calls Sub1; Sub1 calls Sub2; Sub2 uses X

Big
  - declaration of X
  Sub1
    - declaration of X
    - ...
    call Sub2
    ...
  Sub2
    ...
    - reference to X
    - ...
  ...
  call Sub1
  ...

function big() {
  function sub1() {
    var x = 7;
    sub2();
  }
  function sub2() {
    var y = x;
  }
  var x = 3;
  sub1();
}

Scope Example • Static scoping • Reference to X is to Big's X • Dynamic scoping • Reference to X is to Sub1's X
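JavaScript itself is statically scoped, so the dynamic-scoping result can only be simulated. The sketch below does that with an explicit stack of binding frames; frames, lookup, callWith, and the *Dynamic function names are all hypothetical constructs introduced for illustration, not features of any language implementation.

```javascript
// Hypothetical run-time stack of activation frames, most recent last.
const frames = [];

// Dynamic scoping: search the call chain from the most recent activation backward.
function lookup(name) {
  for (let i = frames.length - 1; i >= 0; i--) {
    if (Object.prototype.hasOwnProperty.call(frames[i], name)) {
      return frames[i][name];
    }
  }
  throw new ReferenceError(name + " is not bound");
}

// Push a frame of locals for the duration of one simulated subprogram call.
function callWith(locals, body) {
  frames.push(locals);
  try {
    return body();
  } finally {
    frames.pop();
  }
}

function sub2Dynamic() { return callWith({}, () => lookup("x")); }
function sub1Dynamic() { return callWith({ x: 7 }, () => sub2Dynamic()); }
function bigDynamic()  { return callWith({ x: 3 }, () => sub1Dynamic()); }

console.log(bigDynamic());   // the x found by searching the call chain
```

The lookup finds Sub1's X first because Sub1 is the most recent active caller that declares X, which is the result stated on the slide for dynamic scoping.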

Evaluation of Dynamic Scoping • Disadvantages: • Subprograms are always executed in the environment of all previously called subprograms that have not yet completed their executions (less reliability) • The inability to type check references to nonlocals statically • Poor readability • Accesses to nonlocal variables in dynamic-scoped languages take far longer than accesses to nonlocals when static scoping is used • Advantages: • No parameters need to be passed from caller to subprogram

Referencing Environments • The referencing environment of a statement is the collection of all variables that are visible in the statement • In a static-scoped language, it is the local variables plus all of the visible variables in all of the enclosing scopes • A subprogram is active if its execution has begun but has not yet terminated • In a dynamic-scoped language, the referencing environment is the local variables plus all visible variables in all active subprograms

Named Constants • A named constant is a variable that is bound to a value only once • Advantages: readability and modifiability • Used to parameterize programs • The binding of values to named constants can be either static (called manifest constants) or dynamic • Languages: • FORTRAN 95: constant-valued expressions • Ada, C++, and Java: expressions of any kind (allow dynamic binding of values to named constants) • C# has two kinds, readonly and const - the values of const named constants are bound at compile time - The values of readonly named constants are dynamically bound
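A JavaScript sketch of a named constant whose value is bound dynamically. The function namedConstantDemo and the name LIMIT are hypothetical. The value is computed at run time, bound once, and any later attempt to rebind it is rejected.

```javascript
function namedConstantDemo(input) {
  const LIMIT = input.length * 2;   // computed at run time, then fixed
  try {
    LIMIT = 0;                      // rebinding a const is not allowed
  } catch (err) {
    return { limit: LIMIT, rejected: err instanceof TypeError };
  }
  return { limit: LIMIT, rejected: false };
}

console.log(namedConstantDemo("abc"));
```

This corresponds to the dynamic case on the slide; a manifest constant would instead be initialized from a literal or constant-valued expression known before run time.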

Variable Initialization • The binding of a variable to a value at the time it is bound to storage is called initialization • If the variable is statically bound to storage, binding and initialization occur before run time. In these cases, the initial value must be specified as a literal or an expression whose only nonliteral operands are named constants that have already been defined. • If the storage binding is dynamic, initialization is also dynamic and the initial values can be any expression. • Initialization is often done on the declaration statement, e.g., in Java int sum = 0;

Summary • Case sensitivity and the relationship of names to special words represent design issues of names • Variables are characterized by a sextuple: name, address, value, type, lifetime, scope • Binding is the association of attributes with program entities • Variables are categorized according to their lifetime as: static, stack-dynamic, explicit heap-dynamic, implicit heap-dynamic • Strong typing means detecting all type errors
