1 / 34

Data Abstraction and Encapsulation in Programming

This chapter explores the concepts of data abstraction and encapsulation in programming, discussing the benefits and requirements of these constructs. Examples and evaluations of different programming languages are provided.

kstuart
Download Presentation

Data Abstraction and Encapsulation in Programming

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 11 Abstract Data Types and Encapsulation Constructs

  2. Abstraction - The concept of abstraction is fundamental in programming - Nearly all programming languages support process abstraction with subprograms - Nearly all programming languages designed since 1980 have supported data abstraction with some kind of module Encapsulation - Original motivation: - Large programs have two special needs: 1. Some means of organization, other than simply division into subprograms 2. Some means of partial compilation (compilation units that are smaller than the whole program) - Obvious solution: a grouping of subprograms that are logically related into a unit that can be separately compiled - These are called encapsulations

  3. Examples of Encapsulation Mechanisms 1. Pascal - Nested subprograms 2. C - Files containing one or more subprograms can be independently compiled 3. C++ (and other contemporary languages) - separately compilable modules Def: An abstract data type is a user-defined data type that satisfies the following two conditions: 1. The representation of and operations on objects of the type are defined in a single syntactic unit; also, other units can create objects of the type. 2. The representation of objects of the type is hidden from the program units that use these objects, so the only operations possible are those provided in the type's definition.

  4. Advantage of Restriction 1: - Program organization, modifiability (everything associated with a data structure is together), and separate compilation Advantage of Restriction 2: - Reliability--by hiding the data representations, user code cannot directly access objects of the type. User code cannot depend on the representation, allowing the representation to be changed without affecting user code. Built-in types are abstract data types e.g. int type in C - The representation is hidden - Operations are all built-in - User programs can define objects of int type - User-defined abstract data types must have the same characteristics as built-in abstract data types

  5. Language Requirements for Data Abstraction: 1. A syntactic unit in which to encapsulate the type definition. 2. A method of making type names and subprogram headers visible to clients, while hiding actual definitions. 3. Some primitive operations must be built into the language processor (usually just assignment and comparisons for equality and inequality) - Some operations are commonly needed, but must be defined by the type designer - e.g., iterators, constructors, destructors Language Design Issues: 1. Encapsulate a single type, or something more? 2. What types can be abstract? 3. Can abstract types be parameterized? 4. What access controls are provided?

  6. Language Examples: 1. Earlier Languages - Provided encapsulation, but no information hiding 2. Modern Languages - The encapsulation construct is the package - Packages usually have two parts: 1. Specification package (the interface) 2. Body package (implementation of the entities named in the specification - Any type can be exported - Information Hiding - Hidden types are named in the spec package in, as in: type NODE_TYPE is private;

  7. - Representation of an exported hidden type is specified in a special invisible (to clients) part of the spec package (the private clause), as in: package … is type NODE_TYPE is private; … type NODE_TYPE is record … end record; … - A spec package can also define unhidden types simply by providing the representation outside a private clause - The reasons for the two-part type definition are: 1. The compiler must be able to see the representation after seeing only the spec package (the compiler can see the private clause) 2. Clients must see the type name, but not the representation (clients cannot see the private clause)

  8. - Private types have built-in operations for assignment and comparison with = and /= - Limited private types have no built-in operations Evaluation of Ada Abstract Data Types 1. Lack of restriction to pointers is better - Cost is recompilation of clients when the representation is changed 2. Cannot import specific entities from other packages

  9. 4. C++ - Based on C struct type - The class is the encapsulation device - All of the class instances of a class share a single copy of the member functions - Each instance of a class has its own copy of the class data members - Instances can be static, stack dynamic, or heap dynamic - Information Hiding: - Private clause for hidden entities - Public clause for interface entities - Protected clause - for inheritance - Constructors: - Functions to initialize the data members of instances (they DO NOT create the objects) - May also allocate storage if part of the object is heap-dynamic - Can include parameters to provide parameterization of the objects - Implicitly called when an instance is created - Can be explicitly called - Name is the same as the class name

  10. Destructors • - Functions to cleanup after an instance is • destroyed; usually just to reclaim heap storage • - Implicitly called when the object’s lifetime ends • - Can be explicitly called • - Name is the class name, preceded by a tilda (~) • - Friend functions or classes - to provide access • to private members to some unrelated units or • functions (NECESSARY in C++) • Evaluation of C++ Support for • Abstract Data Types • - Classes are similar to Ada packages for • providing abstract data type • - Difference: packages are encapsulations, • whereas classes are types

  11. A Related Language: Java - Similar to C++, except: - All user-defined types are classes - All objects are allocated from the heap and accessed through reference variables - Individual entities in classes have access control modifiers (private or public), rather than clauses - Java has a second scoping mechanism, package scope, which can be used in place of friends - All entities in all classes in a package that do not have access control modifiers are visible throughout the package

  12. Parameterized Abstract Data Types 1. Ada Generic Packages - Make the stack type more flexible by making the element type and the size of the stack generic 2. C++ Templated Classes - Classes can be somewhat generic by writing parameterized constructor functions e.g. stack (int size) { stk_ptr = new int [size]; max_len = size - 1; top = -1; } stack (100) stk;

  13. - The stack element type can be parameterized by making the class a templated class - Java does not support generic abstract data types

  14. Chapter 12 Support for Object-Oriented Programming

  15. Categories of languages that support OOP: 1. OOP support is added to an existing language - C++ (also supports procedural and data- oriented programming) - Ada (also supports procedural and data- oriented programming) - Scheme (also supports functional programming) 2. Support OOP, but have the same appearance and use the basic structure of earlier imperative languages - Java (based on C++) 3. Pure OOP languages - Smalltalk

  16. Paradigm Evolution 1. Procedural - 1950s-1970s (procedural abstraction) 2. Data-Oriented - early 1980s (data-oriented) 3. OOP - late 1980s (Inheritance and dynamic binding) Origins of Inheritance Observations of the mid-late 1980s : - Productivity increases can come from reuse - ADTs are difficult to reuse--never quite right - All ADTs are independent and at the same level Inheritance solves both--reuse ADTs after minor changes and define classes in a hierarchy

  17. OOP Definitions: - ADTs are called classes - Class instances are called objects - A class that inherits is a derived class or a subclass - The class from which another class inherits is a parent class or superclass - Subprograms that define operations on objects are called methods - The entire collection of methods of an object is called its message protocol or message interface - Messages have two parts--a method name and the destination object - In the simplest case, a class inherits all of the entities of its parent

  18. - Inheritance can be complicated by access controls to encapsulated entities - A class can hide entities from its subclasses - A class can hide entities from its clients - Besides inheriting methods as is, a class can modify an inherited method - The new one overrides the inherited one - The method in the parent is overriden - There are two kinds of variables in a class: 1. Class variables - one/class 2. Instance variables - one/object - There are two kinds of methods in a class: 1. Class methods - messages to the class 2. Instance methods - messages to objects - Single vs. Multiple Inheritance

  19. - Disadvantage of inheritance for reuse: - Creates interdependencies among classes that complicate maintenance Polymorphism in OOPLs - A polymorphic variable can be defined in a class that is able to reference (or point to) objects of the class and objects of any of its descendants - When a class hierarchy includes classes that override methods and such methods are called through a polymorphic variable, the binding to the correct method MUST be dynamic - This polymorphism simplifies the addition of new methods - A virtual method is one that does not include a definition (it only defines a protocol) - A virtual class is one that includes at least one virtual method - A virtual class cannot be instantiated

  20. Design Issues for OOPLs 1. The Exclusivity of Objects a. Everything is an object advantage - elegance and purity disadvantage - slow operations on simple objects (e.g., float) b. Add objects to a complete typing system Advantage - fast operations on simple objects Disadvantage - results in a confusing type system c. Include an imperative-style typing system for primitives but make everything else objects Advantage - fast operations on simple objects and a relatively small typing system Disadvantage - still some confusion because of the two type systems 2. Are Subclasses Subtypes? - Does an is-a relationship hold between a parent class object and an object of the subclass?

  21. 3. Implementation and Interface Inheritance - If only the interface of the parent class is visible to the subclass, it is interface inheritance Disadvantage - can result in inefficiencies - If both the interface and the implementation of the parent class is visible to the subclass, it is implementation inheritance Disadvantage - changes to the parent class require recompilation of subclasses, and sometimes even modification of subclasses 4. Type Checking and Polymorphism - Polymorphism may require dynamic type checking of parameters and the return value - Dynamic type checking is costly and delays error detection - If overriding methods are restricted to having the same parameter types and return type, the checking can be static

  22. 5. Single and Multiple Inheritance - Disadvantage of multiple inheritance: - Language and implementation complexity - Potential inefficiency - dynamic binding costs more with multiple inheritance (but not much) - Advantage: - Sometimes it is extremely convenient and valuable 6. Allocation and Deallocation of Objects - From where are objects allocated? - If they all live in the heap, references to them are uniform - Is deallocation explicit or implicit?

  23. 7. Dynamic and Static Binding - Should ALL binding of messages to methods be dynamic?

  24. Overview of Smalltalk - Smalltalk is a pure OOP language - Everything is an object - All computation is through objects sending messages to objects - It adopts none of the appearance of imperative languages - The Smalltalk Environment - The first complete GUI system - A complete system for software development - All of the system source code is available to the user, who can modify it if he/she wants Introduction to Smalltalk - Expressions - Four kinds: 1. Literals (numbers, strings, and keywords) 2. Variable names (all variables are references) 3. Message expressions 4. Block expressions

  25. - Message expressions - Two parts: the receiver object and the message itself - The message part specifies the method and possibly some parameters - Replies to messages are objects - Messages can be of three forms: 1. Unary (no parameters) e.g., myAngle sin (sends a message to the sin method of the myAngle object) 2. Binary (one parameter, an object) e.g., 12 + 17 (sends the message “+17” to the object 12; the object parameter is “17” and the method is “+”) 3. Keyword (use keywords to organize the parameters) e.g., myArray at: 1 put: 5 (sends the objects “1” and “5” to the at:put: method of the object myArray) - Multiple messages to the same object can be strung together, separated by semicolons

  26. Methods - General form: message_pattern [| temps |] statements - A message pattern is like the formal parameters of a subprogram - For a unary message, it is just the name - For others, it lists keywords and formal names - temps are just names--Smalltalk is type-less! Assignments - Simplest Form: name1 <- name2 - It is simply a pointer assignment - RHS can be a message expression e.g., index <- index + 1 Blocks - A sequence of statements, separated by periods, delimited by brackets e.g., [index <- index + 1. sum <- sum + index]

  27. Blocks (continued) - A block specifies something, but doesn’t do it - To request the execution of a block, send it the unary message, value e.g., […] value - If a block is assigned to a variable, it is evaluated by sending value to that variable e.g., addIndex <- [sum <- sum + index] … addIndex value - Blocks can have parameters, as in [:x :y | statements] - If a block contains a relational expression, it returns a Boolean object, true or false Iteration - The objects true and false have methods for building control constructs - The method WhileTrue: from Block is used for pretest logical loops. It is defined for all blocks that return Boolean objects.

  28. Iteration (continued) e.g., [count <= 20] whileTrue [sum <- sum + count. count <- count + 1] - timesRepeat: is defined for integers and can be used to build counting loops e.g., xCube <- 1. 3 timesRepeat: [xCube <- xCube * x] Selection - The Boolean objects have the method ifTrue:ifFalse: , which can be used to build selection e.g., total = 0 ifTrue: […] ifFalse: […]

  29. Large-Scale Features of Smalltalk - Type Checking and Polymorphism - All bindings of messages to methods is dynamic - The process is to search the object to which the message is sent for the method; if not found, search the superclass, etc. - Because all variables are type-less, methods are all polymorphic - Inheritance - All subclasses are subtypes (nothing can be hidden) - All inheritance is implementation inheritance - No multiple inheritance - Methods can be redefined, but the two are not related

  30. C++ - General Characteristics: - Mixed typing system - Constructors and destructors - Elaborate access controls to class entities - Inheritance - A class need not be subclasses of any class - Access controls for members are 1. Private (visible only in the class and friends) 2. Public (visible in subclasses and clients) 3. Protected (visible in the class and in subclasses) - In addition, the subclassing process can be declared with access controls, which define potential changes in access by subclasses - Multiple inheritance is supported

  31. Inheritance (continued) - Dynamic Binding - A method can be defined to be virtual, which means that they can be called through polymorphic variables and dynamically bound to messages - A pure virtual function has no definition at all - A class that has at least one pure virtual function is an abstract class - Evaluation - C++ provides extensive access control (unlike Smalltalk) - C++ provides multiple inheritance - In C++, the programmer must decide at design time which methods will be statically bound and which must be dynamically bound - Static binding is faster! - Smalltalk type checking is dynamic (flexible, but somewhat unsafe)

  32. Java - General Characteristics - All data are objects except the primitive types - All primitive types have wrapper classes that store one data value - All objects are heap-dynamic, are referenced through reference variables, and most are allocated with new - Inheritance - Single inheritance only, but there is an abstract class category that provides some of the benefits of multiple inheritance (interface) - An interface can include only method declarations and named constants e.g., public class Clock extends Applet implements Runnable - Methods can be final (cannot be overriden)

  33. - Dynamic Binding - In Java, all messages are dynamically bound to methods, unless the method is final - Encapsulation - Two constructs, classes and packages - Packages provide a container for classes that are related - Entities defined without an scope (access) modifier have package scope, which makes them visible throughout the package in which they are defined - Every class in a package is a friend to the package scope entities elsewhere in the package

  34. End of Lecture

More Related