1 / 54

2. Data Formats

2. Data Formats. Chapt. 3. Real World. Computer. Data. Data. Dear Mom:. Keyboard. 10110010…. Digital camera. 10110010…. Introduction. Examples. Input device. pp. 59.-61. Format must be appropriate.

Albert_Lan
Download Presentation

2. Data Formats

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 2. Data Formats Chapt. 3

  2. Real World Computer Data Data Dear Mom: Keyboard 10110010… Digitalcamera 10110010… Introduction • Examples Input device pp. 59.-61

  3. Format must be appropriate • The internal representation must be appropriate for the type of processing to take place (e.g., text, images, sound)

  4. Rules/Conventions • Proprietary formats • Unique to a product or company • E.g., Microsoft Word, Corel Word Perfect, IBM Lotus Notes • Standards • Evolve two ways: • Proprietary formats become de facto standards (e.g., Adobe PostScript, Apple Quick Time) • Committee is struck to solve a problem (Motion Pictures Experts Group, MPEG) pp. 61-62

  5. Standards Organizations • ISO – International Standards Organization • CSA – Canadian Standards Association • ANSI – American National Standards Institute • IEEE – Institute for Electrical and Electronics Engineers • Etc.

  6. Examples of Standards

  7. Why Standards? • Standard are “arbitrary” • They exist because they are • Convenient • Efficient • Flexible • Appropriate • Etc.

  8. Alphanumeric Data • Problem: Distinguishing between the number 123 (one hundred and twenty-three) and the characters “123” (one, two, three) • Four standards for representing letters (alpha) and numbers • BCD – Binary-coded decimal • ASCII – American standard code for information interchange • EBCDIC – Extended binary-coded decimal interchange code • Unicode pp. 63-69

  9. Standard Alphanumeric Formats Next 2 slides • BCD • ASCII • EBCDIC • Unicode

  10. Binary-Coded Decimal (BCD) • Four bits per digit Note: the following bit patterns are not used: 1010 1011 1100 1101 1110 1111

  11. Example • 709310 = ? (in BCD) 7 0 9 3 0111 0000 1001 0011

  12. Standard Alphanumeric Formats • BCD • ASCII • EBCDIC • Unicode Next 22 slides

  13. The Problem • Representing text strings, such as “Hello, world”,in a computer

  14. Codes and Characters • Each character is coded as a byte • Most common coding system is ASCII (Pronounced ass-key) • ASCII = American National Standard Code for Information Interchange • Defined in ANSI document X3.4-1977

  15. ASCII Features • 7-bit code • 8th bit is unused (or used for a parity bit) • 27 = 128 codes • Two general types of codes: • 95 are “Graphic” codes (displayable on a console) • 33 are “Control” codes (control features of the console or communications channel)

  16. ASCII Chart

  17. Most significant bit Least significant bit

  18. e.g., ‘a’ = 1100001

  19. 95 Graphic codes

  20. 33 Control codes

  21. Alphabetic codes

  22. Numeric codes

  23. Punctuation, etc.

  24. H e l l o , w o r l d = = = = = = = = = = = = Binary 01001000 01100101 01101100 01101100 01101111 00101100 00100000 01110111 01100111 01110010 01101100 01100100 = = = = = = = = = = = = Hexadecimal 48 65 6C 6C 6F 2C 20 77 67 72 6C 64 = = = = = = = = = = = = Decimal 72 101 108 108 111 44 32 119 103 114 108 100 “Hello, world” Example

  25. Common Control Codes • CR 0D carriage return • LF 0A line feed • HT 09 horizontal tab • DEL 7F delete • NULL 00 null Hexadecimal code

  26. Terminology • Learn the names of the special symbols • [ ] brackets • { } braces • ( ) parentheses • @ commercial ‘at’ sign • & ampersand • ~ tilde

  27. Escape Sequences • Extend the capability of the ASCII code set • For controlling terminals and formatting output • Defined by ANSI in documents X3.41-1974 and X3.64-1977 • The escape code is ESC = 1B16 • An escape sequence begins with two codes: ESC [ 1B16 5B16

  28. Examples • Erase display: ESC [ 2 J • Erase line: ESC [ K

  29. Standard Alphanumeric Formats • BCD • ASCII • EBCDIC • Unicode Next 1 slides

  30. EBCDIC • Extended BCDInterchange Code (pronounced ebb’-se-dick) • 8-bit code • Developed by IBM • Rarely used today • IBM mainframes only

  31. Standard Alphanumeric Formats • BCD • ASCII • EBCDIC • Unicode Next 2 slides

  32. Unicode • 16-bit standard • Developed by a consortia • Intended to supercede older 7- and 8-bit codes

  33. Unicode Version 2.1 • 1998 • Improves on version 2.0 • Includes the Euro sign (20AC16 = ) • From the standard: …contains 38,887 distinct coded characters derived from the supported scripts. These characters cover the principal written languages of the Americas, Europe, the Middle East, Africa, India, Asia, and Pacifica. http://www.unicode.org

  34. Keyboard Input • Key (“scan”) codes are converted to ASCII • ASCII code sent to host computer • Received by the host as a “stream” of data • Stored in buffer • Processed • Etc. pp. 69

  35. Shift Key • inhibits bit 5 in the ASCII code a Shift a

  36. Control Key • inhibits bits 5 & 6 in the ASCII code c Ctrl c Controlcode

  37. Other Input • OCR – optical character recognition • Bar code readers • Voice/audio input • Punched cards • Images / objects • Pointing devices pp. 69-86

  38. OCR Hello, world Optical scan 10110110… Page of text Computer file

  39. Other Input • OCR – optical character recognition • Bar code readers • Voice/audio input • Punched cards • Images / objects • Pointing devices pp. 69-86

  40. Bar Codes • An automatic identification (Auto ID) technology that streamlines identification and data collection • See http://www.digital.net/barcoder/barcode.html

  41. Other Input • OCR – optical character recognition • Bar code readers • Voice/audio input • Punched cards • Images / objects • Pointing devices pp. 69-86

  42. Voice/audio Input • Input device: microphone • Audio input is “digitized” and stored • Processed in two ways • As is (no recognition) • Recognized and converted to alphanumeric data (ASCII) Digitize 10110010…

  43. Other Input • OCR – optical character recognition • Bar code readers • Voice/audio input • Punched cards • Images / objects • Pointing devices pp. 69-86

  44. Punched Cards • Invented by Herman Hollerith (founder of IBM) • Each card holds 80 characters

  45. Other Input • OCR – optical character recognition • Bar code readers • Voice/audio input • Punched cards • Images / objects • Pointing devices pp. 69-86

  46. Images • Typically images are pictures that are optically scanned and saved as a “bit map” or in some other format • Many formats • gif, jpeg, …

  47. Typical “Save As” Dialog

More Related