1 / 34

Chapter 3

Chapter 3. Data Formats. Data Conversion. Types of Data. Alphanumeric Numeric Image Sound Video Graphics and Fonts All MUST be converted to binary form before they can be used by the computer. Proprietary Formats. Used by individual programs Unique: cannot be read by other programs

svea
Download Presentation

Chapter 3

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 3 Data Formats

  2. Data Conversion

  3. Types of Data • Alphanumeric • Numeric • Image • Sound • Video • Graphics and Fonts All MUST be converted to binary form before they can be used by the computer.

  4. Proprietary Formats • Used by individual programs • Unique: cannot be read by other programs • Example - a Microsoft WORD document is different from a WordPerfect document even through they are both word processing documents • As more info is shared over the Web, new proprietary formats are less desirable.

  5. Standards • Formats which are recognized by a large variety of hardware and software programs • Possible to use on different platforms • Possible for different systems to share data

  6. How Are Standards Developed? • People use it because it is the most popular - so-called, defacto standard. • Adobe’s PostScript • A committee is formed because a need for a standard is recognized • MPEG2 JPEG • MP3

  7. Characteristics of Standards • A well designed data standard should: • simplify interconnections • usefully reflect the ways the data is used • be recognized by a wide variety of applications

  8. Data Formats • Alphanumeric Character Data • EBCDIC • ASCII • Image data • Bit map images • Object images • Audio data • .MOD • .MIDI • .WAV

  9. Alphanumeric CharactersASCII • American Standard Code for Information Interchange (pg. 67) • 7-bit code => 128 characters • Includes Latin alphabet, Arabic numerals, standard English punctuation characters, and a few non-printable characters • Collating sequence • Most personal computers use ASCII • Latin-I ASCII - 8-bit code for Western European cultures

  10. Alphanumeric CharactersEBCDIC • Extended Binary Coded Decimal Interchange Code (pg. 68) • 8-bit code => 256 characters • Developed by IBM • Use is generally restricted to IBM and IBM-compatible mainframe computers • Limited: no ~ [ ] ^ representations

  11. Alphanumeric CharactersUNICODE • Supports many additional character alphabets including Cyrillic, Greek, Hebrew, Arabic, Thai, Chinese, Japanese, Korean, etc. • 16-bit standard => 65,536 characters (49,000 of which are defined) • Code values of 0-255 corresponds to ASCII Latin-I codes => conversion from ASCII to Unicode is simple (just append 8 0’s to ASCII code) • Unicode is standard in Windows OS

  12. Keyboard Input

  13. Keyboard Conversion • A binary scan code is generated via the keyboard circuitry for each key stroke • Computer software converts scan code to Unicode or ASCII or EBCDIC • NOTE: This permits different keyboards to be used for different natural languages • Characters are echoed to monitor

  14. Image Data • Bit Map (Raster) Image - image is represented by a set of picture elements (pixels - points) • Object (Vector) Image - image is represented by a set of graphical shapes such as lines and curves

  15. Bit Map Images • A pixel, a single point of the image, is stored as binary data • Each pixel contains intensity level and color - both of which may have large ranges of values (requiring perhaps up to 3 bytes/pixel) • Single image requires large amounts of data - 1.5+ MB of data • Image processing requires large arrays of data - representing pixels and their locations

  16. Graphics Interchange Format • GIF - most common method of storing bit map images • CompuServe proprietary format - 1987 • GIF89a - latest standard also supporting animated GIF images • GIF used extensively on Web • 8-bits/pixel - 256 colors (not good for details of paintings and photographs) • GIF is better suited for line drawings and simple images such as clip art, logos, and areas with solid colors

  17. GIF Screen Layout

  18. GIF File Format Layout

  19. Joint Photographers Expert Group • JPEG suited to photographs and paintings. • 24-bits per pixel - 16.7 million colors • Lossy Compression - assumes some data can be lost without noticing • subtle color changes will not show • some clarity is lost in order to have a smaller file • very small file sizes

  20. Object Images • Image is composed of geometric shapes represented mathematically • Efficient • Flexible • Stored in compact form • Images can be easily moved, scaled, rotated

  21. An Object Image from Textbook

  22. PostScript • Page Description Language for storing, transmitting, displaying and printing object images. • Contains procedures and statements that describe each of the objects on a page. • Program is stored in ASCII or Unicode and thus is a text file. • Program in printer or computer interprets PostScript file and creates pages that can be printed or displayed. • Large set of PostScript functions for manipulating objects.

  23. PostScript Program for Drawing Concentric Circles

  24. PostScript Program for Drawing Pie Segments

  25. Video Images • Video images require massive amounts of data. • Video Camera: 640x480 resolution, true color, 30 frames/second => 28 MB/sec of data => 1.6GB for 1-minute of film • Solution to massive files: • limit number of colors • limit image size • reduce frame rate • compress data

  26. Video Compression • MPEG - family of digital video compression standards • high compression achieved by storing only what changes from frame to frame • file sizes still large and take a long time to download • additional hardware support provided for displaying real-time video

  27. Audio Data • Analog sound wave must be converted to digital form • Analog waveform is sampled at regular time intervals • Amplitude is measured and converted to the binary equivalent (A-to-D converter) • Sampling rate - how often the sound is digitized (50,000 times/sec) • Higher rates - better quality and more storage space

  28. Digitizing Audio Waveform

  29. Audio Formats • MOD - store samples of sounds that are subsequently used to produce a new sound • MIDI - used to coordinate sounds and signals between computer and musical devices (such as keyboards) • MP3 - used for transmission and storage of high-quality audio signals. Popular for Web use. Low-cost, portable devices available for handling MP3 data. • WAV - simple Microsoft format supporting various sampling rates in mono or stereo

  30. MP3 • Popular file format for compressing and playing CD-quality music • 12:1 compression with no loss in quality • Has caused much controversy in the music world • Procedure: encode a song, distribute over the Internet, download to a PC, transfer to a $200 portable MP3 player or listen on computer • Over 80 MP3 sites and 20,000 songs and growing

  31. WAV Files • Some are found on your hard drive • Also can be downloaded from the Web • Create your own WAV file from an audio CD • Uses for Sound Files • to signify an event on the system (I.e. a file opening) • add sound to your web page

  32. WAV Sound Format

  33. Streaming Audio and Video • Streaming - data is downloaded continuously from web server or network server • Solution to large file size download time - start the download and play while more is being downloaded in the background • RealAudio and RealVideo most common players - over 125 Million registered users • Delivers content to 85% of all streaming enabled web sites • Also used for live broadcasts

  34. Data Compression • Lossless Compression (GIF Files) • no data is lost when decompressed • compression algorithms attempt to eliminate redundancies in data, such as a string of 0-bits • Lossy Compression (JPEG) • assume user can live with a certain amount of data degradation • used in multimedia applications

More Related