510 likes | 525 Views
Learn about representing information as bits, binary/hexadecimal, byte representations, numbers, characters and strings, instructions, bit-level manipulations, boolean algebra, and data representation in C.
E N D
Bits and BytesSpring, 2015 COMP 21000 Comp Org & Assembly Lang Topics • Why bits? • Representing information as bits • Binary / Hexadecimal • Byte representations • Numbers • Characters and strings • Instructions • Bit-level manipulations • Boolean algebra • Expressing in C
How then do we represent data? • Each different “type” in a programming language has its own representation. • Next few slides discuss the representation for the common types: • Ints • Pointers • Floats • Strings
Alpha C Linux/Alpha A Linux C Linux/Alpha B Sun B Sun C Sun A 00 FF 00 93 6D 6D 6D 00 FF 00 3B C4 3B 3B 3B C4 3B 00 FF 00 00 93 6D 6D 00 00 FF 00 00 00 00 00 Linux/Win/OS X: little endian Solaris: big endian Representing Integers Decimal: 15213 Binary: 0011 1011 0110 1101 Hex: 3 B 6 D • int A = 15213; • int B = -15213; • long int C = 15213; Two’s complement representation (Covered next lecture)
D4 EF A0 FF F8 FC FF FB FF BF 2C FF 01 00 00 00 Linux/Win/OS X: little endian Solaris: big endian Alpha P Representing Pointers • int B = -15213; • int *P = &B; Alpha Address Hex: 1 F F F F F C A 0 Binary: 0001 1111 1111 1111 1111 1111 1100 1010 0000 Sun P Sun Address Hex: E F F F F B 2 C Binary: 1110 1111 1111 1111 1111 1011 0010 1100 Linux P Linux Address Hex: B F F F F 8 D 4 Binary: 1011 1111 1111 1111 1111 1000 1101 0100 Different compilers & machines assign different locations to objects
Linux/W in/OS X F Solaris F 46 00 6D IEEE Single Precision Floating Point Representation Hex: 4 6 6 D B 4 0 0 Binary: 0100 0110 0110 1101 1011 0100 0000 0000 15213 1110 1101 1011 01 (in binary) B4 B4 6D 00 46 Representing Floats • Float F = 15213.0; IEEE Single Precision Floating Point Representation Hex: 4 6 6 D B 4 0 0 Binary: 0100 0110 0110 1101 1011 0100 0000 0000 15213: 1110 1101 1011 01 IEEE Single Precision Floating Point Representation Hex: 4 6 6 D B 4 0 0 Binary: 0100 0110 0110 1101 1011 0100 0000 0000 15213: 1110 1101 1011 01 Not same as integer representation, but consistent across machines Can see some relation to integer representation, but not obvious
31 31 35 35 32 32 31 31 33 33 00 00 Representing Strings char S[6] = "15213"; • Strings in C • Represented by array of characters • Each character encoded in ASCII format • Standard 7-bit encoding of character set • Character “0” has code 0x30 • Digit i has code 0x30+i • String should be null-terminated • Final character = 0 • Compatibility • Byte ordering not an issue • Text files generally platform independent • Except for different conventions of line termination character(s)! • Unix (‘\n’ = 0x0a = ^J) • Mac (‘\r’ = 0x0d = ^M) • DOS and HTTP (‘\r\n’ = 0x0d0a = ^M^J) Linux/Win/ OS X S Solaris S
ASCII Chart ASCII ‘0’ is 0x30 Reference: http://jimprice.com/jim-asc.htm
Characters Other popular encoding standards: • Unicode • 16 bit code • Allows more characters: most languages can be encoded • See http://en.wikipedia.org/wiki/Unicode • The most commonly used encodings are UTF-8, UTF-16 and the now-obsolete UCS-2 • UTF-8 • uses one byte for any ASCII character, • These bytes have the same code values in both UTF-8 and ASCII encoding, • and up to four bytes for other characters • ISO Latin-8 • 8-bit encoding standard with dotted consonants • International standard for Latin letters
Machine-Level Code Representation • Encode Program as Sequence of Instructions • Each instruction is a simple operation • Arithmetic operation • Read or write memory • Conditional branch • Instructions encoded as bytes • Alpha’s, Sun’s, Mac’s use 4 byte instructions • Reduced Instruction Set Computer (RISC) • PC’s (and intel Mac’s) use variable length instructions • Complex Instruction Set Computer (CISC) • Different instruction types and encodings for different machines • Most code is not binary compatible • Programs are Byte Sequences Too!
Alpha sum Solaris sum PC sum 55 81 01 00 08 90 45 89 C3 00 80 0C 02 89 E5 E0 FA 03 00 EC 30 08 8B 45 6B 09 42 5D C3 Representing Instructions • int sum(int x, int y) • { • return x+y; • } • For this example, Alpha & Sun use two 4-byte instructions • Use differing numbers of instructions in other cases • PC uses 7 instructions with lengths 1, 2, and 3 bytes • Same for NT, OSX (intel) and for Linux • NT / Linux/OSX not fully binary compatible Different machines use totally different instructions and encodings
What do we know now? • A computer is a processor and RAM. • Everything else is peripheral • Everything is 1’s and 0’s. • A processor executes instructions • Instructions are encoded in 1’s and 0’s • each instruction has binary representation • RAM stores 1’s and 0’s • We interpret these 1’s and 0’s based on context • Every type is interpreted from binary • integers: binary • float: special representation in binary • char: binary (ASCII encoding)
How do we get to bits? We write programs in an abstract language like C How do we get to the binary representation? Compilers & Assemblers! We write x = 5. The compiler changes it into mov 5, 0x005F The assembler changes it into: 80483c7: 8b 45 5F
Manipulating bits Since all information is in the form of bits, we must manipulate bits when programming at low levels • We did some of this with the show_bytes program. • To understand how to do more, we must understand Boolean algebra • We can manipulate bits from C; this is where C and hardware meet.
And • A&B = 1 when both A=1 and B=1 Or • A|B = 1 when either A=1 or B=1 Not • ~A = 1 when A=0 Exclusive-Or (Xor) • A^B = 1 when either A=1 or B=1, but not both Boolean Algebra • Developed by George Boole in 19th Century • Algebraic representation of logic • Encode “True” as 1 and “False” as 0
A&~B ~A&B A ~B ~A B Application of Boolean Algebra • Applied to Digital Systems by Claude Shannon • 1937 MIT Master’s Thesis • Reason about networks of relay switches • Encode closed switch as 1, open switch as 0 Connection when A&~B | ~A&B = A^B
Representing & Manipulating Sets • Representation • Width w bit vector represents subsets of {0, …, w–1} • aj = 1 if jA 01101001{ 0, 3, 5, 6 } 76543210 01010101{ 0, 2, 4, 6 } 76543210 • Operations • & Intersection 01000001 { 0, 6 } • | Union 01111101 { 0, 2, 3, 4, 5, 6 } • ^ Symmetric difference 00111100 { 2, 3, 4, 5 } • ~ Complement 10101010 { 1, 3, 5, 7 }
General Boolean Algebras Operate on Bit Vectors • Operations applied bitwise • Let x = 105 and y = 85 (think about how this is actually stored) x & y: x | y: x ^ y ~y All of the Properties of Boolean Algebra Apply 01101001 & 01010101 01000001 01101001 | 01010101 01111101 01101001 ^ 01010101 00111100 ~ 01010101 10101010 01000001 01111101 00111100 10101010
Bit-Level Operations in C • Operations &, |, ~, ^ Available in C • Apply to any “integral” data type • long, int, short, char, unsigned • View arguments as bit vectors • Arguments applied bit-wise • Examples (Char data type) • ~0x41 --> 0xBE ~010000012 --> 101111102 • ~0x00 --> 0xFF ~000000002 --> 111111112 • 0x69 & 0x55 --> 0x41 011010012 & 010101012 --> 010000012 • 0x69 | 0x55 --> 0x7D 011010012 | 010101012 --> 011111012
Contrast: Logic Operations in C • Contrast to Logical Operators • &&, ||, ! • View 0 as “False” • Anything nonzero as “True” • Always return 0 or 1 • Early termination • Examples (char data type) • c = ‘A’ then !c is !0x41 --> 0x00 • c = ‘\0’ then !c is !0x00 --> 0x01 • c = ‘A’ the !!c is !!0x41 --> 0x01 • 0x69 && 0x55 --> 0x01 • 0x69 || 0x55 --> 0x01 • p && *p++ (avoids null pointer access) • Or p && (*p = 10) also protects Works because of short-circuit evaluation in C
Shift Operations • Left Shift: x << y • Shift bit-vector x left y positions • Throw away extra bits on left • Fill with 0’s on right • Right Shift: x >> y • Shift bit-vector x right y positions • Throw away extra bits on right • Logical shift • Fill with 0’s on left • Arithmetic shift • Replicate most significant bit on right • Useful with two’s complement integer representation Argument x 01100010 << 3 00010000 00010000 00010000 Log. >> 2 00011000 00011000 00011000 Arith. >> 2 00011000 00011000 00011000 Argument x 10100010 << 3 00010000 00010000 00010000 Log. >> 2 00101000 00101000 00101000 Arith. >> 2 11101000 11101000 11101000 Which right shift does C do? Depends on the compiler! Often: int do ASR While unsign does LSR
Effect of LSL Each shift doubles the number. If the carry bit is 1 after a shift, then overflow (if the number is unsigned). Decimal: shift multiplies by 10. 35 << 350 Binary: 0 1 0 1 0 = 1 x 23 + 0 x 22 + 1 x 21 + 0 x 20 1 0 1 0 0 = 1 x 24 + 0 x 23 + 1 x 22 + 0 x 21+ 0 x 20 = 2 x (1 x 23 + 0 x 22 + 1 x 21 + 0 x 20)
Effect of LSR • Each shift halves the number. • Round down (because you drop the least significant bit) • Decimal: shift divides by 10. 350 >> 35 • Binary: 1 0 1 0 0 = 1 x 24 + 0 x 23 + 1 x 22 + 0 x 21+ 0 x 20 0 1 0 1 0 = 1 x 23 + 0 x 22 + 1 x 21 + 0 x 20 = (1 x 24 + x 23 + 0 x 22 + 1 x 21 + 0 x 20) ÷ 2
Truncating • Casting to a shorter form truncates the number. • int x = 53191; • short sx = (short) x; /* -12345 */ • int y = sx; /* -12345*/ • On a typical 32-bit machine: • Casting int to short truncate 32-bit to 16-bits • When cast back to 32 bits, do sign extension (see next lecture) so value doesn’t change.
Masks • Can use bit patterns as masks • Example • Mask = 0xFFFFFF00 when used with the ‘&’ operator filters out (i.e., zeros out) the last byte • Mask = 0x000000FF when used with the ’ |’ operator places 1’s in the last byte, rest is unaffected.
Masks note that we can assign hex values to an integer. Just another way of representing int! • Can use bit patterns as masks • Example • int x = 0xFFFFFFF0; // this will be the mask. • int y = 31; // this is stored as 0…011111 • int z = y & x; // z now contains 16 or 0…010000
Masks note that we can assign hex values to an integer. Just another way of representing int! • Can use bit patterns as masks • Example • int x = 0x0000000F; // this will be the mask. • int y = 16; // this is stored as 0…010000 • int z = y | x; // z now contains 31 or 0…011111
Masks • Can perform a mod by a power of 2 with a mask • Examples: • x % 2 = 0 (x is even) or 1 (x is odd) • 1111 = 1 x 23 + 1 x 22 + 1 x 21 + 1 x 20 : all places except the last are divisible by 2. So if the last bit is 0, mod is 0. If last bit is 1, mod is 1 • x % 4 • 1111 = 1 x 23 + 1 x 22 + 1 x 21 + 1 x 20 : all places except the last two are divisible by 4. So if the last two bits are 0, mod is 0. Otherwise the mod is whatever value the last two bits have
Masks • Can perform a mod by a power of 2 with a mask • Examples: • x % 8 = 0 • 1111 = 1 x 23 + 1 x 22 + 1 x 21 + 1 x 20 : all places except the last three are divisible by 8. So if the last three bits are 0, mod is 0. . Otherwise the mod is the value in the last three bits • so • x % 2 == x & 00000001 (or 0x01) • x % 4 == x & 00000011 (or 0x03) • x % 8 == x & 00000111 (or 0x07)
Converting (revisited) #include <stdlib.h> #define LOWDIGIT 07 #define DIG 20 void octal_print(register int x) { register inti = 0; char s[DIG]; if ( x < 0 ){ /* treat negative x */ putchar ('-'); /* output a minus sign */ x = - x; } do{ /* mod performed with & */ s[i++] = ((x & LOWDIGIT) + '0'); } while ( ( x >>= 3) > 0 ); /* divide x by 8 via shift */ putchar('0'); /* octal number prefix */ do{ putchar(s[--i]); /* output characters on s */ } while ( i > 0 ); putchar ('\n'); }
Converting to binary • void printBits(unsigned int word, int nBits){ • for (int i=nBits-1;i>=0;i--){ • printf("%u", !!(word&(1<<i))); • } • printf("\n"); • }
A^B B A^B (A^B)^B = A (A^B)^A = B A B A Cool Stuff with Xor • Bitwise Xor is form of addition • With extra property that every value is its own additive inverse A ^ A = 0 void funny(int *x, int *y) { *x = *x ^ *y; /* #1 */ *y = *x ^ *y; /* #2 */ *x = *x ^ *y; /* #3 */ } *x *y Begin A B 1 2 3 End
Bit techniques • INTEGER CODING RULES: • Replace the "return" statement in each function with one • or more lines of C code that implements the function. Your code • must conform to the following style: • int Funct(arg1, arg2, ...) { • /* brief description of how your implementation works */ • int var1 = Expr1; • ... • int varM = ExprM; • varJ = ExprJ; • ... • varN = ExprN; • return ExprR; • }
Bit techniques Each "Expr" is an expression using ONLY the following: 1. Integer constants 0 through 255 (0xFF), inclusive. You are not allowed to use big constants such as 0xffffffff. 2. Function arguments and local variables (no global variables). 3. Unary integer operations ! ~ 4. Binary integer operations & ^ | + << >> Some of the problems restrict the set of allowed operators even further. Each "Expr" may consist of multiple operators. You are not restricted to one operator per line. You are expressly forbidden to: 1. Use any control constructs such as if, do, while, for, switch, etc. 2. Define or use any macros. 3. Define any additional functions in this file. 4. Call any functions. 5. Use any other operations, such as &&, ||, -, or ?: 6. Use any form of casting. 7. Use any data type other than int. This implies that you cannot use arrays, structs, or unions.
Bit techniques • /* pow2plus1 - returns 2^x + 1, where 0 <= x <= 31 */ • int pow2plus1(int x) { • /* exploit ability of shifts to compute powers of 2 */ • return • } • /* pow2plus4 - returns 2^x + 4, where 0 <= x <= 31 */ • int pow2plus4(int x) { • /* exploit ability of shifts to compute powers of 2 */ • return result; • } (1 << x) + 1; int result = (1 << x) result += 4;
Bit techniques • / * bitNor - ~(x|y) using only ~ and & • * Example: bitNor(0x6, 0x5) = 0xFFFFFFF8 • * Legal ops: ~ & • * Max ops: 8 • * Rating: 1 • */ • int bitNor(int x, int y) { • } • /* evenBits - return a word with all even-numbered bits set to 1 • * Legal ops: ! ~ & ^ | + << >> • * Max ops: 8 • * Rating: 1 • */ • int evenBits(void) { • } use DeMorgan’s Law: ~(A & B) = (~A) | (~B) And ~(A | B) = (~A) & (~B) return (~x & ~y) int byte = 0x55; int word = byte | byte<<8; return word | word<<16;
Bit techniques • /* • * isNegative - return 1 if x < 0, return 0 otherwise • * Example: isNegative(-1) = 1. • * Legal ops: ! ~ & ^ | + << >> • * Max ops: 6 • * Rating: 2 • */ • int isNegative(int x) { • } • #ifdef CODE • /* • * isNotEqual - return 0 if x == y, and 1 otherwise • * Examples: isNotEqual(5,5) = 0, isNotEqual(4,5) = 1 • * Legal ops: ! ~ & ^ | + << >> • * Max ops: 6 • * Rating: 2 • */ • int isNotEqual(int x, int y) { • } return (x>>31) & 0x1; return !!(x ^ y); // when not equal must return 1 not some other num
Some Other Uses for Bitvectors • Representation of small sets • Representation of polynomials: • Important for error correcting codes (see later slides) • Arithmetic over finite fields, say GF(2^n) • Example 0x15213 : x16 + x14 + x12 + x9 + x4 + x + 1 • Representation of graphs (intersection of sys & algo) • A ‘1’ represents the presence of an edge • Representation of bitmap images, icons, cursors, … • Exclusive-or cursor patent (see later slides) • Representation of Boolean expressions and logic circuits
UDP • The User Datagram Protocol (UDP) is one of the core members of the Internet protocol suite, the set of network protocols used for the Internet. With UDP, computer applications can send messages, in this case referred to as datagrams, to other hosts on an Internet Protocol (IP) network without prior communications to set up special transmission channels or data paths. The protocol was designed by David P. Reed in 1980 and formally defined in RFC 768. • http://en.wikipedia.org/wiki/User_Datagram_Protocol
Transport Layer “no frills,”“bare bones” Internet transport protocol “best effort” service, UDP segments may be: lost delivered out of order to app connectionless: no handshaking between UDP sender, receiver each UDP segment handled independently of others Why is there a UDP? no connection establishment (TCP: 3-way handshake) simple: no connection state at sender, receiver (TCP keeps buffers, sequence no., ack, etc.) small segment header (20 bytes for TCP, 8 for UDP) no congestion control: UDP can blast away as fast as desired UDP: User Datagram Protocol [RFC 768] With UDP applications basically talk directly to IP
Transport Layer UDP checksum Goal: detect “errors” (e.g., flipped bits) in transmitted segment Result: UDP does error detection not error correction 1. may just discard a segment with errors 2. may pass it to the application with a warning.
Transport Layer Sender: treat segment contents as sequence of 16-bit integers checksum: addition (1’s complement sum) of segment contents sender puts checksum value into UDP checksum field Receiver: compute checksum of received segment check if computed checksum equals checksum field value: NO - error detected YES - no error detected. But maybe errors nonetheless? More later …. UDP checksum Goal: detect “errors” (e.g., flipped bits) in transmitted segment
Transport Layer Internet Checksum Example • Note • When adding numbers, a carryout from the most significant bit needs to be added to the result • sender: add two 16-bit integers 1 1 1 1 0 0 1 1 0 0 1 1 0 0 1 1 0 1 1 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 1 0 0 1 0 1 0 0 0 1 0 0 0 1 0 0 0 0 1 1 wraparound sum Checksum (complement the sum)
Transport Layer Internet Checksum Example • Note • When adding numbers, a carryout from the most significant bit needs to be added to the result • receiver: add two 16-bit integers plus the checksum: should get all 1’s (why?) 1 1 1 1 0 0 1 1 0 0 1 1 0 0 1 1 0 1 1 1 0 1 0 1 0 1 0 1 0 1 0 1 0 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 0 1 1 1 1 0 0 0 1 0 0 0 1 0 0 0 1 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Do XOR of sum & checksum; should get all 1’s wraparound sum Checksum Result For an explanation of 1’s complement addition see http://mathforum.org/library/drmath/view/54379.html
UDP checksum code (part 1) // make 16 bit words out of every two adjacent 8 bit words and // calculate the sum of all 16 vit words typedefunsigned short u16; typedef unsigned long u32; u16 udp_sum_calc(u16 len_udp, u16 src_addr[],u16 dest_addr[], BOOL padding, u16 buff[]) { u16 prot_udp=17; u16 padd=0; u16 word16; u32 sum; // Find out if the length of data is even or oddnumber. Ifodd, // add a padding byte = 0 at the end of packet if (padding&1==1){ padd=1; buff[len_udp]=0; } //initialize sum to zero sum=0; See http://www.netfor2.com/udpsum.htm for the full code.
UDP checksum code (part 2) // make 16 bit words out of every two adjacent 8 bit words and // calculate the sum of all 16 bit words • for (i=0; i < len_udp+padd; i=i+2){ • word16 =((buff[i]<<8)&0xFF00)+(buff[i+1]&0xFF); • sum = sum + (unsigned long)word16; • } • // add the UDP pseudo header which contains the IP source and destinationn addresses • for (i=0;i<4;i=i+2){ • word16 =((src_addr[i]<<8)&0xFF00)+(src_addr[i+1]&0xFF); • sum=sum+word16; • } • for (i=0;i<4;i=i+2){ • word16 =((dest_addr[i]<<8)&0xFF00)+(dest_addr[i+1]&0xFF); • sum=sum+word16; • } // overflow bits are added on the next slide See http://www.netfor2.com/udpsum.htm for the full code.
UDP checksum code (part 3) // the protocol number and the length of the UDP packet sum = sum + prot_udp + len_udp; // keep only the last 16 bits of the 32 bit calculated sum and add the carries while (sum>>16) sum = (sum & 0xFFFF)+(sum >> 16); // Take the one's complement of sum sum = ~sum; return ((u16) sum); } /* here’s where the carry’s are added in. Turns out you can save the overflow bits and add them back at the end */ See http://www.netfor2.com/udpsum.htm for the full code.
XOR patent dispute for (y=0; y < y_height; ++y) { int loc = (y+y_off)*stride + x_off; vidmem[loc] ^= 255; /* XOR */ } // another way of doing this: for (y=0; y < y_height; ++y) { int loc = (y+y_off)*stride + x_off; for (x=0; x < 8; ++x) if (vidmem[loc] & (1 << x)) vidmem[loc] &= ~(1 << x); else vidmem[loc] |= (1 << x); } Closely related is a patent which apparently covers results, not process. The exclusive-or-cursor patent is a simple example of this. There are a number of mathematically equivalent ways of expressing the xor algorithm, but it appears to be widely believed that all of them are covered by the patent. The patent apparently is understood to cover the idea of representing an onscreen cursor by complementing each "visible" pixel of the cursor with the contents of whatever is on screen. This is somewhat surprising, since it is predated by hardware implementations that do the exact same thing for text-only modes--rendering a block or underline cursor so its appearance is the same as above. See http://nothings.org/computer/patents.html for a full discussion
More Bitvector Magic • Count the number of 1’s in a word MIT Hackmem 169: int bitcount(unsigned int n) { unsigned int tmp; tmp = n - ((n >> 1) & 033333333333) - ((n >> 2) & 011111111111); return ((tmp + (tmp >> 3)) & 030707070707)%63; } • MIT Hackmem Count. The main idea. • 1. Consider a 3 bit unsigned number as being 4a+2b+c, • example: consider the three bit number 101 • then the unsigned number represents 1 x 22 + 0 x 21 + 1 x 20 = 1 x 4 + 0 x 2 + 1 • let “a” represent the leftmost bit (the 1), “b” the next bit (the 0) and c the rightmost bit (the rightmost 1). • then we get a x 4 + b x 2 + c or 4a+2b+c • 2. If we shift the number right 1 bit, we get 010. If we follow the convention of using “abc” for the bits, then after the shift (logical) we get 0ab or 0 x 22 + a x 21 + b x 20 = 2a+b. • 3. Subtracting this shifted number (2a+b) from the original number (4a+2b+c) gives 2a+b+c. 4. If we right-shift the original 3-bit number by two bits, we go from “abc” to “00a” which is just a x 20 • 5. If we subract this new shifted number (a) from the result of 3, we get 2a+b+c – a = a+b+c, 6. Since a, b, and c represent the bits (1, 0, and 1 in our example), a+b+c is just the number of bits in the original number!
More Bitvector Magic • Count the number of 1’s in a word MIT Hackmem 169: int bitcount(unsigned int n) { unsigned int tmp; tmp = n - ((n >> 1) & 033333333333) - ((n >> 2) & 011111111111); return ((tmp + (tmp >> 3)) & 030707070707)%63; } MIT Hackmem Count. (continued) How is this insight employed in the code? 1. The key insight is that we’re actually going to consider the parameter n to be composed of 11 groups of 3 (3 x 11 = 33, or one more bit than we actually have in a 32-bit number). 2. So we’re going to do the algorithm from the previous page one each three bit group of bits. 3. Consider the bits 101 111 010 To apply the previous algorithm on each group we want to: a. (101 – 010 – 001) (111 – 011 – 001) (010 – 001 – 000) b. If we shift n we get: 010 111 101 Note that the shifted number in each position has a “wrong” first bit (it’s the shifted bit from the previous group) but the correct last two digits. c. We know that the first bit must be 0 since we’re doing logical shift right d. So we can take the result of shifting and mask out the first bit in each group. e. Since each group can be represented by an octal number, the mask for each group will be 3 in octal (011). This explains the line ((n >> 1) & 033333333333). Note that a number beginning in “0” in C is taken as an octal number. f. A similar explanation can be made for the next part of the expression, i.e., to shift each group twice, shift n right twice and mask with 001 (or octal 1) for each group. Now each octal digit in the number n represents the number of 1’s the original number in that position. In our example, the 9 digits become
More Bitvector Magic • Count the number of 1’s in a word MIT Hackmem 169: int bitcount(unsigned int n) { unsigned int tmp; tmp = n - ((n >> 1) & 033333333333) - ((n >> 2) & 011111111111); return ((tmp + (tmp >> 3)) & 030707070707)%63; } MIT Hackmem Count. (continued) The last return statement sums these octal digits to produce the final answer. The key idea is to add adjacent pairs of octal digits together and then compute the remainder modulus 63. 1. Right-shifting tmp by three bits, 2. adding it to tmp itself and 3. ANDing with a suitable mask. This step saves every other group of 3 bits. We want this because we’ve added each 3-bit group with its right neighbor group; if we save every group we will have counted each group twice. 4. The result is a number in which groups of six adjacent bits (starting from the LSB) contain the number of 1's among those six positions in n. 5. This number modulo 63 yields the final answer. For 64-bit numbers, we would have to add triples of octal digits and use modulus 1023. This is HACKMEM 169, as used in X11 sources. Source: MIT AI Lab memo, late 1970's.