Polyalphabetic CIPHERS

portia

  1. Polyalphabetic CIPHERS • Linguistics 484

  2. Summary • The idea • How to recognize: index of coincidence • How many alphabets: Kasiski

  3. The idea • Remove the invariant that a plaintext letter always maps to the same cryptotext letter. • Smooth out the frequency distribution, removing clues.

  4. Monoalphabetic • [Diagram: Plaintext, Cryptosystem, Ciphertext]

  5-8. Polyalphabetic • [Diagram, repeated across four slides: Plaintext, Cryptosystem with components A, B, C, Ciphertext]

  9. Polyalphabetic system • Cryptosystem with several components. • Systematic way of moving from one cryptosystem to the next.

  10. Vigenère (simplified) • Component ciphers are shift ciphers, using the so-called Direct Standard Alphabet. • Each alphabet is used for one character, then you move on to the next alphabet.

  11. Vigenère (simplified)

  12. Vigenère (simplified) • You and your friend agree on a single-letter key, say ‘S’. • Encrypt the first letter with the ‘S’ alphabet, the second with the ‘T’ alphabet, and so on.
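The single-letter-key scheme above can be sketched in a few lines of Python. This is only an illustration under a stated assumption: the ‘S’ alphabet is taken to be the shift cipher that sends A to S, the ‘T’ alphabet sends A to T, and so on. A tableau laid out differently would produce different letters. The function name `progressive_shift` is made up for this sketch.

```python
def progressive_shift(text: str, start_key: str, decrypt: bool = False) -> str:
    """Simplified Vigenère: shift by the key letter, then by the next letter, and so on.

    Assumption: the 'S' alphabet maps A to S (a shift of 18). Non-letters are dropped.
    """
    start = ord(start_key.upper()) - ord("A")
    out = []
    step = 0  # how many letters have been processed so far
    for ch in text.upper():
        if "A" <= ch <= "Z":
            shift = (start + step) % 26  # move on to the next alphabet each letter
            if decrypt:
                shift = -shift
            out.append(chr((ord(ch) - ord("A") + shift) % 26 + ord("A")))
            step += 1
    return "".join(out)


secret = progressive_shift("HELLO", "S")
print(secret)                                        # ZXFGK
print(progressive_shift(secret, "S", decrypt=True))  # HELLO
```

Because the shift simply goes up by one with every letter, anyone who recovers one position's shift can predict all the others.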

  13. Vigenère (simplified) key=”s” BOOK JWVP

  14. Polyalphabetic system • Cryptosystem with several components. • Systematic way of moving from one cryptosystem to the next. • But there are two weaknesses in simplified Vigenère. • Direct standard alphabets: breaking one character gives away the whole alphabet. • The pattern of movement is too obvious.

  15. Full Vigenère • Use a keyword to control the jumps between alphabets. • The pattern of movement is no longer as obvious.
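A minimal sketch of keyword-controlled enciphering, assuming the usual convention that key letter A means a shift of 0, B a shift of 1, and so on, and that spaces and punctuation are dropped. The function name `vigenere` and its `decrypt` flag are choices made for this sketch.

```python
def vigenere(text: str, keyword: str, decrypt: bool = False) -> str:
    """Encipher (or decipher) text with a repeating keyword.

    Assumption: key letter A = shift 0, B = shift 1, ... Non-letters are dropped.
    """
    key = [ord(k) - ord("A") for k in keyword.upper() if "A" <= k <= "Z"]
    if not key:
        raise ValueError("keyword must contain at least one letter")
    out = []
    i = 0  # index into the keyword; advances only on letters
    for ch in text.upper():
        if "A" <= ch <= "Z":
            shift = key[i % len(key)]
            if decrypt:
                shift = -shift
            out.append(chr((ord(ch) - ord("A") + shift) % 26 + ord("A")))
            i += 1
    return "".join(out)


print(vigenere("ATTACK AT DAWN", "LEMON"))               # LXFOPVEFRNHR
print(vigenere("LXFOPVEFRNHR", "LEMON", decrypt=True))   # ATTACKATDAWN
```

The same function handles both directions because deciphering is just shifting backwards by the same key letters.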

  16. Vigenère key=”SYMBOL” THE ATOMIC ENERGY L..

  17. Exercise • Encipher THE ATOMIC ENERGY with the keyword SYMBOL • Decipher AVYUL HWLEE UCZLL LTYVI YOFJI ZSLNI knowing that the keyword is HOUSE

  18. Vigenère key=”HOUSE” • AVYUL TH...

  19. Breaking Vigenère. • How many alphabets? • Index of co-incidence • Babbage-Kasiski examination • Once you know how many alphabets there are, use frequency analysis as for regular shift ciphers.

  20. Index of co-incidence • Based on arguments about probability. • Intuition: measure roughness of frequency distribution • Mathematical details follow

  21. Roughness of distributions • The smoothest distribution has each letter happening 1/26th of the time. • The roughest has one letter happening 100% of the time. • Normal English has some unevenness, so it is less smooth than a totally uniform distribution.

  22. Index of co-incidence • Get a count f[letter] of how often each letter occurs. • Multiply f[letter]*(f[letter]-1) to get the number of co-incidences involving that letter, then add the results for all letters together. • Divide by N(N-1), the number of co-incidences you would get if all N letters of the text were the same.

  23. Index of co-incidence • IC = sum(f[letter]*(f[letter]-1)) / (N*(N-1)), where N is the total number of letters in the text
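The formula translates almost directly into Python. This is a sketch rather than a reference implementation: it assumes only the letters A to Z count, and the function name `index_of_coincidence` is invented here.

```python
from collections import Counter


def index_of_coincidence(text: str) -> float:
    """IC = sum(f*(f-1)) / (N*(N-1)) over the letters A-Z in text."""
    letters = [c for c in text.upper() if "A" <= c <= "Z"]
    n = len(letters)
    if n < 2:
        raise ValueError("need at least two letters")
    counts = Counter(letters)  # f[letter] for every letter that occurs
    return sum(f * (f - 1) for f in counts.values()) / (n * (n - 1))


print(index_of_coincidence("AAAA"))                        # 1.0 (roughest possible)
print(index_of_coincidence("ABCDEFGHIJKLMNOPQRSTUVWXYZ"))  # 0.0 (no letter repeats)
```

On short texts the value is noisy, so it is only a reliable guide for messages of reasonable length.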

  24. Index of coincidence • IC has a value of about 0.038 (1/26) if the letters are evenly distributed, which is what you get if the polyalphabetic cipher uses a large number of alphabets. • It has a value of about 0.066 for English text, and for monoalphabetic encryptions of English text, since these only relabel the letters.
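One common way to turn these two reference values into a guess at the number of alphabets is sketched below. It is an assumption of this sketch, not something the slides spell out: for each candidate number of alphabets, split the ciphertext into that many interleaved columns and average the IC of the columns. With the right number, each column is a plain shift cipher, so the average should sit nearer 0.066 than 0.038. The helper names are hypothetical.

```python
from collections import Counter


def _ic(letters: str) -> float:
    """Index of coincidence of a string of A-Z letters (length >= 2)."""
    n = len(letters)
    counts = Counter(letters)
    return sum(f * (f - 1) for f in counts.values()) / (n * (n - 1))


def average_column_ic(ciphertext: str, n_alphabets: int) -> float:
    """Average IC over the n_alphabets interleaved columns of the ciphertext."""
    letters = "".join(c for c in ciphertext.upper() if "A" <= c <= "Z")
    columns = [letters[i::n_alphabets] for i in range(n_alphabets)]
    scores = [_ic(col) for col in columns if len(col) >= 2]
    if not scores:
        raise ValueError("text too short for this many alphabets")
    return sum(scores) / len(scores)


# Toy illustration: a text that alternates between two "alphabets".
print(average_column_ic("ABABABAB", 2))  # 1.0
```

In practice you would print the average for 1, 2, 3, ... alphabets and look for the first value that jumps up towards the English figure.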

  25. Idea to quantify roughness • Count the number of times a pair of letters drawn at random from the text happen to be the same. • For the roughest possible, we always get the same letter, so a text of length N has N(N-1) repeats. • For the smoothest possible, we get far fewer: about 1/26 as many.

  26. Babbage-Kasiski method • Explained well in Code Book pp. 67-72 • Key idea: what does it mean if we find a sequence of repeated characters in a message that has been encoded using a repeated keyword?

  27. Babbage-Kasiski method • Key idea: what does it mean if we find a sequence of four or more repeated characters in a message that has been encoded using a repeated keyword? • Most likely: a sequence of four or more English characters in the plaintext has been encoded twice starting from the same place in the repeating keyword. • Less likely: it’s an accident, and some other arrangement of English letters gives rise to a repeat by chance.

  28. Repeats • If a repeat is due to the same thing being encoded twice in the same way, then the keyword must have been used a whole number of times to get from one occurrence to the other. • So, keep track of the spacing between repeats.

  29. Repeats • So, keep track of the spacing between repeats. • Nearly every repeat will have a spacing that divides evenly by the length of the keyword. • So, break the spacings into factors and look for something that (almost) always turns up.

  30. Repeats • So, break the spacings into factors and look for something that (almost) always turns up.
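The repeat-and-factor bookkeeping can be sketched as follows. This is a rough illustration, assuming repeats of exactly four letters are enough to work with and that only candidate keyword lengths up to 20 are of interest. Both function names and both defaults are choices made for this sketch.

```python
from collections import Counter, defaultdict


def repeat_spacings(ciphertext: str, length: int = 4) -> list[int]:
    """Distances between successive occurrences of each repeated sequence."""
    text = "".join(c for c in ciphertext.upper() if "A" <= c <= "Z")
    positions = defaultdict(list)
    for i in range(len(text) - length + 1):
        positions[text[i:i + length]].append(i)
    spacings = []
    for starts in positions.values():
        # Sequences seen only once contribute nothing here.
        spacings.extend(b - a for a, b in zip(starts, starts[1:]))
    return spacings


def factor_counts(spacings: list[int], max_factor: int = 20) -> Counter:
    """For each candidate length 2..max_factor, count the spacings it divides."""
    counts = Counter()
    for gap in spacings:
        for k in range(2, max_factor + 1):
            if gap % k == 0:
                counts[k] += 1
    return counts


gaps = repeat_spacings("ABCDEFGHIJABCDKLMNOPABCD")
print(gaps)                                 # [10, 10]
print(factor_counts([10, 15, 25]).most_common(1))  # [(5, 3)]
```

A factor that divides (almost) every spacing is a candidate keyword length. Small factors such as 2 tend to score well by accident, so the counts are a guide rather than a verdict.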

  31. Breaking Vigenère. • Once you know how many alphabets there are, use frequency analysis as for regular shift ciphers. • If there are five different alphabets, tally up characters 1,6,11,... into one table, 2,7,12,... into a second, 3,8,13,... into the third, and so on up to the fifth. • Each table will show the characteristic frequency pattern of a shifted alphabet (high A and E close to each other, low J and K next to each other, X, Y, Z all low, etc.)

  32. Breaking Vigenère. • Once you know how many alphabets there are, use frequency analysis as for regular shift ciphers. • Each table will show the characteristic frequency pattern of a shifted alphabet (high A and E close to each other, low J and K next to each other, X, Y, Z all low, etc.) • See if the keyword is sensible. It might be an English word. Then plug in the letters and check whether the message works out.
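The tallying and shift-matching steps can be sketched as below. Several things here are assumptions of the sketch rather than part of the slides: the letter frequencies are approximate textbook values for English, the match is scored with a chi-squared statistic (eyeballing the tables works too), and key letter A is taken to mean a shift of 0. All names are hypothetical.

```python
from collections import Counter

# Approximate relative frequencies of letters in English text (assumed values).
ENGLISH_FREQ = {
    "A": 0.0817, "B": 0.0149, "C": 0.0278, "D": 0.0425, "E": 0.1270,
    "F": 0.0223, "G": 0.0202, "H": 0.0609, "I": 0.0697, "J": 0.0015,
    "K": 0.0077, "L": 0.0403, "M": 0.0241, "N": 0.0675, "O": 0.0751,
    "P": 0.0193, "Q": 0.0010, "R": 0.0599, "S": 0.0633, "T": 0.0906,
    "U": 0.0276, "V": 0.0098, "W": 0.0236, "X": 0.0015, "Y": 0.0197,
    "Z": 0.0007,
}


def columns(ciphertext: str, n_alphabets: int) -> list[str]:
    """Characters 1, n+1, 2n+1, ... go in the first column, and so on."""
    letters = "".join(c for c in ciphertext.upper() if "A" <= c <= "Z")
    return [letters[i::n_alphabets] for i in range(n_alphabets)]


def best_shift(column: str) -> int:
    """Shift (0-25) whose removal makes the column look most like English."""
    if not column:
        return 0
    total = len(column)
    best, best_score = 0, float("inf")
    for shift in range(26):
        counts = Counter(chr((ord(c) - 65 - shift) % 26 + 65) for c in column)
        # Chi-squared distance between observed and expected letter counts.
        score = sum((counts.get(l, 0) - total * p) ** 2 / (total * p)
                    for l, p in ENGLISH_FREQ.items())
        if score < best_score:
            best, best_score = shift, score
    return best


def guess_keyword(ciphertext: str, n_alphabets: int) -> str:
    """One key letter per column, assuming key letter A means shift 0."""
    return "".join(chr(best_shift(col) + 65) for col in columns(ciphertext, n_alphabets))


print(columns("ABCDEFGHIJK", 5))  # ['AFK', 'BG', 'CH', 'DI', 'EJ']
```

With short columns the statistical guess can be wrong for a letter or two, which is where checking whether the keyword looks like a sensible word helps.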
