1 / 25

Lesson 21

Lesson 21. Continue Direct Methods Patterson Methods. E's. The real intense data can be located by dividing the observed value of F by the expectation value at that theta. The results are called normalized structure factors and are given the symbol E.

vanig
Download Presentation

Lesson 21

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Lesson 21 • Continue Direct Methods • Patterson Methods

  2. E's • The real intense data can be located by dividing the observed value of F by the expectation value at that theta. • The results are called normalized structure factors and are given the symbol E. • After the E's are calculated they are renormalized so that the average value of E is 1.0 • Therefore E>1.0 are strong reflections.

  3. Another Use of E's • Calculate the mean value of |E2-1| • If this value is about 0.97 suggests the structure is centric • If value is near 0.74 suggests structure is accentric • Works best if all the atoms have about the same atomic number. • Can be completely wrong!

  4. The Sayre equation (1952)‏ In the same issue of Acta Crystallographica in 1952, Sayre, Cochran and Zachariasen independently of each other proposed phase relations that could be summarized with the equation: Fh = qh’ (Fh’ Fh-h’)‏ This is known today as the Sayre equation. q is a constant that depends on sin()/ of the reflection h (hkl). The summation is performed over all reflections h’ (h’k’l’). For a structure with equal point atoms, the electron density (0 oder Z) is proportional to its square (0 or Z2). The convolution theorem then leads directly to the Sayre equation. This equation is under these conditions an exact relationship, provided that the summation is performed over all reflections, including F000 (which can not normally be measured).

  5. Normalized structure factors Direct methods turn out to be more effective when the observed structure factors Fh are modified to allow for the effects of the electron distribution within an atom and its thermal motion. The resulting normalized structure factors Ehcorrespond to the structure factors of a point atom structure. Eh2 = (Fh2/) / <F2/>resolution shell The statistical factor  is usually 1, but can be greater for special reflection classes (e.g. 4 for 00l in tetragonal space groups). <F2/> can either be calculated directly or modeled with the help of an exponential function (Wilson plot).

  6. The triplet relation For the Sayre equation with E instead of F: Eh = qh’ (Eh’ Eh-h’)‏ one can compare the phases of the left and right sides for a single h’ contribution instead of a summation: h = h’ + h-h’ (modulo 360°)‏ If this equation were exact, it would be possible to use it to derive all the phases from a small number of starting phases. However it should be understood in a statistical sense. Starting from suitable assumptions, for example that the structure consists of equal, randomly distributed atoms, it is possible to derive a probability distribution for this equation(Cochran, 1955): P() = g exp( 2|EhEh’Eh-h’ |cos()/N½ )‏ where  = h – h’ – h-h’, g is a normalizing factor and N is the number of atoms in the corresponding primitive unit-cell. It follows that the phase relations are more reliable for small structures and for the largest E-values!

  7. The tangent formula (Karle & Hauptman, 1956)‏ The tangent formula, often in a well disguised form, is still the key formula for the solution of structures by direct methods. The formula is nothing more than the standard method of calculating the phase of a sum of complex numbers; it is used to calculate the phase of the sum represented by the Sayre equation: h’ | Eh’ Eh–h’ | sin( h’ + h–h’ ) tan(h) = h’ | Eh’ Eh–h’ | cos( h’ + h–h’ )‏ In the calculation of the phases, the signs of the numerator and denominator must both be taken into account, so that the resulting angle is in the full range 0–360°. With sufficient number crunching power it is possible to solve small structures starting from random phases by repeated application of this equation. The Nobel prize for chemistry for 1985 was awarded to J. Karle and H. A. Hauptman for their contributions to direct methods.

  8. How can one find the Minimum? The phase problem is really a search problem; there are various structures (minima) that are reasonably consistent with the tangent or related equations, including both the correct solution and the statistically too perfect uranium atom pseudo-solution which has all phases zero!

  9. The limits of direct methods in reciprocal space Conventional direct methods, using improved versions of the tangent formula, are computationally extremely efficient for the solution of structures of up to about 100 unique atoms. Small structures can even be solved in one or two seconds. In spite of this, these methods rapidly run out of steam for structures larger than about 200 independent equal atoms, and few structures larger than 200 atoms have been solved in this way. The efficiency of the tangent formula as a search algorithm lies in its ability to relate phases of reflections that lie far apart in reciprocal space. The weaknesses of this formula – for example the tendancy to produce a uranium atom solution – become more serious as the structures get larger. It is necessary to constrain the phases in some way so that they correspond to a chemically sensible structure. All such direct methods require data to a resolution of 1.2 Å or better (atomic resolution). Whether this is because the atoms have to be resolved from each other, or simply because the data to parameter ratio needs to be high enough, is not clear.

  10. Direct Methods in Practice • Calculate the E's • Try to find a few reflections that can be used to produce the entire data set. • The parity groups of these reflections are important • Eee, eeo, eoe, oee, ooe, oeo, eoo, ooo • Some programs normalize E's so that each parity group's average is 1.0 • Arbitrarily assign phases to these starting reflections and expand to derive as many phases as possible.

  11. A Problem • This approach produces many possible solutions. Obviously most are not correct. • How can the quality be evaluated. • Do a Fourier Map on the E's and analyze this for chemical correctness. This is way to slow. • Check on consistency • do the expanding set of phases suggest common phases on other data • do the phases predict the weak relfections are weak

  12. FOM • The indicator used is the FOM –Figure of Merit. • Its value depends on the program used. • Need some familiarity with the individual program to have a feel for what good values are

  13. Programs • MULTAN –original program rarely used anymore • SHELXS – part of SHELXTL –FOM should be as small as possible usually much less than one • The Sir programs • SIR2002 • SIR2004 Typically FOM is greater than 3 but since this package is so automatic user does not even notice.

  14. Dual space methods random starting phases Weeks, Miller, DeTitta, Hauptman et al. (1993)‏ SF-calculation reciprocal space:refine phases real space:assign peaks to atoms FT many cycles, E > Emin When the figures of merit indicate a good solution, it can be extended to the full structure by using all the data. Dual space methods were first implemented in the computer program Shake and Bake, and later in other programs. In this way it is possible to solve structures containing up to about 1000 independent atoms (not counting H). Atomic resolution is however still necessary.

  15. The heavy atom method By interpretation of the Patterson function, it is possible to find the positions of a small number of heavy atoms in the cell. These enable the calculated structure factor Fc (a complex number) or the corresponding amplitude Fc and phase c to be calculated for each reflection. A difference Fourier synthesis with coefficients |Fo–Fc| and phases c enables further atoms to be found as peaks; the approximation = c becomes better as the structure becomes more complete. The full structure may be found by iterative application of such difference maps. Although it is no longer considered necessary to prepare heavy atom derivatives such as bromobenzoates to solve small molecule structures (direct methods are so effective), heavy atom derivatives are still useful in macromolecular crystallography. Methionine can be replaced with selenomethionine and thymine with bromouracil; heavy ions such as Hg2+, Pb2+ and I– can be introduced by soaking the crystals in solutions of the appropriate salts, and even xenon derivatives may be prepared in special pressure chambers.

  16. The Patterson function FT F2 Patterson The Patterson function is a Fourier synthesis performed with all phases set to zero and the measured F2-values as coefficients. The dimensions of the unit-cell and the lattice type of the Patterson are identical with those of the crystal structure. The Patterson is a vector map: each atom-atom vector in the structure corresponds to a peak in the Patterson. The peak height is proportional to the product of the atomic numbers of the two atoms and to the number m of vectors that are equal in magnitude and direction and so overlap: Hm ZiZj Usually the origin peak (height proportional to Zi2) is scaled to 999, so that the peak heights of the remaining peaks are given by: H = 999 m Zi Zj/Zi2 This summation is over all atoms in the unit-cell. FT

  17. One heavy atom in P1 In this space group, for each atom at x, y, z there is a symmetry equivalent atom at –x, –y, –z and thus a Patterson peak at 2x, 2y, 2z. Crystals of C32H24AuF5P2 have two formula units in the unit-cell. The two gold atoms are related to each other by the inversion center. In addition to the origin peak (999) there is a peak of height 374 with X=0.318, Y=0.471, Z=0.532, and of course a peak at –X, –Y, –Z, that are much higher than the rest (145). This peak height is in good agreement with the calculated value of 377 for a Au—Au vector. In order to calculate x from 2x it would be far too simple just to divide by 2! We have to take into account that there must also be peaks in the next unit-cell of the Patterson with X = 2x+1 etc. Thus we calculate the possible gold coordinates as follows: x = 0.318/2 or 1.318/2 = 0.159 or 0.659 y = 0.471/2 or 1.471/2 = 0.236 or 0.736 z = 0.532/2 or 1.532/2 = 0.266 or 0.766 These eight possible positions correspond to the eight possible ways of putting the origin on an inversion center in space group P1.

  18. Two independent heavy atoms in P1 The compound [C24H20S4Ag]+ [AsF6]– crystallizes in P1 with a cell volume of 1407 Å3. Two formula units per cell (72 atoms) correspond to 19.5 Å3 per atom. It is useful to calculate a difference vector table: x1 –x1 x2 –x2 x10 –2x1 x2–x1 –x1–x2 –x1 2x1 0x1+x2 x1–x2 x2 x1–x2–x1–x20 –2x2 –x2 x1+x2x2–x1 2x2 0 (x1+x2) and (x2–x1) have m = 2, the null-vectors (0) have m = 4; With Z(Ag) = 47, Z(As) = 33, Z2 = 11384 and H = 999 mZi Zj / Z2 the following Patterson peaks can be predicted: Ag—As m = 2 height = 272; Ag—Ag m = 1 height = 194; As—As m = 1 height = 96.

  19. Interpretation of the Patterson XYZ height interpretation 1 0 0 0 999 origin • 0.765 0.187 0.974 310 x(Ag)+x(As)‏ • 0.392 0.099 0.325 301 x(Ag)–x(As)‏ 4 0.159 0.285 0.298 250 2x(Ag)‏ · · · · · · · · · · · · · · · · · · · · · · · · • 0.364 0.077 0.639 102 2x(As)‏ A consistent solution would be: Ag: x = 0.080, y = 0.143, z = 0.149 As: x = 0.682, y = 0.039, z = 0.820 For the first atom (Ag) there are 8 equivalent solutions, so one can simply divide 2x by 2. The second atom (As) must however be consistent with the Ag, so x(Ag) is subtracted from x of peak 2 and then checked whether this As position agrees with the other vectors.

  20. Two independent heavy atoms in P21 x1 y1z1x2 y2z2 –x1 ½+y1 –z1 –x2 ½+y2 –z2 x1 y1z10–x1+x2 –y1+y2 –z1+z2 –2x1 ½ –2z1–x1–x2 ½–y1+y2 –z1–z2 x2 y2z2x1–x2 y1–y2z1–z20–x1–x2 ½+y1–y2 –z1–z2 –2x2 ½ –2z2 –x1 ½+y1 –z1 2x1 ½ 2z1x1+x2 ½–y1+y2z1+z20x1–x2 –y1+y2z1–z2 –x2 ½+y2 –z2x1+x2 ½+y1–y2z1+z2 2x2 ½ 2z2 –x1+x2 y1–y2 –z1+z20 In the table –½ is replaced by +½ because they are equivalent. These Patterson peaks can be summarized as follows: 0 0 0 m = 4 origin ±{ 2x1 ½ 2z1 } m = 1 Harker section at y = ½ ±{ 2x2 ½ 2z2 } m = 1 ±{ x1–x2y1–y2z1–z2} m = 1 ±{ x1–x2 –y1+y2z1–z2 } m = 1 cross vectors ±{ x1+x2 ½+y1–y2z1+z2} m = 1 ±{ x1+x2 ½–y1+y2z1+z2} m = 1 The effective space group of this Patterson is P2/m, which can be deduced from the last four (general) vectors.

  21. The symmetry of the Patterson function Because for each vectori  j there is always a vector j  i, the Patterson must possess an Inversion center. In general the symmetry of the Patterson is determined by the symmetry of the diffraction pattern, which also possesses an inversion center. Glide planes of the space group become, as in reciprocal space, normal mirror planes, and screw axes become rotation axes without translation. In other words, the Patterson exhibits the same symmetry as the Laue group.

  22. Harker sections The symmetry of the Patterson leads to a concentration of peaks in particular planes (or along particular lines). E.g. P21: Atoms at x y z; –x ½+y –z. Self-vectors at 2x ½ 2z, Harker section at Y = ½. P2: Atoms at x y z; –xy –z. Self-vectors at 2x 0 2z, Harker section at Y = 0. Pm: Atoms at x y z; x –yz. Self-vectors at 0 2y 0, Harker line at X = 0, Z = 0. P2 and Pm would have identical systematic absences, but could be distinguished from one another using the Harker section and line. The 21 axis in P21 results in peaks concentrated in the Harker section at Y = ½; however the systematic absences k = 2n+1 for reflections 0 k 0 would give a more reliable indication of the the space group.

  23. One heavy atom in P21 An organoselenium compound with one unique Se atom (two in the unit-cell) in P21 gave the Y = ½ Harker section in the diagram. The peaks at ±(2x ½ 2z) = ±(0.36 0.50 0.36) correspond to a Se atom at 0.18 0 0.18 and its symmetry equivalent at –0.18 0.50 –0.18. In this space group one heavy atom is not sufficient to find the remaining atoms; a difference Fourier synthesis would be a double image, because the Se atoms correspond to a centrosymmetric substructure and can not resolve the enantiomorph!

  24. Problems with the heavy atom method In many cases, one heavy atom suffices to find the remaining atoms by (possibly iterative) application of difference Fourier syntheses with amplitudes|Fo–Fc| and Phases c. Exceptions are those non-centrosymmetric space groups (such as P21 and C2) in which one unique heavy atom generates a centrosymmetric arrangement of heavy atoms in the cell; then all phases are 0º or 180º and the resulting difference electron density retains the inversion center, i.e. corresponds to a double image consisting of the two possible enantiomorphs. A different problem – which also arises when the structure is determined by direct methods – is the assignment of the element types; electron density is always blue! The density is proportional to the atomic number, but isoelectronic species are particularly difficult to distinguish, as will be seen in the first exercise...

  25. Homework • Assign all possible space groups to data with the following systematic presences • Monoclinic—hkl, h+k=2n; h0l, l=2n • Orthorhombic—0kl, k+l=2n; h0l,h=2n • Orthorhombic-- hkl, h+k=2n, k+l=2n, h+l=2n; 0kl, k+l=4n; h0l, h+l=4n • hexagonal/trigonal-- h-hl, l=2n; 00l, l=2n

More Related