Research ArticleSTRUCTURAL BIOLOGY

Serial femtosecond and serial synchrotron crystallography can yield data of equivalent quality: A systematic comparison

See allHide authors and affiliations

Science Advances  17 Mar 2021:
Vol. 7, no. 12, eabf1380
DOI: 10.1126/sciadv.abf1380

Abstract

For the two proteins myoglobin and fluoroacetate dehalogenase, we present a systematic comparison of crystallographic diffraction data collected by serial femtosecond (SFX) and serial synchrotron crystallography (SSX). To maximize comparability, we used the same batch of micron-sized crystals, the same sample delivery device, and the same data analysis software. Overall figures of merit indicate that the data of both radiation sources are of equivalent quality. For both proteins, reasonable data statistics can be obtained with approximately 5000 room-temperature diffraction images irrespective of the radiation source. The direct comparability of SSX and SFX data indicates that the quality of diffraction data obtained from these samples is linked to the properties of the crystals rather than to the radiation source. Therefore, for other systems with similar properties, time-resolved experiments can be conducted at the radiation source that best matches the desired time resolution.

INTRODUCTION

Structural biology has been highly successful in providing three-dimensional information about the architecture of biomolecules. Usually, these structures display equilibrium state conformations, important snapshots that provide insight into functional states of these biomolecules (1). The advent of x-ray free-electron laser (XFEL) sources promised the possibility to study ultrafast time scales down to the femtosecond domain to watch conformational changes and even chemical reactions (2). Addressing irreversible reactions at these ultrafast time scales and guaranteeing homogeneous reaction initiation required the use of micron-sized protein crystals. However, the extremely intense beam brightness of XFEL sources will destroy these protein microcrystals during a single exposure. This dilemma was circumvented by serial data collection schemes in which thousands of microcrystals are exploited, to each provide a single still diffraction pattern. Because of the diffraction-before-destruction approach, which allows the collection of a useful diffraction pattern by outrunning the onset of damage, it has been argued that serial femtosecond crystallography (SFX) data from XFELs are de facto radiation damage free (36).

Soon after the first protein crystal structures were solved from SFX data, the method was adapted for use at synchrotrons, giving rise to serial synchrotron crystallography (SSX) (7, 8). Most SSX experiments thus far involved static structures. However, since most enzymes have median turnover times in the hundreds of milliseconds range, synchrotrons represent a valid alternative for time-resolved experiments. Historically exploited via Laue diffraction on single crystals (9, 10), time-resolved experiments have recently been the subject of serial data collection approaches to study enzymes on time scales from milliseconds to many seconds (1116). The results of those studies have sparked renewed and expanded interest in serial crystallography methods, and the implementation of time-resolved SSX at a number of synchrotron beamlines can be expected, in particular, as improving synchrotron technology will also bring faster time domains into reach.

Because of the large number of photons compressed into the x-ray laser pulses, SFX exposure times can be very short, down to the femtosecond domain (2, 17). As a consequence of the more limited photon flux at synchrotron sources, accordingly longer exposure times are needed in SSX, and consequently, radiation damage cannot be fully outrun. This is particularly severe for time-resolved experiments, because data need to be collected at room temperature, leading to much increased radiation sensitivity. However, while radiation damage is more pronounced at synchrotron sources, much of its deleterious effects can be mitigated by using low-dose exposure, which still leaves the electron density interpretable with little ambiguity (18, 19). The dose causing a 50% drop in the overall diffraction intensity for lysozyme at room-temperature SSX has been reported to be 380 kilograys (kGy) (20). While this represents the upper dose limit for serial crystallography experiments, which should not be exceeded at room temperature, the onset of site-specific damage could be observed at a much lower dose of approximately 80 kGy (20). Although this value is much lower than doses encountered during typical cryoexperiments, at modern microfocus beamlines, it is still possible to collect multiple frames from a single microcrystal at room temperature within this dose limit (1921).

For a systematic comparison of SFX and SSX data, we analyzed two different proteins at both radiation sources. This study does not intend to discuss the scope of each type of instrument but to compare their performance in the overlapping micron-sized crystal regime. Smaller crystals are clearly beneficial for reaction initiation in time-resolved experiments, permitting a more homogeneous light activation and faster diffusion rates (22, 23). However, under practical considerations, crystals in the submicron domain display experimental challenges with respect to their generation and identification. Since nanocrystals are smaller than the wavelength of visible light, they escape convenient determination by canonical light microscopy, and alternative methods need to be sought. In addition, Ostwald ripening rather supports the growth of larger particles, which, in practice, favors the growth of crystals in the micron-size regime. At the same time, most enzymes have turnover numbers in the high millisecond time domain, and we have recently demonstrated that these can be effectively addressed both by optical excitation and by in situ mixing reaction initiation in micron-sized crystals (16, 24, 25). As indicated by the crystal sizes used in a number of recent experiments (1116), practical simplicity suggests that there is a lot of use for crystals in the micron-size regime. This potential could be further substantiated if different dynamic aspects can be synergistically addressed at both synchrotrons and free-electron lasers without the need to change sample conditions. To minimize sources of potential discrepancies, we used the same batch of micron-sized crystals, the same sample delivery device, and the same data analysis software for each of the datasets.

RESULTS AND DISCUSSION

To evaluate potential differences in quality between microcrystal diffraction data obtained from XFEL (SFX) and those from synchrotron sources (SSX), we exploited two well-established protein systems: (i) the relatively radiation-tolerant fluoroacetate dehalogenase (FAcD) and (ii) the highly radiation-sensitive myoglobin (MB). FAcD is a homodimeric protein from the soil bacterium Rhodopseudomonas palustris and one of the very few enzymes that can cleave fluorine carbon bonds. FAcD shows half-of-the-sites reactivity and incorporates a covalent substrate-enzyme intermediate into its SN2 substitution mechanism (16, 26). MB is a monomeric heme protein involved in the oxygen exchange of mammalian muscle tissue. It was the first protein that had its three-dimensional structure determined and is known to be highly radiation sensitive because of the iron center of its heme chromophore (27, 28). Crystallization conditions for both systems were optimized to fall into a comparable low micron-size regime, a regime quite popular in multiple other protein XFEL and SSX studies (8, 2933). For sample delivery, we made use of our previously described fixed target approach (18, 3437). Briefly, approximately 25,000 protein microcrystals are mounted on a silicon support chip in random orientations. These chips are then raster-scanned in the x-ray beam by means of a fast and accurate closed-loop piezo translation stage system. This system allows the collection of data from up to four chips (i.e., approximately 100,000 microcrystals) per hour. The modular architecture of this system enables its use at different synchrotron and XFEL end stations (15, 18, 21, 25, 3437). For the present comparison, data were collected at the European Molecular Biology Laboratory (EMBL) beamline P14 at the PETRA III synchrotron at Deutsches Elektronen-Synchrotron (DESY) and at SPring-8 Angstrom Compact free electron Laser (SACLA) at the RIKEN SPring-8 Center using comparable beam sizes, which are all smaller than the protein crystals. These installations represent third- and fourth-generation light sources, with a difference in peak brightness of 8 to 10 orders of magnitude (38). In our experiment, the pulse duration differed by nine orders of magnitude, but the total number of photons each crystal received differed by less than one order of magnitude. In addition to applying the same data collection hardware, we used protein crystals from the same crystallization batches in both experiments. The microcrystals of both proteins were isomorphous and of the same physical dimensions. This should reduce any crystal quality differences that may influence overall data quality parameters. Moreover, to reduce any further mismatches in the analysis, we processed the data with the same software package, CrystFEL 0.8.0 (39).

Global data quality comparison

The global data quality parameters for both proteins indicate close equivalence of the SSX and SFX data over the whole resolution range (Table 1 and Fig. 1). For both proteins, signal-to-noise ratio (SNR), multiplicity, Rsplit, and overall completeness are almost identical for the SFX and SSX data (Fig. 2). Only minor resolution differences can be observed, as indicated by the resolution shell half-set correlation coefficients (CC1/2), Rsplit, and refinement Rfree values. While the MB dataset shows virtually no differences in B factors between the SSX and SFX dataset, the B factors are slightly lower for the SFX dataset of FAcD (Table 1). This may correlate with the marginally better data statistics in the high-resolution shell of FAcD as made evident by the SNR and Rsplit values. It is conceivable that the FAcD crystals would have diffracted to a slightly higher resolution in the XFEL beam relative to the synchrotron beam as indicated by the B factors. The current resolution cutoff was defined by the detector distance at the XFEL, allowing data collection to a resolution of 1.75 Å at the edge of the detector, which then was matched by setting the detector at the appropriate distance at the synchrotron. It is conceivable that a slightly higher resolution could have been obtained by a closer detector distance; however, the data quality falloff for the SFX and SSX data is highly similar, indicating only a marginal difference between both radiation sources.

Table 1 Data collection and refinement statistics.

Statistics for the highest-resolution shell are shown in parentheses.

View this table:
Fig. 1 Global data quality parameters as a function of resolution.

(A) Half-set correlation coefficient for SSX (blue, open circles) and SFX (orange, open triangles) for FAcD and MB datasets. The green curve displays the relative correlation coefficient between the SFX and SSX data. The datasets were limited to the same number of diffraction images (10,000 for FAcD and 8000 for MB), and the same resolution cutoff was applied to SSX and SFX data. (B) Corresponding refinement Rfree values.

Fig. 2 Global data quality parameters as a function of dataset size (number of crystals).

Global data quality parameters for SSX and XFEL datasets: (A) CC1/2 values, (B) Rfree values, (C) SNR estimate, (D) Rsplit values, and (E) completeness. SSX data are displayed as blue, open circles and SFX data as orange, open triangles. Overall values are shown as solid lines and highest-resolution shell values as dashed lines.

In addition to the integration statistics, the general quality of the refined structures needs to be assessed. Globally, this can be achieved by comparing the free R values (Rfree) of the SSX and SFX structures (Table 1). The absolute percentage differences between the overall Rfree values for FAcD and MB are negligible, while for the high-resolution shell, absolute Rfree value differences of ~4.6 and ~1.8% can be observed, respectively. On a per-residue basis, model differences between the SFX and SSX structures can conveniently be assessed via the root mean square deviation (RMSD) and B factor values. The all-atom RMSD of the SFX and SSX structures for FAcD is 0.12 Å, compared to a coordinate error of 0.20 Å. For MB, the all-atom RMSD of the SFX and SSX structures is 0.13 Å, compared to a coordinate error of 0.23 Å. By collecting room-temperature data from the same batch of micron-sized crystals, using the same sample delivery device and the same software for data analysis, we have attempted to maximize the comparability of data collection at XFEL and synchrotron sources, leaving only the photon flux and energy as the distinctive parameters in this experiment. This experimental similarity is reflected in highly comparable data statistics for the SFX and SSX data for both protein systems. The data quality indicators obtained (such as CC1/2, Rfree, and RMSD) are nearly identical for data from both radiation sources. However, thus far, this statement only holds true for crystals of suitable size (i.e., on the order of a few micrometers), which enable data collection at synchrotrons. XFELs can make use of much higher flux densities, thereby accommodating much smaller crystals, down to the nanometer scale, which currently is outside the range of synchrotrons (40). In summary, by normalizing all experimental parameters, this comparison demonstrates that in (static) serial x-ray crystallography, data quality is predominantly a crystal-dependent property. Our observations reveal that neither obtainable resolution nor CC1/2 or Rfree values strongly depend on the radiation source. As a consequence, the diffraction properties of protein microcrystals can be explored at synchrotron sources to test their suitability for XFEL experiments. This would relieve XFEL beamlines of the high burden of screening time and increase opportunities for testing crystals, as beamtime at synchrotrons is much more readily available. This not only includes static data collection but, of course, also extends to time-resolved experiments, where crystals can be used for experiments at synchrotron sources for observing somewhat slower time domains, which, for current generation synchrotrons at monochromatic wavelengths, encompasses time points longer than the mid-microsecond regime.

Scaling with dataset size

Subsequently, we correlated the quality of the datasets with the number of crystals used for structure determination, randomly picking subsets of the total acquired datasets. Analysis of correlation coefficients, Rfree values, SNR, Rsplit, and completeness again shows a high degree of similarity between SSX and SFX data. Although the SFX data were recorded with a bandwidth approximately two orders of magnitude larger than available at monochromatic synchrotron beams, the number of diffraction patterns needed to converge is similar for both datasets. While true Laue serial diffraction data require fewer images for convergence, this criterion is apparently not met by the increased bandwidth of XFEL beams compared to monochromatic synchrotron beams (41).

In agreement with our previous observations, the datasets of both proteins show almost full completeness, which is more than 95% in the highest-resolution shell, even for a low number (<1000) of diffraction patterns (42). This indicates that, for serial data collection, completeness should be the first data quality metric to be assessed before turning to other quality indicators such as CC1/2 or Rsplit. We argue that if the data are not fully complete, then other data quality indicators are probably meaningless. For both FAcD and MB, the overall quality indicators CC1/2 and Rfree start to converge to reasonable levels at approximately 5000 diffraction images by standard assessments (Fig. 2). While there are only minor improvements for the overall data quality indicators, the high-resolution shell quality indicators still undergo substantial improvements if more than 5000 diffraction patterns are included. The FAcD and MB crystals used in this analysis were of monoclinic and orthorhombic symmetry, respectively—representing the two most commonly encountered crystal systems in the Protein Data Bank (PDB). This suggests that, for the vast majority of cases, reasonable data statistics should be achievable with a similar number of diffraction images. For triclinic symmetry (representing less than 5% of the structures in the PDB), however, a higher number of diffraction patterns would be required.

This corroborates our previous findings that protein structures and even protein small-molecule complexes obtained by SFX and SSX can be solved from ~5000 diffraction patterns—or fewer if high symmetry permits—yielding reasonable electron density (42, 43). However, merging and refinement statistics, as well as the quality of electron density, clearly improve if further diffraction patterns are added. This can be especially important in time-resolved analyses to reveal low occupancy details or to distinguish subtle electron density differences from the noise.

Thus, as a rule of thumb, the present data suggest obtaining approximately 5000 diffraction patterns for each structure, as, at this point, the overall dataset quality has mostly converged to reasonable quality indicators. Provided that micron-sized crystals are used, SFX and SSX appear to be equivalent alternatives not only regarding the quality of the data but also with respect to the required number of diffraction patterns. Exciting developments in nanofocus x-ray beamlines combined with the increased brilliance of next-generation synchrotron sources may narrow the gap of useful crystal sizes in the future, thereby also enabling the use of nanocrystals at synchrotrons (44, 45). Since microsecond time resolution can also be obtained at current generation synchrotron sources, time resolutions faster than this regime are a practical long time range limit for SFX experiments, although longer time delays are technically feasible via, e.g., the hit-and-return (HARE) method (15). XFELs can therefore be focused on ultrafast time-resolution experiments, the time domain on which XFELs are essential.

Radiation damage

For traditional crystallographic experiments at synchrotrons, radiation damage is a key limiting factor in obtaining high-resolution structures even at cryogenic temperatures. Damage is caused by the interaction of the x-ray photons with the electrons in the protein crystals and the energy they deposit in the sample, either caused by the photoelectric effect or inelastic scattering (Compton scattering) of x-rays, leading to primary, secondary, and tertiary damage (46, 47). The traditional Garman limit for cryotemperature structures is 30 MGy (48). The situation is worse at room temperature, typically encountered during serial crystallography experiments. Radiation damage manifests either globally as a loss of diffraction intensity or specifically via chemical modification of protein residues, i.e., breaking of disulfide bonds (46, 49). Recently, dose limits for room-temperature SSX were inferred from lysozyme crystals, indicating a half-diffraction dose of 380 kGy and site-specific damage starting to appear at approximately 80 kGy (20). Notably, the authors implied that the onset of site-specific damage may occur at much lower doses for radiation-sensitive proteins. This agrees well with previous findings that the dose limit at room temperature is highly sample dependent, varying over an order of magnitude for 15 different structures (50). In comparing synchrotron and XFEL sources radiation, one needs to further distinguish between direct and indirect damage. While direct damage occurs via the primary x-ray absorption at an atom in the protein, indirect damage occurs via x-ray absorption in the bulk solvent (46). The latter leads to the formation of reactive oxygen species on the nanosecond-millisecond time scale that can diffuse through the crystal (51). Thus, the advantages of XFELs with respect to radiation damage originate in their short pulse lengths, which outrun those damage mechanisms caused by the diffusion of radicals encountered during longer exposure times at synchrotrons (52, 53). Because of the short pulse length (10 fs) and high intensity of the SACLA beam, established dose criteria cannot be applied to SFX data in an unmodified manner. For reference, we therefore used an updated version of RADDOSE (RADDOSE-XFEL), which now also allows to correctly calculate XFEL and SSX doses (54). We estimated the average dose in the exposed region, indicating that the 80-kGy dose limit was exceeded by a factor of ~2 for the SSX data (Table 2).

Table 2 Results from RADDOSE-XFEL.

View this table:

No global structural differences were observed in either FAcD or MB. To address the site-specific differences between the SFX and SSX data, we used a resolution shell scaling tool to calculate isomorphous difference densities (55). The difference density maps obtained this way do not display the absolute damage but rather indicate relative differences between the two datasets (Fig. 3). For FAcD, the difference density peaks are evenly distributed over the whole structure, indicating no preferred site of radiation damage. By contrast for MB, damage is less homogeneously dispersed throughout the structure; rather, local differences are primarily concentrated around certain areas such as the radiation-sensitive heme center. This agrees with canonical radiation damage mechanisms induced by the reduction of the iron center (Fig. 3) (28). While the SSX data are clearly not radiation damage free, the serial data collection approach apparently allows to collect data that show only minimal differences to SFX data, which outrun the most deleterious radiation damage effects, even for very radiation damage–sensitive proteins such as MB. While the present SSX data were collected at doses of approximately 170 kGy, with new rapid acquisition detectors (Eiger), it is possible to collect SSX data with <2-ms exposure time, thus at much lower doses (<5 kGy), further mitigating the differences between the SFX and SSX data (20).

Fig. 3 Isomorphous difference maps for FAcD and MB, respectively.

Resolution shell–scaled Fobs-SFXFobs-SSX difference maps of FAcD (A) and MB (B). Difference map peaks are homogeneously distributed over FAcD, while difference map peaks are more concentrated around the radiation-sensitive heme center in MB. Proteins are shown as cartoon representations; all maps are shown at ±3 σ.

Data obtained via serial crystallography approaches at synchrotron and XFEL sources can be of equivalent quality. For example, for the samples tested in this study, data quality is a crystal-dependent property and, in first approximation, does not depend on the radiation source.

We conclude that, as a rule of thumb, ~5000 diffraction patterns are required to obtain data statistics and electron density maps of reasonable quality; higher numbers may be required to tease out lowly populated electron density states. Furthermore, our findings suggest that it is possible to tailor SSX data collection to largely avoid the effects of radiation damage, thereby preserving the clean biological interpretability of the results. Because of the overlapping data quality, the choice of the radiation source should primarily be guided by the required time resolution for time-resolved experiments. This study was limited to crystals in the micrometer-size regime and only carried out at two respective radiation sources. Further research with a larger sample pool is required to establish whether our conclusions hold true for a broader spectrum of crystal sizes and SFX data obtained at even brighter radiation sources. However, since other light sources provide similar peak brightness and exposure times, we anticipate that similar results will also be achievable at further instruments, enabling mutually supportive experiments without requiring a change in sample conditions. Experiments with substantially brighter synchrotrons and XFELs, different (more complex) samples, sample delivery systems, timing regimes, detectors, experimental strategies, data analysis methods, etc. are likely to alter such comparisons in the future.

MATERIALS AND METHODS

Protein expression, purification, and crystallization

Recombinant FAcD was purified from Escherichia coli BL21(DE3) as described (26). FAcD was extracted from E. coli cell-free lysate using Ni chromatography with subsequent cleavage of the His6-tag using tobacco etch virus protease. Size exclusion chromatography was completed using 50 mM tris-H2SO4 (pH 8.5) and 150 mM NaCl and buffer-exchanged to remove the NaCl as a final step in purification. FAcD crystals were grown in crystallization buffer [18 to 20% (w/v) polyethylene glycol (PEG) 3350, 200 mM CaCl2, and 100 mM tris-HCl (pH 8.5)]. From these crystals, a microseed stock was generated at a 4 to 8% higher PEG3350 concentration, using a seed bead kit from Hampton Research (HR2-320). Microcrystals were produced using batch crystallization; 100 to 200 μl of seed stock and an equal volume of 0.5 mM FAcD solution were mixed. Within 24 to 72 hours, crystals grew to approximately 20 × 20 × 10 μm3 in size, with a very homogeneous size distribution as determined by light microscopy. CO-bound sperm whale MB was prepared and crystallized as described previously (18, 35). For microcrystallization, 180 to 200 μl of crystallization buffer [50 to 60 mg/ml of MB in 10 mM tris-HCl (pH 9.0) and 2.5 to 2.6 M ammonium sulfate] was added to 3 ml of CO gas–filled Monoject blood collection tubes (Covidien, Mansfield, MA, USA) at 1 atm. To promote nucleation, the solution was seeded with 10 μl of a 1:100 diluted seed solution [crushed MB crystals in 10 mM tris-HCl (pH 9.0) and 3.2 M ammonium sulfate]. To control the size distribution of the final crystals, 180 μl of buffer [10 mM tris-HCl (pH 9.0) and 3.2 mM ammonium sulfate presaturated with CO gas] was added to each tube after the first crystals appeared. MB crystals displayed a larger size distribution and varied from approximately 50 × 50 × 25 to approximately 10 × 10 × 5 μm3 in size. However, most of the crystals had a size of approximately 30 × 30 × 15 μm3. The MB crystals’ size distribution was determined manually when located inside the features of the chip using a Hirox RH-2000 microscope, part of user facilities on site at SACLA (RIKEN, Japan).

SSX data collection

Single-crystalline silicon chips provide a suitable scaffold material for fixed target crystallography. Chip fabrication and sample loading process are described in detail in (3537). The HARE chips hold randomly oriented crystals in precisely defined, bottomless wells (also called features). Within a humid environment, 100 to 200 μl of a suspension of crystals was loaded onto a single chip by applying vacuum suction of 0.4 mbar for approximately 30 s as described previously. The loaded chips were then covered with 2.5 μm of mylar foil to prevent evaporation, and data were collected immediately after chip loading. SSX diffraction data were collected at room temperature (294 K) at EMBL beamline P14 at DESY, Hamburg, using our previously described fixed-target setup (18, 34). Crystals were not rotated during an exposure; still images were recorded at a wavelength of 0.976 Å (12.703 keV) and an exposure time of 37 ms using an Eiger 16M detector (Dectris, Switzerland). The synchrotron beam had a Gaussian profile with a flux of 4 × 1012 photon/s and full width at half maximum (FWHM) dimensions of 5 × 10 μm2, with a rectangular collimation of 15 × 30 μm2. SSX diffraction data were processed using CrystFEL 0.8.0 (39).

SFX data collection

SFX diffraction data of FAcD and MB were collected at room temperature (294 K) at SACLA at SPring-8, Japan, using the same setup as described above. The uncollimated FWHM size of the beam was 4.7 × 3.4 μm2 with an energy of 10.2 keV, a pulse energy of 0.4 mJ, and an energy distribution of 0.5% (FWHM). The crystals were exposed for 10 fs, with each pulse carrying 2 × 1011 photons. SFX diffraction data were processed using CrystFEL 0.8.0 (39).

Molecular replacement and refinement

The structures were determined by molecular replacement in Phaser (56) using PDB IDs 5K3D and 5JOM as search models for FAcD and MB, respectively. Structure refinement was completed by iterative cycles of refinement in phenix.refine (57) and manual model building in COOT (58, 59).

Dose and radiation damage

Doses for each dataset were calculated using the software RADDOSE-3D (v. 4.0) and its subprogram RADDOSE-XFEL, respectively (54). Cuboid crystal forms with dimensions of 20 × 20 × 10 μm3 for MB and 20 × 20 × 15 μm3 for FAcD were applied. For the SSX data, a Gaussian beam profile with a flux of 4 × 1012 photon/s and a FWHM beam of 5 × 10 μm2, with a rectangular collimation of 15 × 30 μm2, was considered at an energy of 12.703 keV for an exposure time of 37 ms and a rotation wedge of 0°. RADDOSE-3D was run with a resolution of 5 pixels/μm and default angular resolution for a wedge size of 0°. For the XFEL data, a Gaussian beam profile, with an uncollimated FWHM size of 4.7 × 3.4 μm2, an energy of 10.2 keV, a pulse energy of 0.4 mJ, and an energy distribution of 0.5% (FWHM) as well as an exposure time of 10 fs were applied. RADDOSE-XFEL was run with a resolution of 0.5 pixels/μm and simulated 1,000,000 photons. Results from three runs were averaged and are summarized in Table 2. The average dose over the exposed region of the crystal is the correct metric to be used for SSX data, while the average diffraction weighted dose is generally an overestimation in this case. Similarly, RADDOSE-3D style average dose (1 MGy) does not apply for SFX data (20). Isomorphous difference maps were calculated in analogy to a previously published approach scaling the datasets to their resolution shells (55). The SFX datasets were scaled to the reference SSX datasets by dividing the limiting resolution range into 20 equal volume bins containing similar numbers of reflections. The reflection amplitudes in each bin were multiplied by a constant scale factor, calculated to set the average amplitude in each bin equal to that of the reference. The code for shell scaling is part of the Vagabond software suite. The resulting scaled datasets were plugged into sftools (CCP4 software suite), and difference structure factors were calculated Fobs-SFXFobs-SSX for both FAcD and MB, respectively (60).

Data analysis

Molecular images were generated in PyMOL (Schrödinger LLC).

https://creativecommons.org/licenses/by-nc/4.0/

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license, which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.

REFERENCES AND NOTES

Acknowledgments: The SSX data were collected at beamlines P14 operated by EMBL Hamburg at the PETRA III storage ring (DESY, Hamburg, Germany). We thank A. R. Pearson (University of Hamburg), S. Horrell (Diamond Light Source), M. Agthe, and T. R. Schneider (EMBL Hamburg) for assistance in using the beamline and for continuous support and helpful discussions in the implementation and improvement of SSX instruments at EMBL Hamburg. We acknowledge T. White for most helpful discussions on data processing. The XFEL experiments were performed at the BL3 of SACLA with the approval of the Japan Synchrotron Radiation Research Institute (JASRI) (proposal nos. 2016A8036 and 2016B8052). We thank the beamline scientists for excellent support. Funding: Support was provided by the Max Planck Society and by the Cluster of Excellence “The Hamburg Centre for Ultrafast Imaging” of the Deutsche Forschungsgemeinschaft (DFG)–EXC 1074–project ID 194651731 (to R.J.D.M.), the People Programme (Marie Curie Actions) of the European Union’s Seventh Framework Programme (FP7/2007–2013) under REA grant no. 623994 (to H.M.M.-W.), the Joachim Herz Foundation (Biomedical Physics of Infection) (to E.C.S. and R.J.D.M.), grant RGPIN-2015-04877 from the Natural Sciences and Engineering Research Council of Canada, the Canada Research Chairs Program (to E.F.P.), and the Canada Excellence Research Chairs Program (to O.P.E.). O.P.E. holds the Anne and Max Tanenbaum Chair in Neuroscience at the University of Toronto, and O.P.E. and R.J.D.M. are CIFAR fellows. Author contributions: E.C.S. and P.M. initiated the project. P.M., T.M., and A.K. prepared the protein and crystals. E.C.S., P.M., D.v.S., H.M.M.-W., A.M., H.S., H.M.G., D.A.S., D.A., R.L.O., F.T., A.K., J.E.B., T.M., W.-L.O., O.P.E., E.F.P., and R.J.D.M. collected the data. E.C.S., P.M., R.B., H.M.G., B.T.E., and A.M. processed the data. E.C.S., P.M., R.B., D.v.S., and G.B. analyzed the data. E.C.S., P.M., and R.B. wrote the manuscript. All authors discussed and corrected the manuscript. Competing interests: The authors declare that they have no competing interests. Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper. Additional data related to this paper may be requested from the authors. The data are available via the RCSB PDB under the accession codes 7A42, 7A43, 7A44, and 7A45.

Stay Connected to Science Advances

Navigate This Article