Prediction on X-ray output of free electron laser based on artificial neural networks

Li, Kenan; Zhou, Guanqun; Liu, Yanwei; Wu, Juhao; Lin, Ming-fu; Cheng, Xinxin; Lutman, Alberto A.; Seaberg, Matthew; Smith, Howard; Kakhandiki, Pranav A.; Sakdinawat, Anne

doi:10.1038/s41467-023-42573-z

Published: 08 November 2023

Nature Communications volume 14, Article number: 7183 (2023)

Abstract

Knowledge of x-ray free electron lasers’ (XFELs) pulse characteristics delivered to a sample is crucial for ensuring high-quality x-rays for scientific experiments. XFELs’ self-amplified spontaneous emission process causes spatial and spectral variations in x-ray pulses entering a sample, which leads to measurement uncertainties for experiments relying on multiple XFEL pulses. Accurate in-situ measurements of x-ray wavefront and energy spectrum incident upon a sample pose challenges. Here we address this by developing a virtual diagnostics framework using an artificial neural network (ANN) to predict x-ray photon beam properties from electron beam properties. We recorded XFEL electron parameters while adjusting the accelerator’s configurations and measured the resulting x-ray wavefront and energy spectrum shot-to-shot. Training the ANN with this data enables effective prediction of single-shot or average x-ray beam output based on XFEL undulator and electron parameters. This demonstrates the potential of utilizing ANNs for virtual diagnostics linking XFEL electron and photon beam properties.

Introduction

Recent advances in X-ray free-electron lasers (XFELs)^1,2,3,4,5,6 at world-wide facilities such as SLAC⁷, SACLA⁸, PAL-XFEL⁹, SwissFEL¹⁰, and the European XFEL¹¹ have demonstrated innovative capabilities and operational configurations that are expected to greatly impact a wide range of proposed science experiments¹². Tunable devices such as variable gap undulators and phase shifters have been integrated into the XFEL to tailor and control the electron beam¹³, opening up fresh opportunities for science experiments. However, as the number of electron beam control parameters increases, so does the complexity of accelerator optimization and tuning. This, along with the shot-to-shot variations from the self-amplified spontaneous emission (SASE) process of XFELs, makes it essential to understand the relationship between the electron beam parameters and the actual X-ray beam properties delivered to a sample.

To understand this relationship, several options are possible. First, the wavefront and spectrum of the XFEL pulse can be determined computationally, though this is a challenging task due to the complexity of the underlying physics, discrepancies between real-world and computational models, and the multitude of variables and parameters involved, especially with the more recent generation of XFELs. Second, real-time nondestructive measurements of the energy spectral and spatial wavefront properties of the XFEL pulse delivered to a sample could also be implemented. One method to do this involves splitting the X-ray pulse into reference and experimental beams using a beam splitter and taking measurements on both beams from shot to shot. This, however, can increase experimental complexity, reduce photon flux, and require additional instrumentation, which may not be feasible depending on the physical constraints of the experimental setup. In addition, accuracy would depend strongly on the quality and performance of the X-ray beam splitter optic.

To overcome these challenges, we develop a virtual diagnostics model based on artificial neural networks (ANNs) and shot-to-shot measurement data of both electron and X-ray beam parameters. ANNs are powerful tools for modeling complex nonlinear relationships, and exploration of their utility to overcome the limitations of conventional methods for accelerator optimization, tuning, and modeling is underway^14,15,16,17. Most machine learning models for XFELs have focused only on the electron beam for tasks such as accelerator and undulator tuning and optimization^18,19, with one study incorporating X-ray spectrometer data²⁰. These studies were made possible by the single-shot diagnostics of the electron beam implemented in the XFEL. With the recent development of high-accuracy single-shot X-ray wavefront sensors for both soft and hard X-rays at XFELs^21,22,23 and the development of single-shot soft X-ray spectrometers based on off-axis zone plates for spectral measurements^24,25, X-ray properties can now be characterized routinely. These diagnostic tools enable us to measure the spatial amplitude and phase, as well as the spectral qualities, of the X-ray beam and allow us to combine the X-ray diagnostics data with the electron beam diagnostics data in a model based on ANNs.

In the following set of experiments, we modulate the electron beam parameters via different accelerator operational configurations in the XFEL. These include routine operations with full normal electron beams as well as exploration of the effect of detuning, tapering, and kicking of slotted electron beams. We record the electron and X-ray beam parameters on a single-shot basis and then train an ANN-based model using the data. Detuning plays a critical role in determining XFEL modes through the dispersion relation equation^2,3,26,27,28,29 and can excite high-order modes³⁰. In conjunction with tapering of the undulators, amplification of these high-order modes is expected. Both the routine case and the specialized cases of detuning, tapering, and kicking were chosen to demonstrate and understand the utility and limitations of the ANN-based virtual diagnostics model.

Results

These experiments were conducted at the Time-resolved Atomic, Molecular and Optical Science (TMO) instrument³¹ at LCLS as illustrated in Fig. 1a. LCLS was operated in self-amplified spontaneous emission (SASE) mode, producing ~530 eV X-rays at a repetition rate of 120 Hz. Data from a total of 13 XFEL configurations were recorded: 12 different configurations using the slotted electron beam, and 1 configuration representing routine operations using the normal full SASE beam. In the 12 different configurations of the slotted electron beam, an energy chirp along the electron bunch was introduced for detuning and the taper and kicking parameters were varied. A slotted foil was used to create a short, coherent spike in the electron bunch by spoiling the majority of it when incident upon the foil, leaving an ultrashort unspoiled portion through the slot in the foil³², as shown in Fig. 1a. The unspoiled portion then produces an ultrashort XFEL pulse through lasing. The undulator sections were set to two different states: no taper and optimal taper³³, and for each of these states, the electron bunch was kicked at various locations in the undulator, n sections before the final section, with n = 0, 1, 3, 5, 7, and 9 where 0 indicates no kicking, as illustrated in Fig. 1b. This resulted in a total of 12 different configurations. For each configuration, we recorded the single-shot wavefront intensity, phase, and spectrum, as well as electron parameters from the undulators (spectrum and wavefront were measured separately for the same 12 configurations). In addition to the slotted electron beam, the full SASE beam in routine operation was used to study shot-to-shot wavefront phase variations, with similar recordings of wavefront phase and electron bunch parameters. The X-ray wavefront was measured using a Talbot wavefront sensor, and the spectrum was recorded using an off-axis zone plate on a yttrium aluminum garnet (YAG) screen, shown in Fig. 1c. See the “Methods” section for further details on the XFEL configurations and data acquisition.

**Fig. 1: Overview of experimental setup and data analysis.**

In Fig. 2, we present the average spectra, wavefront intensity, and phase for each configuration, including variations with and without taper and kicking at different points along the undulator. The results show that different configurations result in distinct spectra and wavefronts. For instance, the spectra from taper configurations exhibit a higher energy tail and reduced low energy components compared to those of the no-taper cases. The intensity also increases as the electrons are kicked further downstream. The differences among the twelve phase maps indicate the wavefront’s evolution with different kick locations and taper settings.

**Fig. 2: X-ray spectra and wavefronts for different kicking locations and tapers.**

Indeed, the experiments revealed interesting XFEL physics when certain parameters of the electron bunch and the undulators are varied. In the no-taper case, because the electrons are continuously losing energy, the radiation spectrum is skewed toward the red-shift side. For the taper case, the undulator was over-tapered to introduce a detuning that sets the resonant frequency on the blue-shift side of the radiation frequency in the exponential growth region, i.e., before the tapered region. Thus, the microbunching will now support high-order modes according to the dispersion relation discussed below in the “Methods” section. In our case, the donut mode is excited, as shown in the intensity plot in Fig. 2, while the spectrum shows spectral tails at high energy, as seen in the normalized spectrum plot in Fig. 2.

We conducted an investigation into the correlations between X-ray properties and electron parameters by computing Pearson correlation coefficients between recorded electron beam parameters and our X-ray measurements (e.g., Zernike coefficients for wavefront phase). As shown in Fig. 1d, we created a correlation matrix to demonstrate the relationship between electron beam parameters and Zernike coefficients. The correlation matrix highlights that electron parameters exhibit intricate correlations with the resulting X-ray wavefront. These relationships are often implicit yet complex, involving a multitude of parameters that become challenging to depict and solve through conventional methods.
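The correlation matrix described above can be illustrated with a minimal sketch. It assumes the data are held as plain lists, one series of per-shot values for each electron parameter and each Zernike coefficient; the function names and the toy numbers are hypothetical and are not taken from the study's analysis code.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    sxy = sum(a * b for a, b in zip(dx, dy))
    sxx = sum(a * a for a in dx)
    syy = sum(b * b for b in dy)
    if sxx == 0 or syy == 0:
        return float("nan")  # a constant series has no defined correlation
    return sxy / math.sqrt(sxx * syy)

def correlation_matrix(electron, zernike):
    """Rows: electron parameters. Columns: Zernike coefficients.

    Each entry of `electron` and `zernike` is one per-shot series.
    """
    return [[pearson(e, z) for z in zernike] for e in electron]

# Hypothetical toy data: 2 electron parameters, 2 Zernike terms, 4 shots.
electron = [[0.0, 1.0, 2.0, 3.0], [1.0, 0.0, 1.0, 0.0]]
zernike = [[0.1, 0.3, 0.5, 0.7], [2.0, 1.0, 0.0, -1.0]]
corr = correlation_matrix(electron, zernike)
```

Pearson coefficients capture only linear pairwise dependence, which is one reason a nonlinear model is a natural next step for the multivariate relationships discussed here.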

ANNs can solve real-world problems, such as regression or classification, by receiving inputs, performing complex calculations, and providing outputs. To map both X-ray and electron properties, we employed a conventional multilayer perceptron (MLP) model to predict X-ray outputs based on electron parameter readings. The MLP we used in this paper is depicted in Fig. 1e and is comprised of an input layer, multiple hidden layers, and an output layer. The inputs are electron parameters and the outputs are X-ray properties like wavefront or spectrum. Electron parameters consist of readings from bunch length monitors, beam position monitors at various sections, and electron attributes such as position, peak current, bunch charge, coordinates, pulse energy, etc. The X-ray wavefront phase is represented as Zernike coefficients obtained by decomposing the phase into Zernike polynomials. The X-ray beam spectrum is represented as 50 numbers obtained through binning. See the “Methods” section for further details on model training.

We demonstrate the effectiveness of our trained models by presenting predictions for (1) different configurations from different runs with slotted electron beam varying kicking locations and taper states, and (2) shot-to-shot variation within a single run with a full electron beam. Predictions are all single shots, and the averages are calculated based on the predicted single shots. These predictions are discussed in the following subsections.

Analysis of predictions from the slotted electron beam configurations

In Fig. 3, we present a comparison between the measured and predicted average wavefront phase in Zernike coefficients for various configurations. The measurements and predictions are nearly identical, with only minor phase differences observed. The root-mean-square (RMS) prediction error for the average wavefront phases was determined to be 0.0169 rad. Furthermore, the standard deviation of wavefront phase from case to case was found to be 0.236 rad. Based on these values, the estimated relative error for predicting average case-to-case fluctuations is ~7%. Refer to the “Methods” section for further information on the prediction error and accuracy evaluation. The model accurately captured the differences and changes in wavefront phase caused by varying electron parameters and accurately predicted the resulting X-ray wavefront phase. With the single-shot measurements of a comprehensive collection of electron parameters, we can determine the X-ray beam wavefront phase delivered to the end station.

**Fig. 3: Measured and predicted average Zernike coefficients for different kicking locations and tapers.**

Similarly, in Fig. 4, we compare the measured and predicted average spectra for various configurations. There is very little difference between the two. The good agreement observed in the figures is due to the fact that they represent comparisons of the averages. The model effectively captured the differences and changes in the X-ray spectrum caused by varying electron parameters and accurately predicted the resulting X-ray spectra. For instance, kicking at a more upstream location results in more symmetrical spectrum curves, and taper leads to spectral tails at high energy, while no taper results in low energy components in the spectra. The mean similarity between the predicted and measured spectra is 0.999 for average spectra, and 0.924 for single-shot spectra. Refer to the “Methods” section for further information on the prediction error and accuracy evaluation. With the single-shot measurements of a comprehensive collection of electron parameters, we can determine the overall spectrum of the X-ray beam delivered to the end station. The spectral resolution relies on the measurements obtained from the zone plate spectrometer as detailed in the “Methods” section on data acquisition. It is worth mentioning that the spikiness observed in a single-shot spectrum is a random occurrence and cannot be predicted due to the stochastic nature of XFEL startup and the inability to make measurements at the single-electron level. However, what holds significance is the envelope of the single-shot spectrum, as it provides information about the central frequency, bandwidth, and spectral tails at high energy for tapered cases and the tails at low energy for no taper cases. These distinctive features are illustrated in Fig. 4.

**Fig. 4: Measured and predicted average X-ray spectra for different kicking locations and tapers.**

Furthermore, we also built and trained neural network models to perform classification tasks. We used either the wavefront phase Zernike coefficients or the electron parameters to predict the operation configuration from among the twelve options. The prediction accuracy is remarkable, reaching 99% when given the electron parameters and 87% when given the wavefront phase Zernike coefficients at the single-shot level.

Shot-to-shot variations

We utilized a similar technique to predict shot-to-shot variations in the single-shot X-ray wavefront phase within a single run using full SASE beams. Specifically, we employed a neural network to map electron parameter readings from the undulators to the measured single-shot X-ray wavefront phase. The results, depicted in Fig. 5a, illustrate the standard deviations of (1) the measured wavefront phase, (2) the predicted wavefront phase, and (3) the RMS prediction errors of the wavefront phase over all shots in the test dataset. The measured and predicted wavefront phases exhibit similar shot-to-shot variations, as evidenced by their comparable standard deviations for each Zernike term, particularly the two primary Zernike terms that contribute the most to shot-to-shot variations. The variation of the difference between the measured and predicted wavefront phases is smaller than the shot-to-shot variation itself, which indicates that the model has learned part of the mapping from electron parameters to wavefront phase; the remaining variation is likely due to shot-to-shot noise. Based on the single-shot wavefront phase data, the RMS prediction error between the predicted and measured wavefront phase is determined to be 0.141 rad. Additionally, the standard deviation of the wavefront phase from shot to shot is calculated to be 0.269 rad. Consequently, the estimated relative error for predicting shot-to-shot fluctuations is ~52%. Refer to the “Methods” section on prediction error and accuracy evaluation for further details.

Figure 5b presents the measurement and prediction results from the test dataset, based on Zernike coefficients (Z3-Z8) versus an example electron beam parameter (electron x coordinate from a beam position monitor). It is worth noting that while a single electron parameter is depicted against Zernike coefficients in this figure, these coefficients are multivariate and rely on the complete set of electron parameters. The figure demonstrates how Zernike coefficients change as electron beam parameters vary and how the model’s predictions compare to the measured data. Figure 5b indicates that the model has captured the correlations between Zernike coefficients and that selected single electron parameter, as well as the variation or dispersion among shots that arises from other electron parameters. The slight reduction in variation or dispersion from the prediction in Fig. 5b and the difference between measured and predicted wavefront phase in Fig. 5a may both be indications of noise sources (either systematic or measurement noise) that were not learned by the model.

Single-shot prediction is vital for XFEL X-ray imaging that relies on wavefront phase, as well as any other experiment that depends on X-ray intensity or spectra on the sample. This capability enables us to determine the wavefront phase in cases where direct, single-shot, in-situ wavefront measurements are not feasible, particularly for the exact shot pulse being used for single-shot imaging due to shot-to-shot variations of XFEL pulses. Although using a grating to split XFEL X-ray beams and measure the wavefront phase and spectrum to determine the X-ray delivered to experiments is possible, it would significantly increase the complexity of the experimental setup, consume more time and space, and result in a loss of photon flux.

Discussion

Our recent experiments at LCLS have confirmed that ANN models can be trained on experiment data to accurately predict XFEL pulse properties such as wavefront and spectra using electron bunch parameters as inputs. The study aims to emphasize the valuable insights provided by electron diagnostics in predicting X-ray output. While acknowledging the complexity of XFEL physics, the study demonstrates the efficacy of the MLP model in capturing the nonlinear relationships between electron parameters and X-ray characteristics. This capability will simplify virtual diagnostics for single-shot X-ray pulses and facilitate electron diagnostics, optimization, and tuning to achieve optimal or desired X-ray output.

Optimal performance in ANN training and tuning necessitates a large dataset encompassing a diverse sample space. In this work, we utilized readily available shot-to-shot recorded electron beam parameters while measuring the XFEL beam, without investing additional effort in obtaining innovative electron measurements. However, to explore further avenues for improvement, it is worth considering additional parameters that provide a more comprehensive and in-depth characterization of the electron beam. By incorporating such parameters, the method presented here has the potential to enhance the model’s robustness, reliability, and overall performance. For instance, a Convolutional Neural Network (CNN) can serve as a subnet for processing 2D electron parameters, specifically electron time-energy distribution images obtained from the X-band Transverse CAVity (XTCAV) diagnostic system³⁴. By leveraging its ability to recognize learned patterns in these 2D inputs, the CNN can effectively extract relevant features. Moreover, to capture temporal pulse-to-pulse correlations, alternative models such as recurrent neural networks or transformers can be employed. These sequential models excel at extracting features related to the contextual information within the pulses, thereby providing a more comprehensive understanding of the data.

Similarly, further improvements can be made in the area of X-ray diagnostics. Improvements in the performance of existing diagnostic tools, as well as the introduction of additional measurement capabilities in the future, for example the ability to measure temporal characteristics of the X-ray beam, can improve the overall performance of this type of model. Incorporating performance models of the various instrument optics, together with their optomechanical or other tuning parameters specific to each instrument, can allow the model to account for characteristics or fluctuations that the X-ray optics induce in the beam prior to interaction with the sample. This can lead to a higher fidelity predictive capability in the model as well as improved overall tuning of the accelerator and optics systems for an experiment.

Methods

The slotted electron beam configurations

To understand the mechanism behind the presence of high-order modes in XFEL pulses, we intentionally generate short electron bunches that resemble a single coherent spike. If we used a long electron bunch, it would result in many (order of 100) coherent spikes⁴, which would be different transverse eigenmodes in the post-saturation regime of the XFEL. Observing these different modes becomes difficult when many spikes interfere with each other as they hit the wavefront sensor.

We utilized a slotted foil to spoil the majority of the electron bunch, leaving only a small, ultrashort portion³², as shown in Fig. 1a. This ultrashort, unspoiled portion lases and generates an ultrashort XFEL pulse, which allows us to manipulate the electron bunch properties and undulator configuration to excite different high-order eigenmodes. Additionally, to effectively excite high-order modes, we perturb the electron orbit in the undulator by kicking it at specific locations, shown in Fig. 1b. The kicking occurs at n sections before the final undulator section with n = 0, 1, 3, 5, 7, and 9, where 0 means no kicking.

In the high-gain XFEL, the slowly varying envelope function of the electric field has the form:

$$E = e^{-i\Omega\tau}\, e^{i q_{\parallel}\zeta}\, \psi(\mathbf{x}), \tag{1}$$

where the dimensionless variables measuring spatial and temporal variations are:

$$\tau = \omega_w t,\quad \zeta = k_r (z - v_0 t),\quad \mathbf{x} = \sqrt{2 k_0 k_w}\,\mathbf{r}, \tag{2}$$

with r being the transverse coordinates, z the longitudinal coordinate, t the time, v₀ the electron bunch longitudinal velocity, ω_w = k_wc = (2π/λ_w)c and λ_w being the undulator period, c being the speed of light in vacuum, k_r = k₀ + k_w = 2π/λ₀ + k_w and λ₀ being the radiation wavelength.

The eigenfrequencies Ω = Ω_n(q_∥) and the eigenfunctions ψ = ψ_n(q_∥, x) are determined by the dispersion relation²⁶:

$$\left[\Omega - q_{\parallel} + \nabla_{\perp}^{2} + \frac{\alpha}{\Omega^{2}}\left(\Omega - q_{\parallel} - 1\right) u(x)\right]\psi(\mathbf{x}) = 0, \tag{3}$$

where $\alpha = (n_0 \mu_0 e^4 A_w^2)/(2 m^3 \gamma_0^3 \omega_w^2)$ with n₀ being the peak density of the electron bunch, γ₀, e, and m being the Lorentz factor, the charge, and the mass of the electron, respectively, μ₀ being the vacuum permeability, and A_w being the vector potential of the undulator.

It is now clear that to excite high-order eigenmodes ψ_n(q_∥), the system should be detuned to support that particular eigenfrequency Ω_n(q_∥). In our experiment, we then introduced an energy chirp along the electron bunch to efficiently excite high-order modes.

Besides introducing an energy chirp along the electron bunch for detuning, we can also adjust the taper of the undulator. Since the XFEL wavelength is $\lambda_{\mathrm{FEL}} = \lambda_w (1 + K^2/2)/(2\gamma_0^2)$, tapering the undulator strength K directly detunes λ_FEL. In the experiment, we study the evolution of high-order modes by setting the undulator sections in two states: no taper and optimal taper³³. On top of these states, the undulator can be over-tapered to introduce the proper effective detuning for efficient excitation, guiding, and amplification of high-order eigenmodes.
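As a rough numerical illustration of how the undulator strength K detunes the resonance, the sketch below evaluates the wavelength formula above and converts it to photon energy. The undulator period (39 mm) and Lorentz factor (7000) are hypothetical placeholder values chosen only to land near 530 eV; they are not machine settings reported in this work.

```python
import math

H_C_EV_NM = 1239.841984  # Planck constant times speed of light, in eV*nm

def fel_wavelength_nm(lambda_w_mm, K, gamma0):
    """Resonant wavelength lambda_w * (1 + K^2/2) / (2 * gamma0^2), in nm."""
    return lambda_w_mm * 1e6 * (1 + K**2 / 2) / (2 * gamma0**2)

def photon_energy_ev(lambda_nm):
    """Photon energy corresponding to a wavelength in nm."""
    return H_C_EV_NM / lambda_nm

def undulator_k_for_energy(energy_ev, lambda_w_mm, gamma0):
    """Invert the resonance condition for K at a target photon energy."""
    lam_nm = H_C_EV_NM / energy_ev
    ratio = lam_nm * 2 * gamma0**2 / (lambda_w_mm * 1e6)
    if ratio <= 1:
        raise ValueError("target energy is not reachable with K >= 0")
    return math.sqrt(2 * (ratio - 1))

# Hypothetical parameters (assumptions for illustration only).
LAMBDA_W_MM = 39.0
GAMMA0 = 7000.0

K_RES = undulator_k_for_energy(530.0, LAMBDA_W_MM, GAMMA0)
E_RES = photon_energy_ev(fel_wavelength_nm(LAMBDA_W_MM, K_RES, GAMMA0))
# Lowering K by 0.5% shortens the resonant wavelength, i.e. a blue shift.
E_DETUNED = photon_energy_ev(fel_wavelength_nm(LAMBDA_W_MM, 0.995 * K_RES, GAMMA0))
```

Because the wavelength grows with K², reducing K along the undulator moves the resonance toward higher photon energy, which is the direction of detuning described for the over-tapered case.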

Data acquisition and preparation

The single shot data was recorded as two distinct datasets—one for the X-ray beam wavefront/spectra on the photon side and another for the electron parameter readings from undulators on the accelerator side. Both datasets recorded the single shot pulse energies, which were used to synchronize the two datasets on a single shot basis, thus ensuring that the X-ray and electron data is aligned for each individual shot.

The X-ray data includes the X-ray wavefront and spectrum. Highly accurate wavefront measurements were conducted using a Talbot wavefront sensor³⁵, which has recently been successfully demonstrated with XFEL radiation^21,22,23. The number of Zernike terms required for an accurate representation of a wavefront phase depends on the complexity of the wavefront and the desired level of accuracy. For most XFEL experiments, the most important photon beam characteristics are focused beam position and profiles, which typically fluctuate shot-to-shot in current generation XFELs due to the SASE nature of lasing. Low order Zernike terms (up to Z15-Z21) can effectively capture the aberrations associated with those fluctuations, allowing a reasonably accurate determination of the beam features. In our specific case, because both the absolute values and standard deviations of the higher-order Zernike coefficients are very small compared to the dominant terms, we retrieved the wavefront phase and decomposed it into 21 Zernike coefficients (Z0-Z20) following the OSA/ANSI convention. By utilizing these Zernike coefficients, we were able to represent and characterize the wavefront phase. Each coefficient corresponds to a specific property of the wavefront, such as oblique and vertical astigmatism (Z3, Z5), defocus (Z4), trefoil (Z6, Z9), and coma (Z7, Z8). Decomposing the wavefront into Zernike polynomials serves as a featurization step, converting diverse forms of data into numerical representations suitable for basic machine learning algorithms.
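For readers unfamiliar with the OSA/ANSI single-index convention, the following short sketch maps an index j to its radial order n and azimuthal frequency m using the standard relation j = (n(n + 2) + m)/2. The helper name is hypothetical; only the indexing rule itself is standard.

```python
def osa_to_nm(j):
    """Map an OSA/ANSI single index j to (radial order n, azimuthal frequency m)."""
    if j < 0:
        raise ValueError("j must be non-negative")
    n = 0
    # Radial order n covers indices n(n+1)/2 through n(n+3)/2 inclusive.
    while (n + 1) * (n + 2) // 2 <= j:
        n += 1
    m = 2 * j - n * (n + 2)
    return n, m

# The 21 terms Z0-Z20 are exactly the complete radial orders n = 0..5.
modes = [osa_to_nm(j) for j in range(21)]
```

With this rule, Z4 maps to (2, 0) for defocus, Z3 and Z5 to (2, -2) and (2, 2) for the two astigmatism terms, and Z7 and Z8 to (3, -1) and (3, 1) for coma, consistent with the assignments listed above.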

For spectral measurements, we utilized an off-axis zone plate and captured the spectra on a YAG screen using a CCD camera. The spectrometer demonstrated a sub-eV spectral resolution (0.5–0.7 eV) in the vicinity of 530 eV. On the CCD, the pixel-to-eV ratio was 29 pixels per eV around 530 eV. To facilitate training and prediction, the resulting spectrum was subsequently binned into 50 values at a 6:1 ratio (equivalent to 0.2 eV per value after binning), offering a comprehensive representation of the overall spectrum shape.
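The binning step can be sketched as follows. The sketch assumes the spectrum is a 1D list of pixel intensities and that the 300-pixel window (50 bins of 6 pixels) starts at a caller-supplied offset; how the window was actually positioned on the CCD is not stated in the text, so `start` is a hypothetical parameter.

```python
PIXELS_PER_EV = 29   # CCD dispersion near 530 eV, from the text
BIN_FACTOR = 6       # 6:1 binning, from the text
N_BINS = 50          # number of spectrum values fed to the model

def bin_spectrum(pixels, start=0, factor=BIN_FACTOR, n_bins=N_BINS):
    """Sum consecutive groups of `factor` pixels into `n_bins` values."""
    stop = start + factor * n_bins
    if start < 0 or stop > len(pixels):
        raise ValueError("window does not fit inside the pixel array")
    window = pixels[start:stop]
    return [sum(window[i * factor:(i + 1) * factor]) for i in range(n_bins)]

EV_PER_BIN = BIN_FACTOR / PIXELS_PER_EV   # about 0.207 eV per binned value
SPAN_EV = N_BINS * EV_PER_BIN             # about 10.3 eV covered by 50 bins

# Hypothetical flat spectrum of 400 pixels, window starting at pixel 50.
binned = bin_spectrum([1.0] * 400, start=50)
```

The quoted 0.2 eV per value follows directly from 6 pixels divided by 29 pixels per eV, and 50 such values cover roughly 10 eV around the central photon energy.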

We did not intentionally choose specific electron parameters and attributes; instead, we utilized all the directly accessible single-shot parameters. The model relied on a total of 192 parameters to generate the X-ray output. These electron parameters encompass readings from a range of sources such as bunch length monitors and beam position monitors at different sections (undulator soft line, linac-to-undulator soft line, electron dump soft line) and include electron beam positions (x and y coordinates), bunch charges, peak current, raw waveform, X-ray pulse energy, etc.

Model training

To prepare the data for model training, we initially screened the pulse energy data to eliminate outliers by removing shots that were exceptionally weak or empty. In order to capture the intricate relationship between the electron beam parameters as input and X-ray output, we employed an MLP model. The MLP functions as a black box, taking the electron input and generating predictions for the corresponding X-ray output. Its focus is on establishing a nonlinear mapping rather than simulating the complex physics of XFEL systems.
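A minimal sketch of the pulse-energy screening step is given below. The text does not state the cut that was applied, so the rule used here (discard shots below a fixed fraction of the median pulse energy) and the default fraction of 0.1 are assumptions for illustration only.

```python
import statistics

def screen_shots(pulse_energies, min_fraction=0.1):
    """Return indices of shots whose pulse energy exceeds a fraction of the median.

    The threshold rule is an assumed stand-in for the screening in the text.
    """
    threshold = min_fraction * statistics.median(pulse_energies)
    return [i for i, energy in enumerate(pulse_energies) if energy > threshold]

# Hypothetical pulse energies in arbitrary units; shots 1 and 4 are weak or empty.
kept = screen_shots([1.0, 0.0, 0.9, 1.1, 0.02])
```

The returned indices can then be used to select the matching rows of both the electron and the X-ray datasets so that the two stay aligned shot by shot.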

The architecture of the MLP comprises several layers, including an input layer, three hidden layers with 256, 128, and 64 nodes respectively, and an output layer. The number of nodes in the input layer corresponds to 192 electron beam parameters, while the output layer consists of either 18 nodes for wavefront phase or 50 nodes for the spectrum. The electron parameters, which encompass parameters of the electron bunch and the undulators, serve as the input for the neural network. Prior to training, these parameters are normalized to enhance performance.

The output of the network is either the wavefront phase, represented by Zernike coefficients, or the normalized spectrum numbers. To ensure that the model accurately captures the nonlinear relationship and maintains generalization capability, we carefully select hyperparameters to prevent both underfitting and overfitting. The MLP utilizes the hyperbolic tangent (tanh) activation function, which allows for output normalization within the range of (−1, 1), effectively capturing both positive and negative influences from the input data.
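The layer sizes quoted above can be made concrete with a small forward-pass sketch in plain Python. It is not the authors' implementation: the weight initialization is arbitrary, training is not shown, and applying tanh to the hidden layers with a linear output layer is an assumption, since the text does not say whether the output layer is also passed through tanh.

```python
import math
import random

# 192 electron parameters -> 256 -> 128 -> 64 -> 18 wavefront outputs (from the text).
WAVEFRONT_SIZES = [192, 256, 128, 64, 18]
SPECTRUM_SIZES = [192, 256, 128, 64, 50]

def init_mlp(sizes, seed=0):
    """Create (weights, biases) per layer with small random weights (arbitrary choice)."""
    rng = random.Random(seed)
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        scale = 1.0 / math.sqrt(n_in)
        weights = [[rng.gauss(0.0, scale) for _ in range(n_in)] for _ in range(n_out)]
        layers.append((weights, [0.0] * n_out))
    return layers

def forward(layers, x):
    """Forward pass: tanh on hidden layers, linear output layer (assumed)."""
    h = x
    for i, (weights, biases) in enumerate(layers):
        z = [sum(w * v for w, v in zip(row, h)) + b for row, b in zip(weights, biases)]
        h = z if i == len(layers) - 1 else [math.tanh(v) for v in z]
    return h

def n_parameters(sizes):
    """Total number of weights and biases for the given layer sizes."""
    return sum(n_in * n_out + n_out for n_in, n_out in zip(sizes[:-1], sizes[1:]))

model = init_mlp(WAVEFRONT_SIZES)
rng = random.Random(1)
shot = [rng.uniform(-1.0, 1.0) for _ in range(192)]  # one normalized, hypothetical input
prediction = forward(model, shot)
```

With these layer sizes the wavefront network has 91,730 trainable parameters and the spectrum network 93,810, which is small compared with typical image models and helps explain why roughly 10,000 shots can be enough to train it.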

For training the model, we employ the Mean Squared Error (MSE) as the loss function, along with dropout regularization (rate of 0.1) to prevent overfitting. An Adam optimizer and a batch size of 256 are utilized during the training process. We trained the model using 80% of approximately 10,000 total shots, while the remaining 20% was reserved for evaluating its predictive capabilities. To ensure the reliability of the model, we performed 5-fold cross-validation. This process involved dividing the data into 5 subsets and conducting training and evaluation on different combinations of these subsets. The consistently minimal errors observed during cross-validation indicated that the model was not prone to overfitting or selection bias.
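The 5-fold procedure can be sketched as an index split. Shuffling with a fixed seed and the helper name are assumptions; the text only states that the data were divided into 5 subsets and that training and evaluation were repeated on different combinations of them.

```python
import random

def k_fold_indices(n_samples, k=5, seed=0):
    """Return k (train_indices, test_indices) pairs covering all samples once as test."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    splits = []
    for i in range(k):
        test = folds[i]
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        splits.append((train, test))
    return splits

# About 10,000 shots: each split trains on 8,000 and tests on 2,000.
splits = k_fold_indices(10000, k=5)
```

Each split reproduces the 80%/20% proportion quoted above, and every shot appears in exactly one test fold, so the spread of the five test errors gives a simple check against an unlucky single split.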

Prediction error and accuracy evaluation

When we have two 2D wavefront phase maps, the RMS difference between these wavefronts can be computed as $\sqrt{\overline{\lVert \Delta\mathbf{X} \rVert_2^2}}$. Here, ΔX represents the phase difference within the circular aperture. Alternatively, this difference can be expressed in terms of Zernike coefficients as $\lVert \Delta\mathbf{Z} \rVert_2 = \sqrt{\sum_{j} \left(\Delta Z_j\right)^2}$, where ΔZ_j signifies the discrepancy on each Zernike coefficient. This formula is used to calculate the RMS error between the measured wavefront and the predicted wavefront. Additionally, we can assess the shot-to-shot or case-to-case variations by considering ΔZ_j as the standard deviation of each Zernike coefficient. By dividing the RMS prediction error by the standard deviation of the wavefronts, the wavefront prediction error can be evaluated as a relative error.
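The relative-error arithmetic can be checked with a short sketch. The two helper names are hypothetical; the numerical inputs are the RMS errors and standard deviations reported in the Results section. The equivalence between the pixel-domain RMS and the coefficient norm assumes Zernike polynomials normalized to unit variance over the aperture.

```python
import math

def rms_from_zernike(delta_z):
    """Euclidean norm of per-coefficient differences (or standard deviations)."""
    return math.sqrt(sum(d * d for d in delta_z))

def relative_error(rms_prediction_error, rms_variation):
    """RMS prediction error divided by the RMS variation of the wavefronts."""
    return rms_prediction_error / rms_variation

# Values reported in the Results section (rad).
case_to_case = relative_error(0.0169, 0.236)   # average wavefronts, slotted beam
shot_to_shot = relative_error(0.141, 0.269)    # single shots, full SASE beam
```

These ratios come out near 0.07 and 0.52, matching the ~7% and ~52% figures quoted for case-to-case and shot-to-shot predictions respectively.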

To evaluate the accuracy of spectrum shape prediction, we measure the similarity between the predicted and measured spectra using the cosine similarity formula $S_C(\mathbf{A}, \mathbf{B}) = \frac{\mathbf{A} \cdot \mathbf{B}}{\lVert \mathbf{A} \rVert_2 \lVert \mathbf{B} \rVert_2}$. This calculation allows us to quantify the level of resemblance between the predicted and measured spectra, providing a metric for assessing the accuracy of the prediction.
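A direct transcription of the cosine similarity formula is sketched below for two binned spectra stored as lists. The function name and the toy spectra are hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity A.B / (|A|_2 |B|_2) between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)

# Hypothetical measured and predicted spectra (arbitrary units).
measured = [0.0, 1.0, 3.0, 1.0, 0.0]
predicted = [0.0, 2.0, 6.0, 2.0, 0.0]   # same shape, twice the amplitude
similarity = cosine_similarity(measured, predicted)
```

Because the metric is insensitive to overall scale, it compares spectral shape only, which suits a comparison of normalized spectra where the envelope rather than the absolute intensity is of interest.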

Data availability

The processed data subset can be accessed on Zenodo. Additional raw datasets that support the findings of this study are available from the corresponding authors upon request. Source data are provided with this paper.

Code availability

The code used for the data analysis is available from the corresponding authors upon request.

Acknowledgements

The authors thank Daniel Ratner for informative discussions about machine learning, and Franz-Josef Decker, Yuantao Ding, Andy Aquila, Matthieu Chollet and Peter Walter for experimental assistance and discussions. K. Li, Y. Liu, J. Wu, and A. Sakdinawat were supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences FWP No. 100622 at SLAC National Accelerator Laboratory, under Contract No. DE-AC02-76SF00515. Use of the Linac Coherent Light Source (LCLS), SLAC National Accelerator Laboratory, is supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences under Contract No. DE-AC02-76SF00515. Part of this work was performed at nano@stanford, supported by the National Science Foundation under award ECCS-2026822.

Author information

Authors and Affiliations

SLAC National Accelerator Lab, 2575 Sand Hill Road, Menlo Park, CA, 94025, USA
Kenan Li, Guanqun Zhou, Yanwei Liu, Juhao Wu, Ming-fu Lin, Xinxin Cheng, Alberto A. Lutman, Matthew Seaberg, Howard Smith, Pranav A. Kakhandiki & Anne Sakdinawat
School of Applied and Engineering Physics, Cornell University, 142 Sciences Dr, Ithaca, NY, 14853, USA
Pranav A. Kakhandiki

Contributions

A.S., J.W., K.L., and Y.L. conceived the idea, designed the experiments, and supervised the research. K.L. fabricated the X-ray diffractive optics for the X-ray diagnostic tools. M.L., X.C., M.S., K.L., and Y.L. carried out the X-ray wavefront and spectral measurements. G.Z., J.W., A.A.L., H.S., and P.A.K. performed accelerator operations, recorded the electron parameters, and reviewed the accelerator-side data and model. K.L. performed data preparation, data analysis, and model training. K.L., J.W., Y.L., and A.S. analyzed the results and wrote the manuscript. All authors commented on the manuscript.

Corresponding authors

Correspondence to Kenan Li or Anne Sakdinawat.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Elena Fol, Hirokazu Maesaka and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Li, K., Zhou, G., Liu, Y. et al. Prediction on X-ray output of free electron laser based on artificial neural networks. Nat Commun 14, 7183 (2023). https://doi.org/10.1038/s41467-023-42573-z

Received: 29 March 2023
Accepted: 16 October 2023
Published: 08 November 2023
DOI: https://doi.org/10.1038/s41467-023-42573-z
