Finite element-based nonlinear dynamic optimization of nanomechanical resonators

Introduction

Design of mechanical structures that move or vibrate in a predictable and desirable manner is a central challenge in many engineering disciplines. This task becomes more complicated when these structures experience large-amplitude vibrations, since linear analysis methods fail and nonlinear effects need to be accounted for. This is particularly important at the nanoscale, where forces on the order of only a few pN can already yield a wealth of nonlinear dynamic phenomena worth exploiting^1,2,3,4,5.

Although design optimization of micro- and nanomechanical resonators in the linear regime is well established⁶, the use of design optimization for engineering nonlinear resonances has received less attention⁷. Designers tend to avoid the nonlinear regime, and optimizing the nonlinear dynamics of a structure is more complex and requires extensive computational resources. As a result, the available literature on nonlinear dynamic optimization is limited, although some recent advances combine analytical methods with gradient-based shape optimization to optimize nonlinearities in micro beams^8,9. For nonlinear modeling of more complex structures, several approaches have been developed based on nonlinear reduced order modeling (ROM) of finite element (FE) simulations^10,11,12. A particularly attractive class known as STEP (STiffness Evaluation Procedure)¹³ can determine the nonlinear coefficients of an arbitrary mechanical structure and can be implemented in virtually any commercial finite element method (FEM) package. This has recently been shown, for instance, by using COMSOL to model the nonlinear dynamics of high-stress Si₃N₄ strings¹⁴ as well as graphene nanoresonators¹⁵. Since the number of degrees of freedom in the ROM is much smaller than that in the full FE model, the nonlinear dynamics of the structure can be simulated much more rapidly using numerical continuation packages¹⁶.

In this work, we present a route for nonlinear dynamic optimization that is based on an FE-based ROM. The methodology, which combines Particle Swarm Optimization (PSO) with STEP¹³ (OPTSTEP), has several beneficial features. First, because it uses a derivative-free optimization routine to approach the optimal design, it can be combined with FEM packages that cannot easily provide gradients. Second, the ROM parameters generated in OPTSTEP make it possible to express the optimization goals explicitly. Finally, as will be shown, the developed procedure allows multiple objective functions to be used to approximate a Pareto front, which can help designers balance performance trade-offs among different objectives. Considering its outstanding performance in ultrasensitive mechanical detectors and its mature fabrication procedure^17,18, we select high-stress Si₃N₄ for the experimental validation of our methodology.

The manuscript is structured as follows. We first introduce and describe the general OPTSTEP methodology. Then we demonstrate the method on the specific challenge of the optimization of the support structure for a high-stress Si₃N₄ nano string, while taking the maximization of its Q-factor and nonlinear Duffing constant β as examples of linear and nonlinear objectives. By comparing the PSO results to the Q and β values that result from a brute-force simulation of a large number of designs that span the design space, we validate that OPTSTEP finds the optimum designs much faster with the same computational resources. Subsequently, we turn to the problem of dealing with multiple objective functions and focus on simultaneously maximizing both Q and β, demonstrated by a Pareto front. For validation, the results are compared to experimental measurements of fabricated devices. We conclude by demonstrating the potential of OPTSTEP for optimizing the performance of resonant sensors by using more complex objective functions that are relevant for engineering their response time, sensitivity, and power consumption.

OPTSTEP methodology

An overview of the OPTSTEP method is shown schematically in Fig. 1. In the current work, we use it for engineering a parameterized geometry. We use nanomechanical string resonators with compliant supports, which are shown in Fig. 1a, to demonstrate the methodology. We keep the length L and width w of the central string constant, while varying the width w_s, length L_s and angle θ of the supports, as well as the thickness h of the device. The OPTSTEP methodology could be used with a larger number of parameters, or even be extended towards shape or topology optimization of nonlinear dynamic structures. However, such an extension is beyond the scope of the current work.

**Fig. 1: Schematic of the OPTSTEP method.**

For a certain set of geometrical parameters, a ROM for the parameterized structure is generated using the STEP method¹³, which we implemented with shell elements in COMSOL¹⁴. Besides geometric parameters and boundary conditions (see Fig. 1a), the COMSOL simulation contains material parameters (see Methods), and the initial pre-stress distribution is calculated using a static analysis¹⁴. We conduct this static analysis assuming the material is isotropic and pre-stressed (σ₀ = 1.06 GPa). We then calculate the stress redistribution during the sacrificial layer underetching process, whereby the high-stress Si₃N₄ layer releases from the silicon substrate. Note that in the present study we only consider θ ≥ 0, such that the central string is always in tension (in contrast to ref. ¹⁴). After the static analysis, an eigenfrequency analysis is performed to obtain the out-of-plane eigenmodes ϕ_i (see Fig. 1b). These eigenmodes, together with the redistributed stress field obtained from the static analysis, are then used to determine the effective mass m_eff, resonance frequency f₀, and Q-factor. We can calculate the Q-factor^19,20 of the ith eigenmode, Q⁽ⁱ⁾, based on the stored tension energy W_t⁽ⁱ⁾ and bending energy W_b⁽ⁱ⁾:

$$\begin{array}{lll}{W}_{\mathrm{t}}^{(i)}\,=\,\displaystyle\frac{h}{2}\iint \left[{\sigma }_{xx}{\left(\displaystyle\frac{\partial {\phi }_{i}}{\partial x}\right)}^{2}+{\sigma }_{yy}{\left(\displaystyle\frac{\partial {\phi }_{i}}{\partial y}\right)}^{2}+2{\sigma }_{xy}\displaystyle\frac{\partial {\phi }_{i}}{\partial x}\displaystyle\frac{\partial {\phi }_{i}}{\partial y}\right]\mathrm{d}x\,\mathrm{d}y,\\ {W}_{\mathrm{b}}^{(i)}\,=\,\displaystyle\frac{E{h}^{3}}{24(1-{\nu }^{2})}\iint \left[{\left(\displaystyle\frac{{\partial }^{2}{\phi }_{i}}{\partial {x}^{2}}\right)}^{2}+{\left(\displaystyle\frac{{\partial }^{2}{\phi }_{i}}{\partial {y}^{2}}\right)}^{2}+2\nu \displaystyle\frac{{\partial }^{2}{\phi }_{i}}{\partial {x}^{2}}\displaystyle\frac{{\partial }^{2}{\phi }_{i}}{\partial {y}^{2}}+2(1-\nu ){\left(\displaystyle\frac{{\partial }^{2}{\phi }_{i}}{\partial x\partial y}\right)}^{2}\right]\mathrm{d}x\,\mathrm{d}y,\end{array}$$

(1)

where σ_xx, σ_yy and σ_xy are the stress components in Cartesian coordinates, E is Young’s modulus, ν is Poisson’s ratio, h is the thickness, and Q₀ is the intrinsic Q-factor of stress-free Si₃N₄²¹.
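As a rough illustration of Eq. (1), the sketch below evaluates both energies for a mode that is uniform across the width, so that the surface integrals reduce to line integrals along x. The string length, the width, the sine mode shape, and the dilution relation Q = Q₀(1 + W_t/W_b) used in `diluted_q` are assumptions made for this example and are not taken from the FE model; the material constants and thickness are those quoted in the text.

```python
import math

def string_energies(phi, dx, width, h, sigma_xx, E, nu):
    """W_t and W_b of Eq. (1) for a mode phi(x) that does not vary along y.

    phi is a list of samples on a uniform grid with spacing dx.
    """
    # First derivative at cell midpoints, integrated with the midpoint rule.
    slope_sq = sum(((phi[i + 1] - phi[i]) / dx) ** 2
                   for i in range(len(phi) - 1)) * dx
    # Second derivative at interior nodes (end nodes are skipped in this sketch).
    curv_sq = sum(((phi[i + 1] - 2 * phi[i] + phi[i - 1]) / dx ** 2) ** 2
                  for i in range(1, len(phi) - 1)) * dx
    w_t = 0.5 * h * width * sigma_xx * slope_sq
    w_b = E * h ** 3 * width / (24 * (1 - nu ** 2)) * curv_sq
    return w_t, w_b

def diluted_q(q0, w_t, w_b):
    """ASSUMPTION: common dissipation-dilution estimate, Q = Q0 * (1 + W_t / W_b)."""
    return q0 * (1 + w_t / w_b)

# Hypothetical length and width; sigma0, E, nu and h = 340 nm follow the text.
L, width, h = 200e-6, 2e-6, 340e-9
sigma0, E, nu = 1.06e9, 271e9, 0.23
n = 2000
dx = L / n
phi = [math.sin(math.pi * i / n) for i in range(n + 1)]  # idealized string mode
w_t, w_b = string_energies(phi, dx, width, h, sigma0, E, nu)
```

Because the idealized sine mode ignores the sharp curvature near the clamps and the stress redistribution in the supports, this toy model overestimates W_t/W_b compared with an FE eigenmode.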

As indicated in Fig. 1b, the STEP method generates a set of coupled nonlinear differential equations^13,14,15, where the effective nonlinear elastic force acting on the ith mode is given by the function γ⁽ⁱ⁾, which depends on the quadratic (a_ij) and cubic (b_ijk) coupling coefficients and on the generalized coordinates q_i. Each q_i describes the instantaneous contribution of the corresponding mode shape ϕ_i to the deflection of the structure.
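To make the structure of γ⁽ⁱ⁾ concrete, the following sketch evaluates a polynomial restoring force with quadratic and cubic terms for a small ROM. The storage layout (nested lists indexed as `a[i][j][k]` and `b[i][j][k][l]`, summed over all index combinations) is an assumption chosen for readability, and the coefficient values are hypothetical.

```python
def nonlinear_force(i, q, a, b):
    """Nonlinear elastic force on mode i.

    Computes sum_jk a[i][j][k]*q_j*q_k + sum_jkl b[i][j][k][l]*q_j*q_k*q_l.
    """
    n = len(q)
    quadratic = sum(a[i][j][k] * q[j] * q[k]
                    for j in range(n) for k in range(n))
    cubic = sum(b[i][j][k][l] * q[j] * q[k] * q[l]
                for j in range(n) for k in range(n) for l in range(n))
    return quadratic + cubic

# Single-mode example: no quadratic term, one cubic coefficient (hypothetical value).
a = [[[0.0]]]
b = [[[[2.0]]]]
force = nonlinear_force(0, [3.0], a, b)  # evaluates 2.0 * 3.0**3
```

For a single mode the cubic term reduces to the Duffing-type force b q³, which is the case considered for the fundamental mode later in the text.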

Thus, the finite element model with several thousand or even millions of degrees of freedom (DOFs) is reduced to a condensed ROM that can usually describe the nonlinear dynamics to a good approximation with fewer than ten degrees of freedom. We can visualize the resulting frequency response curves for different harmonic drive levels by numerical continuation¹⁶, as shown in Fig. 1c.

The resulting ROM parameters, including the effective mass m_eff⁽ⁱ⁾, Q-factor, linear stiffness k⁽ⁱ⁾ = m_eff⁽ⁱ⁾(2πf⁽ⁱ⁾)², and nonlinear stiffness terms a_jk, b_jkl, are passed to the PSO optimizer (see Fig. 1d). The algorithm randomly generates many different initial designs by varying the geometric parameters, as shown in Fig. 1a. For each of these designs, known as a “particle” in PSO, a ROM is generated by STEP and the corresponding objective functions are computed accordingly and passed to the optimizer. The optimizer then generates a next generation of particles based on the designs from the current generation, the objective functions, and the constraints, with the aim of improving their design parameters to optimize the objectives (see Supplementary Note 1). The optimization loop will iterate until it reaches the predefined maximum generation. If multiple objective functions are selected to be optimized, there is an additional step that selects the nondominated particles according to Pareto dominance²². Because each particle is evaluated independently, PSO enables efficient parallel computing to evaluate all particles in one generation on a high-performance computing cluster.
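The optimization loop described above can be sketched as a minimal global-best PSO. This is a generic textbook variant, not the implementation of Supplementary Note 1: the inertia and acceleration coefficients, the clipping rule at the bounds, and the `stand_in_objective` that replaces an FE/STEP evaluation are all assumptions made for illustration. The bounds and the swarm size of 10 follow the values used for the single-objective study.

```python
import random

def pso_maximize(objective, bounds, n_particles=10, n_generations=50, seed=0,
                 inertia=0.7, c_personal=1.5, c_global=1.5):
    """Global-best PSO maximizing objective(x) within box bounds [(lo, hi), ...]."""
    rng = random.Random(seed)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * len(bounds) for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]
    best_val = [objective(p) for p in pos]
    lead = max(range(n_particles), key=lambda k: best_val[k])
    swarm_pos, swarm_val = best_pos[lead][:], best_val[lead]
    history = [swarm_val]
    for _ in range(n_generations):
        for k in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                vel[k][d] = (inertia * vel[k][d]
                             + c_personal * rng.random() * (best_pos[k][d] - pos[k][d])
                             + c_global * rng.random() * (swarm_pos[d] - pos[k][d]))
                # Keep each design inside its constraints.
                pos[k][d] = min(max(pos[k][d] + vel[k][d], lo), hi)
        # Particles are independent, so this step could be evaluated in parallel.
        values = [objective(p) for p in pos]
        for k, val in enumerate(values):
            if val > best_val[k]:
                best_pos[k], best_val[k] = pos[k][:], val
            if val > swarm_val:
                swarm_pos, swarm_val = pos[k][:], val
        history.append(swarm_val)
    return swarm_pos, swarm_val, history

# Design bounds: L_s (m), w_s (m), theta (rad), h (m).
bounds = [(10e-6, 100e-6), (1e-6, 7e-6), (0.0, 0.4), (40e-9, 340e-9)]
target = [60e-6, 3e-6, 0.1, 200e-9]  # hypothetical optimum of the stand-in objective

def stand_in_objective(x):
    """Placeholder for one FE/STEP evaluation; peaks at `target`."""
    return -sum(((xi - ti) / (hi - lo)) ** 2
                for xi, ti, (lo, hi) in zip(x, target, bounds))

best_design, best_value, history = pso_maximize(stand_in_objective, bounds)
```

In an actual workflow, `stand_in_objective` would be replaced by a call that builds the ROM for a given design and returns the chosen objective, for example Q or β.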

OPTSTEP implementation and validation

Single-objective optimization with OPTSTEP

We implement the presented OPTSTEP methodology to optimize the support geometry of the string resonator shown in Fig. 1a. The motion of the fundamental mode of the resonator can be described with the following nonlinear equation of motion:

$$\ddot{q}+\frac{2\pi {f}_{0}}{Q}\dot{q}+{(2\pi {f}_{0})}^{2}q+\beta {q}^{3}={F}_{\mathrm{exc}}\sin (2\pi ft),$$

(2)

where q is the displacement at the string center, f₀ is the resonance frequency, Q is the Q-factor, β = b₁₁₁/m_eff is the mass-normalized Duffing constant, and F_exc sin(2πft) is the mass-normalized harmonic drive force. To demonstrate the single-objective optimization capability of OPTSTEP, we present results for two separate optimization objectives: maximizing the Q-factor (shown in Fig. 2a, c, d) and maximizing the mass-normalized Duffing constant β (shown in Fig. 2b, e, f) of the fundamental mode. We emphasize that a maximum Q or β does not necessarily result in the best performance for all applications of nanomechanical resonators. We choose these optimization objectives as examples to demonstrate that the OPTSTEP methodology can be used to find extreme values of a single objective function that can be suitably chosen depending on the application requirements. As design parameters, we use the support parameters (L_s, w_s, θ and h in Fig. 1a). The PSO algorithm can freely initialize and vary these variables within the preset constraints 10 μm < L_s < 100 μm, 1 μm < w_s < 7 μm, 0 rad < θ < 0.4 rad, and 40 nm < h < 340 nm.
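To give a feel for how Q and β shape the response of Eq. (2), the sketch below uses a single-harmonic balance approximation, which is a simpler stand-in for the numerical continuation used for the simulated curves. For a prescribed amplitude it returns the drive frequencies on the two branches of the response curve. The resonator parameters in the example are hypothetical.

```python
import math

def duffing_branches(amplitude, f0, q_factor, beta, f_exc):
    """Drive frequencies (Hz) at which Eq. (2) responds with the given amplitude.

    Single-harmonic balance gives
        A^2 * [(w0^2 + 0.75*beta*A^2 - w^2)^2 + (w0*w/Q)^2] = F_exc^2,
    which is a quadratic in w^2. Returns a list with 0, 1 or 2 frequencies.
    """
    w0_sq = (2 * math.pi * f0) ** 2
    c = w0_sq + 0.75 * beta * amplitude ** 2  # amplitude-dependent stiffness
    d = w0_sq / q_factor ** 2                 # damping contribution
    disc = d * d - 4 * c * d + 4 * (f_exc / amplitude) ** 2
    if disc < 0:
        return []  # amplitude lies above the peak of the response curve
    roots = [((2 * c - d) + sign * math.sqrt(disc)) / 2 for sign in (-1, 1)]
    return [math.sqrt(r) / (2 * math.pi) for r in roots if r > 0]

# Hypothetical resonator parameters (not taken from the measurements).
f0, Q, beta, F_exc = 3.0e5, 1.0e4, 1.0e23, 35.0
branches = duffing_branches(8.0e-8, f0, Q, beta, F_exc)
```

Sweeping the amplitude from small values up to the point where no solution remains traces out both branches of a response curve; with β > 0 the branches lean towards higher frequencies, which is the hardening behavior discussed for the backbone curves.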

**Fig. 2: Optimal designs found by particle swarm optimization (PSO) and simulation of different dynamical properties.**

We initialize the PSO algorithm with 10 randomly generated particles, as indicated by the blue circles at the first generation in Fig. 2a, b. The Q and β values of the best performing particle per generation are highlighted by the red line, which converges towards an optimum. Simulated response curves at different drive levels of the initial design (median performance of the initialized particles) and the optimized design are shown in Fig. 2c, d for Q and Fig. 2e, f for β. It is obvious that the resonance peaks become narrower from Fig. 2c to Fig. 2d, indicative of an increase in Q-factor. From the backbone curves shown in Fig. 2e, f, we see that the resonance frequency of the optimized device shifts more at the same vibration amplitude, which suggests a larger, optimized value of β.

Numerical validation

In order to validate the PSO results, we compare them to a brute-force parametric study where we simulate a large number of designs that span the full design parameter space, and plot the resulting values of Q and β in the contour plots in Fig. 2g, h. Each of these subfigures consists of 16 small contour plots, each of which has a different combination of L_s and h, while along the axes the parameters w_s and θ are varied. The red-colored regions in the plots contain the optimal values of Q and β, which are indicated by a triangle and a star. In Supplementary Table S1, we compare the optimized design parameters from the OPTSTEP method to the best devices from the parametric study. The close agreement between both approaches provides evidence that the OPTSTEP method is able to optimize both linear (Q) and nonlinear (β) parameters of the ROM. The results in Fig. 2a are obtained in 30 minutes using a high-performance computing cluster, while the parametric study in Fig. 2g takes over 325 hours on the same cluster with the same number of nodes. This illustrates the advantage in computation time that can be realized with OPTSTEP, although these times depend strongly on the resolution of the parameter grid and other simulation parameters.

Experimental characterization

To compare the OPTSTEP method to experimental results, we also perform an experimental parametric study on 15 string resonators with varying support design parameters. For this, we fabricated a set of devices with 10 μm < L_s < 90 μm and 0 rad < θ < 0.2 rad, while keeping h = 340 nm and w_s = 1.0 μm fixed. Figure 3a shows a Scanning Electron Microscope (SEM) image of an array of nanomechanical resonators with varying support designs made of high-stress Si₃N₄ (see “Methods” for more details). To characterize the nonlinear dynamics of the devices, as shown in Fig. 3b, we fix the chip to a piezo actuator that drives the resonator by harmonic base actuation in the out-of-plane direction. We use a Zurich Instruments HF2LI lock-in amplifier, connected to an MSA400 Polytec Laser Doppler vibrometer, to measure the out-of-plane velocity at the center of the string resonator as a function of driving frequency (see Fig. 3c). We use a velocity decoder with a calibration factor of 200 mm/s/V. We perform all measurements in a vacuum chamber with a pressure below 2 × 10⁻⁶ mbar at room temperature.
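Since the vibrometer measures velocity while the equation of motion is written in terms of displacement, a conversion step is needed at some point. The sketch below shows one plausible way to do this, assuming the response is purely harmonic at the drive frequency; the function name and this post-processing choice are assumptions, and only the calibration factor comes from the text.

```python
import math

DECODER_SENSITIVITY = 0.2  # m/s per volt, i.e. the stated 200 mm/s/V

def displacement_amplitude(voltage_amplitude, drive_frequency_hz):
    """Displacement amplitude (m) of a harmonic motion from the decoder voltage amplitude (V)."""
    velocity_amplitude = DECODER_SENSITIVITY * voltage_amplitude
    # For q(t) = A*sin(2*pi*f*t) the velocity amplitude is 2*pi*f*A.
    return velocity_amplitude / (2 * math.pi * drive_frequency_hz)
```

The division by 2πf means that the same voltage corresponds to a smaller displacement at higher frequency, so the conversion has to be applied point by point along a frequency sweep.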

**Fig. 3: Experimental set-up and experimental validation of the simulations.**

Figure 3c shows the frequency response at the center of the string at various drive levels for a device with L_s = 90 μm, w_s = 1 μm, θ = 0.20 rad and h = 340 nm. We estimate the linear resonator parameters of all devices by fitting the measured frequency response curves at various drive levels with the following harmonic oscillator function¹⁴ (see Supplementary Note 2):

$${q}_{\mathrm{d}}(f)=\frac{{q}_{\max ,\mathrm{l}}/Q}{\sqrt{{\left[1-{\left(f/{f}_{0}\right)}^{2}\right]}^{2}+{f}^{2}/{({f}_{0}Q)}^{2}}},$$

(3)

where q_d(f) is the measured amplitude, q_max,l is the peak amplitude of the linear oscillator, which is set equal to the maximum measured amplitude q_max,nl, and f is the drive frequency. To determine the nonlinear stiffness, we measure the resonator’s frequency response at increasing drive levels, construct the backbone curve, and use the relation between the nonlinear peak amplitude q_max,nl and the peak frequency f_max to fit and obtain the mass-normalized Duffing constant β using the following equation^23,24:

$${f}_{\max }^{2}={f}_{0}^{2}+\frac{3}{16{\pi }^{2}}\beta {q}_{\max ,\mathrm{nl}}^{2}.$$

(4)

To compensate for small drifts in f₀ during the experiments, before fitting with Eq. (4), we plot the frequency response curves along the f − f₀ axis¹⁴. The fitting procedure to obtain f₀, Q and β using Eqs. (3) and (4) is explained in more detail in Supplementary Note 2.
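Because Eq. (4) is linear in q_max,nl² and f_max², the backbone fit can be illustrated with an ordinary least-squares line. The sketch below is a minimal version of that idea and not the procedure of Supplementary Note 2: it omits the drift compensation along the f − f₀ axis, and the synthetic data are generated from hypothetical values of f₀ and β purely to exercise the function.

```python
import math

def fit_backbone(q_peaks, f_peaks):
    """Least-squares fit of Eq. (4): f_max^2 = f0^2 + 3/(16*pi^2) * beta * q_max^2.

    Takes paired peak amplitudes (m) and peak frequencies (Hz); returns (f0, beta).
    """
    x = [q * q for q in q_peaks]
    y = [f * f for f in f_peaks]
    n = len(x)
    x_mean, y_mean = sum(x) / n, sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return math.sqrt(intercept), slope * 16 * math.pi ** 2 / 3

# Synthetic backbone from hypothetical values, used only to exercise the fit.
f0_true, beta_true = 3.0e5, 1.0e23
q_peaks = [2e-8 * k for k in range(1, 6)]
f_peaks = [math.sqrt(f0_true ** 2 + 3 / (16 * math.pi ** 2) * beta_true * q ** 2)
           for q in q_peaks]
f0_fit, beta_fit = fit_backbone(q_peaks, f_peaks)
```

With measured data, the scatter of the points around the fitted line gives a direct visual check of how well a purely cubic nonlinearity describes the device.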

In Fig. 3d–f, we compare the dynamical properties between FE-based ROMs (dots) and measurements on 15 string resonators (diamonds) as a function of L_s and θ. The fundamental resonance frequency f₀, the Q-factor, and the mass-normalized Duffing constant β of the fabricated devices are all well predicted by the FE-based ROMs. It can also be seen that for short support lengths L_s the device performance is similar, whereas increasing L_s allows tuning f₀, Q and β, as we studied in more detail earlier^14,19. In the next section we will compare these experimental results to multi-objective optimization as further validation of OPTSTEP.

Multi-objective optimization with OPTSTEP

For actual device design there are often multiple performance specifications that need to be met. It might sometimes be possible to condense these performance specifications into a single figure of merit, like the f₀ × Q product for nanomechanical resonators. However, to make the best design decisions, it is preferred that the optimizer works with two (or more) objective functions, like enhancing f₀ and Q, simultaneously. To enable this, we implement OPTSTEP with a multi-objective particle swarm optimization (MOPSO), which is an extension of single-objective PSO. After multi-objective optimization, the nondominated particles in the swarm are used to determine an approximation of the Pareto front, which is the set of designs for which improving one of the objectives will always lead to a deterioration of the other objective(s). By performing MOPSO, we aim to find the Pareto front in the design space for multiple objectives, which represents the boundary on which all optimized designs reside for the chosen variables. As the red dots in Fig. 1d illustrate, the Pareto front represents the boundary between feasible and infeasible combinations of objectives and thus allows the designer to make the best trade-off among different objectives.
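The selection of nondominated particles can be illustrated with a short filter. This sketch assumes every objective is to be maximized and uses a simple pairwise comparison; the (Q, β) pairs are hypothetical values in arbitrary units, and a full MOPSO would add further steps such as archive management and leader selection.

```python
def nondominated(points):
    """Return the points not dominated by any other, with every objective maximized."""
    def dominates(p, r):
        # p dominates r if it is at least as good everywhere and better somewhere.
        return (all(pi >= ri for pi, ri in zip(p, r))
                and any(pi > ri for pi, ri in zip(p, r)))
    return [p for p in points if not any(dominates(r, p) for r in points)]

# Hypothetical (Q, beta) pairs in arbitrary units.
designs = [(1.0, 5.0), (2.0, 4.0), (1.5, 3.0), (3.0, 1.0), (2.5, 0.5)]
front = nondominated(designs)
```

Here the designs (1.5, 3.0) and (2.5, 0.5) are discarded because another design is at least as good in both objectives, while the three remaining designs form the approximate Pareto front.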

To demonstrate that multi-objective optimization can be combined with OPTSTEP, we use it to simultaneously maximize Q and β. Devices with high quality factor and nonlinear stiffness can be of interest in cases where we are looking for designs that can drive a string into the nonlinear regime with a minimum driving force and power consumption.

The resulting Pareto fronts are shown in Fig. 4a. Since we are also interested in the effect of the constraints on the optimum solutions, we include Pareto fronts with: no constraint (purple), a thickness constraint of h = 340 nm (gray), and with thickness and support width constraint (multi-colored). These three Pareto fronts show that there is a clear trade-off between Q and β, with higher Q-factor leading to lower nonlinearity β. The experimental devices share the same constraints (w_s = 1 μm and h = 340 nm) as the multi-colored Pareto front and are plotted as the hollow diamonds with error bars in Fig. 4a (see Supplementary Table 2). We observe that all experimental points reside in the region on the left-hand side of the Pareto front, confirming that the area enclosed by the Pareto front indeed captures the feasible devices, and experimentally strengthening the confidence in the OPTSTEP approach for multi-objective designs. The color of the points links the points in the Q − β graph in Fig. 4a to the corresponding design parameters in Fig. 4b. In Fig. 4b the schematic support geometries are shown as insets for both maximum β (dark blue) and maximum Q (dark red). We choose some of the fabricated devices close to the Pareto front to show typical measured frequency response curves and microscopic images in Fig. 4c–f, which correspond to the star, triangle, circle and square data markers in Fig. 4a, b. Together with the microscopic images, these curves show that minor alterations in the support region can substantially tune the response of the string resonators. To further explore the effect of other design parameters numerically, we release the constraint on w_s, keeping only h = 340 nm constrained, and conduct MOPSO (see the gray Pareto front). We can see from the comparison between the gray and multi-colored fronts that the performance gain from changing w_s is not very large.
In contrast, if we also relax the constraint h = 340 nm, so that the design space is the same as in Fig. 2g, h, we obtain the purple Pareto front. Thinner devices push the Pareto front to much higher Q. The long plateau at fixed β is mainly attributed to the increase in Q that results from the dependence of the intrinsic quality factor Q₀ on h (see Methods). Besides validating the MOPSO approach by comparing with experimental data, we also use the data from the parametric study in Fig. 2 to extract and generate reference Pareto fronts that are shown as black solid, dotted, and dashed lines in Fig. 4a (see Supplementary Note 3), with constraints that match those from the MOPSO optimization.

**Fig. 4: Trade-offs between Q-factor and the mass-normalized Duffing constant β as obtained by combining OPTSTEP with multi-objective particle swarm optimization (MOPSO).**

Discussion

The OPTSTEP methodology that is presented in this work enables the optimization of the nonlinear dynamic properties of resonant structures using standard FEM software, since it is based on STEP and uses a derivative-free optimization method. The exclusive reliance on FEM outputs, without requiring information from the full mass and stiffness matrices, increases its generality and allows multi-physics optimization, including, e.g., electromagnetic or thermodynamic phenomena. We note that although derivative-free techniques like PSO are able to efficiently find near-optimal values of design parameters, optimality guarantees can typically not be given, and the techniques are therefore also called metaheuristic optimization techniques. Here, in order to validate the OPTSTEP methodology numerically and experimentally, we have focused on β and Q maximization of the fundamental mode of a string resonator by geometric support design.

After having established the methodology, it is now of interest to apply it to explore performance parameters that are more relevant to applications. For example, as shown in Fig. 5, our methodology can directly be extended to optimize the power consumption P, sensitivity σ_y (the limit of detection expressed in Allan deviation, assuming averaging time τ = 1 s) and response time τ_r of resonant sensors^25,26, since these figures of merit can be directly expressed in terms of m_eff, f₀, Q and β (see Supplementary Note 4). In Fig. 5, 1000 nondominated particles are found by OPTSTEP to form a 3D surface that approaches the Pareto frontier with the objective of minimizing P, σ_y and τ_r simultaneously. The particles have the same design constraints as in the example in Fig. 2 and the purple Pareto front in Fig. 4a, which are 10 μm < L_s < 100 μm, 1 μm < w_s < 7 μm, 0 rad < θ < 0.4 rad, and 40 nm < h < 340 nm. The competing design trade-offs between these three objective functions are obtained from OPTSTEP, and are visualized in Fig. 5 by showing four typical designs near the Pareto frontier. As demonstrated by the designs at the upper right corner of the Pareto frontier, we can conclude that the devices with shorter response time are more likely to have thicker supports, which lead to a higher resonance frequency f₀ combined with a low Q, thus resulting in a smaller Q/f₀ ratio. At the same time, these thicker supports also contribute to a larger onset of nonlinearity a_1dB¹⁴, so the resonators are able to work at much larger amplitudes in the linear regime, which provides a better sensitivity σ_y. However, the larger a_1dB and m_eff require more energy to sustain the oscillation at resonance, which causes a higher power consumption P. In contrast, the devices with much lower power consumption P and comparably high sensitivity σ_y, which are shown at the lower left corner in Fig. 5, are equipped with more slender supports. With only a slight increase of the support angle θ from 0, the low torsional stiffness of the supports is maintained while the stress in the central string can be significantly increased¹⁹, leading to a higher Q, which can be confirmed by Fig. 2g. Consequently, when aiming at designing a resonant sensor with relatively low power consumption P, high sensitivity σ_y and short response time τ_r with compliant supports, a pair of slender and slightly angled supports, together with a medium thickness of the Si₃N₄ layer, is generally favored.

**Fig. 5: Trade-offs among the power consumption P, sensitivity σ_y(τ = 1s) and response time τ_r of a string resonator with four design parameters.**

In other cases, like approaching the quantum regime with a nonlinear nanomechanical resonator²⁷, it is beneficial to maximize Q and β simultaneously. The OPTSTEP methodology can also be used for more complex design problems that involve multiple modes^5,8,14,28, for avoiding or taking advantage of mode coupling, for instance by optimizing nonlinear coupling coefficients (a_jk and b_jkl in Fig. 1b) and resonance frequency ratios. Since OPTSTEP generates the ROM parameters at each generation, it is particularly suited for dealing with cases where the device specifications can be expressed in terms of these parameters. Interesting challenges include increasing frequency stability by coherent energy transfer^29,30, signal amplification³¹ and stochastic sensing^4,32. Moreover, intriguing paths for further research involve inclusion of nonlinear damping or extension to full topology optimization⁶. Also the use of alternative optimization strategies, like binary particle swarm optimization (BPSO)³³, that could generate radically new geometries, is an interesting direction.

Conclusions

To sum up, we presented a methodology (OPTSTEP) for optimizing the nonlinear dynamics of mechanical structures by combining an FE-based ROM method with a derivative-free optimization technique (PSO). We demonstrated and validated the methodology by optimizing the support design of high-stress Si₃N₄ nanomechanical resonators. The method was verified numerically by comparing its results to a brute-force parametric study, for both single- and multi-objective optimization. Experimental data on the Q-factor and Duffing nonlinearity were in agreement with the OPTSTEP results. The capability of the method was also demonstrated by multi-objective optimization of the support for the nanomechanical resonator, targeting improvements in power consumption, sensitivity and response time in resonant sensing. We thus conclude that the method can be applied to a wide range of complex design challenges including nonlinear dynamics, and is expected to be compatible with most FE codes and derivative-free optimization routines. It holds the potential to facilitate and revolutionize the way (nano)dynamical systems are designed, thus pushing the ultimate performance limits of sensors, mechanisms and actuators for scientific, industrial, and consumer applications.

Methods

Sample fabrication

We produce our nanomechanical resonators using electron beam lithography and reactive ion etching techniques on high-stress Si₃N₄ layers, chosen for their reliability and precision in achieving design specifications²⁰. These layers are deposited via low pressure chemical vapor deposition (LPCVD) onto a silicon substrate. Following this, the devices undergo suspension through a fluorine-based deep reactive ion underetching process. The mechanical properties of the high-stress Si₃N₄ are characterized in our previous works¹⁴, with an initial isotropic stress σ₀ = 1.06 GPa, Young’s modulus E = 271 GPa, Poisson’s ratio ν = 0.23, mass density ρ = 3100 kg/m³. The intrinsic quality factor is a function of thickness h²¹, which is Q₀⁻¹ = 28000⁻¹ + (6 × 10¹⁰ h)⁻¹.
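The thickness dependence of Q₀ is easy to evaluate numerically, as in the sketch below. The text does not state the unit of h in this expression; the function assumes h is given in metres, which should be checked against ref. 21 before reuse.

```python
def intrinsic_q(h_m):
    """Intrinsic Q-factor from Q0^-1 = 28000^-1 + (6e10 * h)^-1.

    ASSUMPTION: the thickness h_m is expressed in metres.
    """
    return 1.0 / (1.0 / 28000.0 + 1.0 / (6e10 * h_m))

q0_thick = intrinsic_q(340e-9)  # upper thickness bound of the design space
q0_thin = intrinsic_q(40e-9)    # lower thickness bound of the design space
```

Under this unit assumption, Q₀ is about 1.18 × 10⁴ at h = 340 nm and decreases for thinner films, because the thickness-dependent term dominates the sum at small h.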