Mobility and threshold voltage extraction in transistors with gate-voltage-dependent contact resistance

Introduction

The electron and hole mobilities of emerging semiconductors are frequently estimated from measured current vs. voltage characteristics (e.g., from drain current I_d vs. gate-source voltage V_gs) of field-effect transistors (FETs)¹. Many such transistors have contact resistance that is a function of gate voltage due to electrostatic gate fields that affect the energy barrier and charge density at the contact/channel interface^2,3. This contact gating effect is often associated with back-gated FETs (Fig. 1a), where the back gate can directly modulate the mobile charge carrier density at the contacts (Fig. 1b). However, recent work has shown that top-gated FETs (Fig. 1c) can also electrostatically control the contacts at their edges (Fig. 1d)^4,5.

Mobility and threshold voltage extraction in transistors with gate-voltage-dependent contact resistance — **Fig. 1: Contact gating in field-effect transistors (FETs).**

Further complicating matters, the channel resistance R_ch and contact resistance R_C of contact-gated FETs often change at different rates (Fig. 1e), causing these devices to potentially exhibit two apparent threshold voltages: one associated with channel turn-on, and another dictated by contact turn-on⁶ (Fig. 1f). When the channel turns on before the contacts (i.e., at lower V_gs in n-channel FETs, as in Fig. 1e, f), I_d is limited by R_C and can remain low even when the channel is fully turned on (Fig. 1e, f, Region I). As V_gs increases, the contacts begin to turn on (Fig. 1e, f, Region II) before the device eventually reaches a channel-dominated regime (Fig. 1e, f, Region III), leading to a distinct kink⁷ in the I_d vs. V_gs characteristics associated with the transition between the contact- and channel-limited regimes.

Because I_d is contact-limited or contact-influenced in Region II of Fig. 1f, both I_d and the transconductance (g_m = ∂I_d/∂V_gs) here are dominated by the contacts rather than by the channel. Thus, attempting to estimate the channel mobility (μ) using the conventional linear extrapolation method (i.e., asserting μ ∝ g_m) in this region can result in severe μ overestimation^3,6,7,8,9 when R_C dominates and decreases as |V_gs| increases. (In the special case where R_C remains constant as |V_gs| increases, the mobility can be underestimated instead.) Therefore, μ should instead be extracted from the slope of Region III in Fig. 1f (where devices are channel-limited)^7,8. However, this approach is often infeasible for emerging semiconductor devices whose large R_C and/or early gate dielectric breakdown can make this high-|V_gs| region hard to reach experimentally. Furthermore, when Region III of the I_d vs. V_gs curve is inaccessible before dielectric breakdown (e.g., due to large R_C and/or high threshold voltage |V_T|), the I_d vs. V_gs curve may show only a single linear region simply because the V_gs sweep ends early. For this reason, it can even be challenging to establish if a device is channel- or contact-limited based on its transfer characteristics¹⁰ alone.

To avoid μ overestimation due to contact gating, researchers can use four-terminal geometries to directly probe and subtract the voltage drop across the contacts^9,11. However, care must be taken to ensure that the voltage probes are entirely non-invasive, which can be difficult in practice^12,13,14,15. The Y-function method¹⁶ can also correct for mild contact gating¹⁷ but relies upon accurate V_T extraction, making this method unreliable for devices that cannot access Region III⁶ in Fig. 1f. We demonstrate later in this work that the transfer length method (TLM) approach¹ can be similarly unreliable for extracting μ from contact-gated FETs.

In this work, we propose a method for extracting the channel μ and V_T of transistors that remains valid even for strongly contact-gated devices. This approach takes inspiration from the conventional TLM method¹ and can analyze families of two-terminal devices that cannot access the channel-limited regime (Fig. 1e, f, Region III) in their I_d vs. V_gs measurements. We validate our proposed method using synthetic data generated by a technology computer-aided design (TCAD) simulator¹⁸ and find that it accurately extracts μ even for devices where conventional methods overestimate μ by 2–3×, enabling accurate μ and V_T extraction in devices with strong contact gating.

Results

Our proposed extraction is summarized in Fig. 2 and explained in detail below. We provide Python code to automate this extraction in a GitHub Repository¹⁹ and in Supplementary Section 4. Additionally, we provide a tutorial for this code in Supplementary Section 1.

**Fig. 2: Summary of our proposed extraction.**

Model derivation

Here, we treat a contact-gated FET as a channel between gate-voltage-dependent source and drain resistors (R_s and R_d respectively; the total contact resistance is 2R_C = R_s + R_d), as shown in Fig. 2a²⁰. The intrinsic gate-to-source and drain-to-source biases (after considering the voltage drops across R_s and R_d) are ({V}_{{rm{gs}}}^{{prime} }={V}_{{rm{g}}}-{V}_{{rm{s}}}^{{prime} }) and ({V}_{{rm{ds}}}^{{prime} }={V}_{{rm{d}}}^{{prime} }-{V}_{{rm{s}}}^{{prime} }), where ({V}_{{rm{s}}}^{{prime} }) and ({V}_{{rm{d}}}^{{prime} }) are defined in Fig. 2a. In the linear region of an n-channel FET (({V}_{{rm{gs}}}^{{prime} } > {V}_{{rm{T}}}) and ({V}_{{rm{ds}}}^{{prime} } < {V}_{{rm{gs}}}^{{prime} }-{V}_{{rm{T}}})), I_d is:

$$frac{{I}_{{rm{d}}}}{W}=frac{mu {C}_{{rm{ox}}}}{{L}_{{rm{ch}}}}left({V}_{{rm{gs}}}^{{prime} }-{V}_{{rm{T}}} – frac{{{V}_{{rm{ds}}}^{{prime} }}}{2}right){V}_{{rm{ds}}}^{prime}$$

(1)

where C_ox is the gate insulator capacitance per unit area, and W and L_ch are the width and length of the channel. In this extraction, we build a system of equations based on Eq. (1) that we use to simultaneously solve for μ and V_T. In Eq. (1), V_T refers explicitly to the true channel V_T associated with channel inversion (as defined in Fig. 1f); thus, this channel V_T often cannot be extracted directly in contact-gated devices. [For emerging FETs with intrinsic channels (no counter-doping), such as two-dimensional (2D) FETs, the channel is considered inverted when the carrier concentration is approximately equal to the density of states at the relevant band edge²¹.]

To build our system of equations, we use a TLM-like approach where we consider a family of devices with various L_ch (using multiple two-terminal devices or a larger TLM-like test structure). As R_s and R_d are V_gs-dependent, we use I_d vs. V_ds sweeps at fixed values of V_gs to ensure these resistances remain constant. Further, as R_s and R_d contain Schottky diodes, they are nonlinear circuit elements, i.e., their resistances are functions of I_d. To ensure constant R_C, we therefore perform the extraction at a constant current.

With these considerations in mind, we begin by choosing a target drain current ({I}_{{rm{d}}}^{{rm{T}}}). We then perform I_d vs. V_ds sweeps for each L_ch at a common V_gs, recording the V_ds at which the device with the i^th channel length reaches I_d = ({I}_{{rm{d}}}^{{rm{T}}}) as ({V}_{{rm{ds}}}={V}_{{rm{ds}}}^{(i)}) (Fig. 2b). Then, we plot ({V}_{{rm{ds}}}^{(i)}) vs. L_ch and perform linear regression (Fig. 2c); the y-intercept of the line of best fit yields the voltage drop across the contacts ({Delta V}_{{rm{C}}}) at ({I}_{{rm{d}}}={I}_{{rm{d}}}^{{rm{T}}}). (ΔV_C is the summed voltage drop across the source and drain contacts, not the average voltage drop across either contact.) Here, ({I}_{{rm{d}}}^{{rm{T}}}) must be chosen such that the extracted ({V}_{{rm{ds}}}^{(i)}) values are small (all ({V}_{{rm{ds}}}^{(i)}) ≪ V_gs − V_T) to minimize the quadratic term in Eq. (1); otherwise, the linear fit in Fig. 2c becomes invalid. Additionally, a small ({V}_{{rm{ds}}}^{(i)}) ensures that the vertical electric field near the drain is similar for all channel lengths, helping to ensure that R_d remains constant across all devices.

Next, we partition ({Delta V}_{{rm{C}}}) into the voltage drops across the source and drain, ({Delta V}_{{rm{s}}}) and ({Delta V}_{{rm{d}}}). As R_s and R_d contain reverse- and forward-biased Schottky diodes, respectively, we have R_s > R_d²⁰. Further, as R_d approaches 0 for high drain bias²², we have:

$$0le {Delta V}_{{rm{d}}}le {Delta V}_{{rm{C}}}/2$$

(2)

$${Delta V}_{{rm{C}}}/2,le {Delta V}_{{rm{s}}}le {Delta V}_{{rm{C}}}$$

(3)

For simplicity, we take the centers of these ranges, i.e., ({Delta V}_{{rm{d}}}approx frac{1}{4}Delta {V}_{{rm{C}}}) and ({Delta V}_{{rm{s}}}approx frac{3}{4}Delta {V}_{{rm{C}}}), and estimate the true intrinsic voltages as ({V}_{{rm{ds}}}^{{prime} (i)}={V}_{{rm{ds}}}^{(i)}-{{Delta }}{V}_{{rm{C}}}) and ({V}_{{rm{gs}}}^{{prime} }={V}_{{rm{gs}}}-{{Delta }}{V}_{{rm{s}}}approx {V}_{{rm{gs}}}-frac{3}{4}Delta {V}_{{rm{C}}}).

We use the above approach to extract ({V}_{{rm{ds}}}^{{prime} (i)}) and ({V}_{{rm{gs}}}^{{prime} }) at multiple fixed V_gs values to compile the table in Fig. 2d. Next, we use these tabulated values to build a system of equations from which we extract μ and V_T. To do so, we rearrange Eq. (1) into:

$$frac{2{V}_{{rm{gs}}}^{{prime} }{V}_{{rm{ds}}}^{{prime} }-{V}_{{rm{ds}}}^{{prime} 2}}{{L}_{{rm{ch}}}}=2{V}_{{rm{T}}}frac{{V}_{{rm{ds}}}^{{prime} }}{{L}_{{rm{ch}}}}+frac{{2I}_{{rm{d}}}}{mu {C}_{{rm{ox}}}W}$$

(4)

As Eq. (4) is in the form y = mx + b [with m = (2{V}_{{rm{T}}}), x = ({V}_{{rm{ds}}}^{{prime} }/{L}_{{rm{ch}}}), and b = (2{I}_{{rm{d}}}/(mu {C}_{{rm{ox}}}W))], we use rows from Fig. 2d to plot ((2{V}_{{rm{gs}}}^{{prime} }{V}_{{rm{ds}}}^{{prime} left(iright)}-{V}_{{rm{ds}}}^{{prime} left(iright)2})/{L}_{{rm{ch}}}^{(i)}) as a function of ({V}_{{rm{ds}}}^{{prime} (i)}/{L}_{{rm{ch}}}^{(i)}) and perform linear regression (Fig. 2e). We then use the extracted slope and intercept to calculate V_T and μ from known quantities.

Although we present this derivation for n-type devices, this procedure can easily be adapted to p-type devices by repeating the derivation starting from the p-type analog of Eq. (1). Alternatively, one could apply the above procedure to p-type devices by negating the input V_gs, taking the absolute value of V_ds, and then negating the extracted V_T.

We note that (Delta {V}_{{rm{C}}}) extracted in Fig. 2c is accompanied by an associated error that leads to uncertainty in the extracted μ and V_T. Simple analytic techniques cannot easily propagate this error to the final estimated μ and V_T because the quantities along both the x- and y-axes in Fig. 2e are error-prone, where the errors in x and y values are not mutually independent. Hence, we instead use the Monte Carlo approach described in Supplementary Section 2 to propagate this error, allowing us to improve the estimates for the nominal μ and V_T and their standard errors. This Monte Carlo approach is implemented in Python code that we provide in Supplementary Section 4 and in an online GitHub Repository¹⁹.

Model validation

We validate our proposed extraction by using it to estimate μ and V_T from current-voltage characteristics generated by Sentaurus Device TCAD¹⁸. This approach allows us to assess the accuracy of the extraction because μ and V_T are known a priori: μ is a simulation input parameter, and the channel V_T can be extracted from equivalent devices (simulated in Sentaurus) without contact resistance.

The contact-gated devices in our TCAD simulations have nominal Schottky barrier heights ϕ_B = 0.15, 0.3, 0.45, and 0.6 eV. These are “nominal” values because the TCAD simulations include image force lowering (IFL)²³ and tunneling at the contacts; these listed ϕ_B are barrier heights before IFL (i.e., the ϕ_B we list are the differences between the semiconductor’s electron affinity and the metal’s work function). All devices are back-gated transistors (Fig. 1a) with HfO₂ gate insulators (relative dielectric constant κ = 20 and equivalent oxide thickness EOT = 10 nm). The channel thickness is 0.615 nm (corresponding to monolayer MoS₂^24,25) and the mobility is set to μ = 50 cm²V^-1s^-1.

In Fig. 3a–d, we plot TCAD-generated I_d vs. V_ov sweeps (where the overdrive voltage V_ov = V_gs − V_T) at V_ds = 0.1 V for each ϕ_B at L_ch = 200, 400, …, 1000 nm. Devices with ϕ_B ≥ 0.3 eV clearly display the signature kink of contact gating⁷ (especially at small L_ch, where R_C exceeds the channel resistance at low V_ov). Next, we plot the extracted μ and V_T for each ϕ_B using our proposed extraction method and three conventional techniques: the linear extrapolation, Y-function, and TLM approaches^1,16,26. These conventional techniques are applied directly on the synthetic I_d vs. V_gs data shown in Fig. 3a–d, whereas our extraction uses separate synthetic I_d vs. V_ds data. The linear extrapolation and Y-function techniques both use the device with the longest channel (L_ch = 1000 nm), whereas our proposed method and the TLM approach use the full range of L_ch = 200–1000 nm in Fig. 3a–d. We calculate standard error using the method described in Supplementary Section 2 for our proposed extraction method, and we use the standard error from linear regression (equivalent to the 68% confidence interval) for the TLM approach. (We do not include standard error for the linear extrapolation or Y-function approaches because the synthetic data is noiseless.)

**Fig. 3: Validation of our proposed extraction.**

In Fig. 3e–h, we plot the μ extracted from each method vs. V_ov/EOT. The horizontal axes in these plots are shown only up to the point where the μ obtained by the four extraction methods have converged. At ϕ_B = 0.15 eV (Fig. 3e), we find that all extractions yield reasonable estimates for μ, with a worst-case underestimation of ~15% for the Y-function method. As ϕ_B increases, however, we find that conventional methods begin to severely overestimate μ due to contact gating. We also note the TLM approach predicts a small standard error (<10%) in Fig. 3g, h (ϕ_B = 0.45 and 0.6 eV) despite overestimating μ by over 2×. In other words, the standard error estimated from the TLM approach does not accurately reflect the true uncertainty in the extracted μ when ϕ_B is large and V_ov is limited (e.g., by early dielectric breakdown). In contrast, our method estimates μ more accurately than conventional methods, with a worst-case overestimation of ~20% at ϕ_B = 0.6 eV and low V_ov (Fig. 3h), and with the true μ being captured within our error bars (unlike the TLM method).

We note that the TLM approach requires that devices with different L_ch be measured at a common carrier density, i.e., at a common V_ov¹. In the present work, V_ov is referenced with respect to the V_T estimated by linear extrapolation; in Supplementary Section 3, we study the accuracy of the TLM approach when instead using V_T defined at a constant current (e.g., 100 nA/μm).

Next, in Fig. 3i–l, we plot the V_T extracted from our proposed method, the linear extrapolation method, and the Y-function method vs. V_ov/EOT using the same horizontal x-axis limits as in Fig. 3e–h. Importantly, in these transistors, contact gating obscures the channel turn-on, causing the linear extrapolation and Y-function methods to significantly overestimate V_T. In comparison, we find that our proposed extraction tends to yield much more accurate V_T estimates, with a worst-case V_T error of 0.2 V in the range of V_ov/EOT plotted in Fig. 3i–l. We note that our method and the conventional methods tested here do not always converge to the true V_T at higher V_ov, but this is acceptable because the error in estimated V_T is less impactful (i.e., has smaller impact on the predicted charge carrier density) at large V_ov.

To ensure that our proposed extraction is applicable to a variety of devices (and not limited to those presented in Fig. 3), we repeat similar extractions for back-gated transistors with (i) μ = 5 cm²V⁻¹s⁻¹ and EOT = 10 nm and (ii) μ = 50 cm²V⁻¹s⁻¹ and EOT = 100 nm (channel thickness = 0.615 nm and ϕ_B = 0.45 eV for all devices) in Fig. 4. These scenarios are relevant because they correspond to typical devices used to test emerging semiconductor channels. We plot I_d vs. V_ov in Fig. 4a, b, extracted μ in Fig. 4c, d, and extracted V_T in Fig. 4e, f. We find that the trends observed here are similar to those of Fig. 3, suggesting that our proposed extraction remains applicable at higher EOTs or lower μ. Thus, the method we propose in this work appears to facilitate accurate extractions from a variety of contact-gated transistors with high R_C and/or early dielectric breakdown that cannot access the higher V_ov range necessitated by conventional methods.

**Fig. 4: Validation of our proposed extraction on low-mobility and high-EOT transistors.**

Effect of device-to-device variation

To assess the robustness of our extraction, we apply it to devices whose μ and V_T have a certain amount of variation, as would be seen experimentally. For each device, we randomly select μ and V_T according to Gaussian distributions with means (standard deviations) of 50 cm²V⁻¹s⁻¹ (10%) and 0.56 V (0.1 V), respectively. As before, we use Sentaurus TCAD¹⁸ to generate current-voltage characteristics that we analyze with our proposed method and the TLM approach. Because the Y-function and linear extrapolation methods are applied to one device at a time, they are not affected by variations between devices; thus, we do not re-analyze them here. We quantify each method’s accuracy in terms of its mean absolute error (MAE) and its confidence interval coverage probability (CICP; the probability that the true μ lies within the range of estimated value ± the error). In other words, an MAE near 0% (or as small as possible) and a CICP close to 100% (or as large as possible) are desirable. All devices are identical to those used in Fig. 3c, i.e., back-gated with EOT = 10 nm, channel thickness = 0.615 nm, and ϕ_B = 0.45 eV.

We perform 100 extractions on families of devices with 5 channel lengths (L_ch = 200, 400, …, 1000 nm), starting at high V_ov/EOT = 0.64 V/nm. We find that our proposed approach and the TLM approach offer reasonably small MAE = 14.3% and 10.9% (on the same order as the μ standard deviation), respectively, and CICPs of 99% and 65%, respectively (Fig. 5a, b), indicating that random variation does not significantly affect the accuracy or reliability of these methods at high V_ov.

**Fig. 5: Robustness to variation of our method and of the TLM approach.**

Next, we repeat this procedure at smaller V_ov/EOT = 0.3 V/nm, which lies within the contact-influenced region of the I_d vs. V_ov curves in Fig. 3c. Here, the MAE of our proposed method increases to 20.2% and its CICP remains high at 94% (Fig. 5c). However, the MAE of the TLM approach increases greatly to 116.0% (>2× mobility overestimation), whereas its CICP falls to 0%, i.e., the TLM approach did not estimate μ to within error bars across any of the 100 trials (Fig. 5d). The MAE of our approach can be improved by adding more devices; repeating the extraction at V_ov/EOT = 0.3 V/nm using three of each L_ch (Fig. 5e; with devices subject to the same random variations as before) decreases the MAE of our approach to 13.3%, though the CICP also worsens slightly to 78% (which may occur in part because the estimated standard error shrinks). However, the MAE of the TLM approach only decreases slightly to 110.0% (~2× mobility overestimation) and the CICP remains at 0% (Fig. 5f), indicating that adding more devices to the TLM analysis is ineffective for improving both accuracy and reliability at V_ov/EOT = 0.3 V/nm.

We note that although Fig. 5c shows our method yields a reasonably low MAE = 20.2% on the entire set of 100 device families, the 10 worst extractions still overestimate μ by 48% to 102%. However, the CICP considering only these 10 extractions is 100%, i.e., each of these 10 worst extractions also yielded large estimated errors that encompassed the true μ of 50 cm²V⁻¹s⁻¹. Thus, although our method can overestimate μ of contact-gated devices (here with ϕ_B = 0.45 eV), these overestimations are accompanied by large error bars that clearly indicate when the extraction is error-prone.

Discussion

We have developed a simple method for extracting the mobility and channel threshold voltage from transistors with gate-voltage-dependent contact resistance. We tested this method by analyzing TCAD-generated current-voltage characteristics and showed it can accurately extract the mobility and threshold voltage when devices are heavily influenced by contact gating, even when conventional methods overestimate the mobility by 2–3×. We also find that the standard error associated with the estimated mobility and threshold voltage tends to accurately reflect the actual uncertainty in the extraction, enabling a high confidence extraction of mobility even in regimes where the TLM approach fails. Hence, our method expands the range of overdrive voltages that can be used to estimate mobility and threshold voltage, allowing these quantities to be more accurately determined in emerging semiconductor devices with high contact resistance and/or early dielectric breakdown.