Lingqi
Li‡
,
Heng
Xiang‡
,
Haofei
Zheng
,
Yu-Chieh
Chien
,
Ngoc Thanh
Duong
,
Jing
Gao
and
Kah-Wee
Ang
*
Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, Singapore 117583. E-mail: eleakw@nus.edu.sg
First published on 11th March 2024
Reservoir computing (RC), a variant of recurrent neural networks (RNNs), is well-known for its reduced energy consumption through exclusive focus on training the output weight and its superior performance in handling spatiotemporal information. Implementing these networks in hardware requires devices with superior fading memory behavior. Unlike filament-based two-terminal devices, those relying on ferroelectric switching demonstrate improved voltage reliability, while three-terminal transistors provide additional active control. HfO2-based ferroelectric materials such as Hf0.5Zr0.5O2 (HZO), have garnered attention for their scalability and seamless integration with CMOS technology. This study implements a RC hardware based on MoS2–HZO integrated device structure with enhanced spontaneous polarization field. By adjusting the oxygen vacancy concentration, the devices exhibit consistent responses to both identical and nonidentical voltages, making them suitable for diverse RC applications. The high accuracy of MNIST handwritten digits recognition highlights the rich reservoir states of the traditional RC architecture. Additionally, the impact of masks on RC implementation is assessed, showcasing the device's capability for spatiotemporal signal analysis. This development paves the way for implementing energy-efficient and high-performance computing solutions.
New conceptsThis manuscript systematically presents, for the first time, the optimization of device structure and thin-film deposition to realize an MoS2–HZO integrated FeFET designed for diverse real-life tasks within an RC system. Recognized for its energy-saving capabilities, RC stands out as one of the most efficient neuromorphic computing algorithms. However, given the inherent passive control in reservoirs based on two-terminal devices, actively managing a stable physical reservoir becomes imperative for the implementation of RC systems based on physical devices. The selection of HZO as the ferroelectric material proves advantageous due to its scalability and compatibility with CMOS processes. While HZO exhibits spontaneous depolarization, its impact on retention becomes significant only when the strength of this depolarization electric field surpasses the coercive field (EC). Through careful optimization of the device structure, the intrinsic depolarization electric field within HZO is enhanced, resulting in a volatile behavior. Additionally, the generation of reservoir states hinges on the nonlinear reversal of ferroelectric domains in the polycrystalline structure of HZO, which corresponds to the voltage intensity and pulse duration. Tuning the deposition process of HZO can influence its phase formation, subsequently affecting the flipping behavior of the ferroelectric domains. Therefore, selecting appropriate deposition conditions is crucial for the adaptability of the device to different systems. This proof-of-concept demonstration opens new possibilities for the application of FeFETs in energy-efficient machine learning. |
Physically implemented reservoirs require physical devices that are responsive to continuous pulse signals and exhibit a non-linear fading effect, which simplify computations to spatiotemporal node states.7,8 At present, memristors-based RC systems have been widely explored in various applications, such as pattern classification,9–13 spoken digit recognition,10,12,14–16 and chaotic time series prediction.7,17,18 Among these, most devices utilize ions to establish conductive channels within the dielectric for regulating resistance, yet this approach introduces variability and restricts their controllability. Compared to those based on filament formation, devices utilizing ferroelectric domain switching through voltage control, demonstrate superior voltage reliability, prolonged lifespan and accelerated operational speed. However, the destructive nature of the read/write operations and the passive characteristics of FeRAMs have limited their widespread adoption. In contrast, three-terminal ferroelectric field-effect transistors (FeFETs) offer a promising alternative. FeFETs exhibit fading memory by utilizing the polarization reversal induced by the applied gate voltage (VG) and the depolarization field when the gate voltage is removed (VG = 0 V). These ferroelectric-based three-terminal devices not only showcase active control capabilities but also exhibit reconfigurability.19,20
In this work, we utilize a two-dimensional (2D) MoS2 channel to provide excellent electrostatic control of the channel, which is integrated with an interfacial layer (Al2O3) to mitigate the random and uncontrollable interface trapping effects commonly observed in three-dimensional bulk channels.21,22 By utilizing the insufficient charge compensation-induced depolarization field between the HZO layer and the Al2O3 layer, our physical dynamic devices exhibit a remarkable non-linear fading memory characteristic. Notably, to make our devices suitable for diverse RC tasks, we fine-tune the ALD process by adjusting the O3 pulse time to modulate the oxygen vacancy concentration in HZO and influence the voltage response of the ferroelectric domain. By optimizing growth conditions, our physical RC achieves high accuracy (>95%) in Modified National Institute of Standards and Technology (MNIST) handwritten digit recognition within just 40 training epochs, highlighting the abundance of reservoir states and the device's effectiveness in handling fundamental neuromorphic tasks. Furthermore, we successfully apply the cyclic RC concept to accomplish waveform classification tasks, yielding high determination coefficients (R2) of up to 0.961 and low normalized root mean square errors (NRMSE) of 0.099. These findings underscore the potential of our proof-of-concept demonstration in reducing hardware complexity and enabling energy-efficient machine learning across a wide range of real-life time-series information processing applications.
nm aluminium oxide (Al2O3) layer and a 10
nm polycrystalline HZO layer while tungsten (W) is used as the gate electrode. The polar orthorhombic phase (o-phase) with (111), o-phase (220) yields a d-spacing of 2.96 and 1.85 Å, as identified in the zoomed-in TEM image (Fig. 1(d)), corresponding to the (111) and (220) planes of o-phase, respectively, as shown in the fast Fourier transform (FFT) diffraction pattern in Fig. 1(e). The consistency of these results with previous studies affirms the presence of ferroelectric phases in the HZO layer.23,24 Energy Dispersive X-ray spectroscopy (EDX) analysis is performed (Fig. S2, ESI†) to validate the chemical compositions of the MoS2–HZO integrated device.
To examine the ferroelectric behavior, we fabricate metal–ferroelectric–metal (MFM) devices using the same HZO process conditions as our HZO-based MoS2 FeFETs. We conduct a comprehensive analysis of the capacitance–voltage (C–V) characteristics at various frequencies to examine the property of the 10-nm HZO layer. Fig. 1(f) illustrates the “butterfly” shaped C–V curves, with the two peaks of the curve corresponding to the coercive voltages (Ec) of the ferroelectric film. These observed characteristics are well-recognized indications of ferroelectric behavior.
First-order reversal curve (FORC) is a robust experimental technique to analyze the switching field distributions.25,26Fig. 1(g) shows the built-in bias field and coercive field in terms of the Preisach switching density (ρ) during endurance tests. Fig. S3 (ESI†) illustrates the ρ measurement process, which initiates a positive saturated field and sweeps between the reversal field and positive saturation, irementally amplifying the magnitude of the reversal field (Er) until a negative saturation polarization is achieved. By sweeping the applied field, the polarization–electric field (P–E) curve and switching current density corresponding to each reversal field Er,i can be obtained. The switching density ρ−(Er,E) can be calculated using the following equation:25–27
![]() | (1) |
![]() | (2) |
000 times ferroelectric domain switching. In Fig. 1(h), the corresponding FORC contour plot is displayed for the sample after 5000 cycles. It can be observed that the distribution related to the built-in bias and coercive field remained largely unchanged, confirming the reliability of HZO.
Fig. 2(b) illustrates the time-dependent change of ΔP(t)/(2Pr) under different Vext values (from 1.08 V to 3.00 V), represented by symbols. Furthermore, the TEM image in Fig. 1(c) clearly indicates that the crystallized HZO film comprises multiple domains with distinct orientations. Therefore, the application of the nucleation-limited-switching (NLS) model is crucial to validate the observed domain switching behavior, as previously established in related studies.29,30 The solid lines in the figure depict the fitted results obtained using the NLS model. This model is described by the following formula:
![]() | (3) |
![]() | (4) |
t1 corresponds to the median logarithmic value of the distribution. One can observe that when the voltage is below the coercive voltage of the ferroelectric material, there will be a less frequent occurrence of domain switching in the minor loop. When the applied voltage is larger than the coercive voltage, the domain switching demonstrates non-linear behavior in terms of pulse width and voltage magnitude.
Furthermore, in Fig. 2(c), an energy diagram is presented to illustrate the electron accumulation behavior in an HZO-based FeFET. With a positive applied voltage, a multitude of dipoles switch, drawing more channel electrons and thereby enhancing the channel conductance, resulting in an increased current at the drain terminal. Considering that different voltage intensities or pulse widths would lead to the reversal of varying quantities of dipoles and distinct accumulations of charges in the channel, the FeFET device displays unique response characteristics when subjected to sequential input voltage pulses at the gate terminal.
Additionally, we harness the intrinsic depolarization electric field in HZO-FeFETs to facilitate the fading memory process. While HZO has demonstrated a stable polarization that can be reversed under the influence of an electric field, the scenario with FeFETs is generally more intricate. As FeFETs typically comprise a semiconductor layer, an interfacial layer, and a ferroelectric layer, insufficient charge compensation at the ferroelectric layer interface might result in a depolarization field. The introduction of such depolarizing field causes HZO to exhibit memory decay instead of long-term memory.
To confirm that the manifestation of fading memory in HZO is due to insufficient charge compensation between HZO and interfacial Al2O3, we fabricated metal-ferroelectric-insulator-semiconductor (MFIS)- and metal-ferroelectric-metal-insulator-semiconductor (MFMIS)-FeFETs using similar fabrication processes. Compared to the MFIS-FeFET, the MFMIS-FeFET employs a metal to compensate for the charge at the ferroelectric layer, thereby aiding HZO in reducing the depolarization field.23,31 We initially compared the transfer characteristics of the two structures. It is evident that the device in the MFMIS structure exhibited a counterclockwise hysteresis loop, while the device in the MFIS structure displayed a minor clockwise loop. The counter clockwise hysteresis is attributed to the polarization reversal of HZO under the influence of positive voltage, considered a characteristic behavior of FeFET,32 while the clockwise hysteresis can be attributed to the trapping dominant behavior. However, in the multipulse I–V measurements of the MFIS-FeFET, one can observe a positive ΔID when subjected to program voltage pulses with different time widths and voltage amplitudes. Note that, charge trapping induces a reduction in ΔID. Consequently, the observed positive increase in ΔID serves as compelling evidence of ferroelectric switching. When electrons are trapped within the gate dielectric, its concentration in the channel reduces, which resulted in a negative ΔID instead of a positive ΔID. Therefore, the positive window in the transfer curve of the MFIS-FeFET is attributed to the existence of a depolarization field that induces the weakened polarization reversal of the HZO. This effect is not sufficient to offset the charge trapping induced by DC measurements. The distinct transfer direction can be elucidated by the incomplete charge compensation in the MFIS-FeFET. This incomplete compensation results in the creation of a built-in potential opposite to the polarization direction (as shown in Fig. 2(f)), contributing significantly to a non-negligible depolarization field, consequently leading to retention loss.33,34
To comprehensively understand the effects induced by charge incomplete compensation, we further analyze the decrease in drain current after positive voltage programming. Fig. 2(g) and (h) show the variation of ΔID with respect to the delay time when the positive program voltage is applied in MFIS-FeFET and MFMIS-FeFET, respectively. It is apparent that the overall current change in the MFIS-FeFET is minor and rapidly decays, while the MFMIS-FeFET exhibits a significant albeit gradual decay. This phenomenon demonstrates the ability of the device to achieve fading memory behavior without the introduction of metal in the ferroelectric and interfacial layers. Furthermore, it has been reported that the depolarization induced retention loss can be well captured by the power-law equation35
![]() | (5) |
Oxygen vacancy (VO) is an important parameter in the polarization switching kinetics of HZO which can be adjusted by the ozone pulse time during deposition. Fig. 3(a)–(c) showcase the ferroelectric domain switching behavior with variations in pulse widths and voltage amplitudes, clearly illustrating the significant influence of VO in HZO on field-dependent ferroelectric polarization switching dynamics. Fig. 3(d) demonstrates distinct switching behavior under the same voltage amplitude but varying pulse widths, extending to the response to an identical pulse train. Meanwhile, Fig. 3(e) presents diverse results under different amplitudes but a fixed pulse width of 10 μs, extending to the response to a non-identical pulse train. Both figures show a consistent trend, wherein larger ozone pulse times reflect an overall increase in switching times for polarization reversal. Shorter ozone pulse times result in a rapid ferroelectric response but also reach saturation quickly, while moderate ozone pulse times generate the most homogeneous ferroelectric response loop. This behavior is attributed to the influence of VO on phase generation in HZO.36,37
Grazing-incidence X-ray diffraction (GIXRD) measurements were conducted to analyze the various crystalline structures of the HZO layer and the relative ratios of crystalline phases within the films under different ozone pulse times, as illustrated in Fig. 3(f). The experiments were performed with an incident angle of 0.5° using Cu radiation, and the 2θ range spanned from 20° to 50°. The GIXRD patterns displayed the most prominent diffraction peaks at a 2θ value of approximately 30.4°, corresponding to the mixed reflections from the (111) planes of the o-phase or the (101) planes of the t-phase.24,38 Another distinct peak at a 2θ value of around 31.5° corresponded to the (111) planes of the m-phase. The peaks of the o (111)/t (011) and m (111) were considered as potentially involved peaks for peak splitting as the differentiation of o (111) and t (011) from GIXRD is difficult due to the overlap of the diffraction peaks. The peaks were integrated within a 2θ range of 27.5° to 33.5°. Fig. 3(g) depicted the diffraction results, and the ratio Ao-/t-phase/Ao-/t-+m-phase was calculated, with A representing the peak areal intensity fitted using Gaussian function. Moreover, a shift to lower 2θ values was generally correlated with a higher polar orthorhombic phase fraction and a reduced nonpolar tetragonal phase fraction.39 The results of the area ratio and the 2θ values of the mixed peak are shown in Fig. 3(g) for comparison. While tO3 = 0.1 s is associated with a high o-/t-phase ratio, the concentration in the mixed peak mostly comprised the t-phase. Conversely, tO3 = 5.0 s is linked to a high m-phase and a relatively high t-phase concentration compared with tO3 = 1.0 s. The observed tendency indicates that the increase in the oxygen content in the HZO film led to a higher m-phase fraction, whereas an excessive VO concentration is preferred for the t-phase stabilization. The o-phase is known to contribute to ferroelectricity, while the t-phase or m-phase does not. Additionally, when an electric field is applied to the HZO film, domains with an orthorhombic phase initially switches around pinned domains with the same polarization orientation as the direction of the applied field.40 These pinned domains served as seed domains, leading to initial switching. Consequently, the reduction in the o-phase and seed domains with increasing oxygen pulse time was suggested as the primary factor contributing to the overall increase in switching times for polarization reversal. However, the abundance of ozone vacancies, signalling insufficient polarization for effective switching, corresponds to the trend of easily saturated switching.
In Fig. 3(h), the HZO film deposited using tO3 = 1.0 s presents the highest Pr in the polarization-voltage hysteresis loops, which is attributed to the largest fraction of o-phase and less non-ferroelectric m-phase as indicated in the GIXRD results. Furthermore, the reduced defects, as suggested in Fig. 3(i), also contribute to the largest polarization. In addition, the film of tO3 = 1.0 s exhibits more than one order of magnitude lower in the leakage current than the film of tO3 = 0.1 s (Fig. 3(i)), but slightly decreased domain switching dynamic, making it suitable for subsequent device fabrication.
To perform the MNIST task, a crucial requirement is that the devices must exhibit the ability to respond to a series of voltage pulses and demonstrate rich final states under different sequences, where the input pulses share the same amplitude. The key parameter for generating a sufficiently diverse reservoir state is long-term weight adjustment. Long-term potentiation (LTP) measurements are conducted to highlight the device's capability to produce a diverse reservoir state after a pulse train as shown in Fig. 4(b). Following each program pulse, a read voltage (Vread) of 0 V with VD = 0.5 V is applied to measure transient conductance. Specifically, when the channel current induced by the first pulse cannot fully relax to the initial state, the arrival of the second pulse results in a higher current response, indicating an accumulation effect. The observed fading memory properties, as discussed in Fig. 2(g), play a crucial role in the spatiotemporal dynamics of our HZO-based FeFETs. These properties lead to the generation of a non-linear accumulation value after each pulse in the train, ultimately producing a distinct final conductance state.
Due to the inadequate responsiveness difference of the device to continuous pulses, we perform a pre-processing of the original images, as shown in Fig. 4(a). Taking a digit ‘0’ image as an example, we first binarize and crop the original 28 × 28 grayscale image to obtain a 20 × 20 image with 20 pixels per row. We then remove the unused border area and set the grayscale values below 80 to ‘0’ and those above 80 to ‘1’. Subsequently, each row is divided into smaller sections by combining five pixels into one which converts the 20 × 20 image into 20 × 4 matrix. Note that each pixel in the 20 × 4 matrix represents one conductance state. These conductance values are fed to reservoir and served as an individual reservoir node, which is linked to others in the same rows to form a high-dimensional input vector Xi for software training. The algorithm for training is a simple logistic regression with sigmoid function as follows:
![]() | (6) |
By using this method, we convert 60
000 images in MNIST database to conductance matrix. Herein, the 32 different pulse streams contain 5 consecutive binary signals (e.g., 00011), where the pulse width and interval are 100 μs and 60 μs, respectively. Given a series of five continuous binary signals and only one final conductance value is considered, there are a total of 32 possible conductance states for the reservoir. The training information, comprising of 48
000 digit images (80% of the database), are then fed into the readout layer to obtain the weight matrix (Wout). It is crucial to note that the structure facilitating communication between the reservoir and the output layer is a fully connected layer. After Wout is formed, the rest of dataset consisting of 12
000 images are used to test the recognition accuracy.
In order to provide a clear illustration, we have selected 12 representative states to depict their responses to different temporal inputs as shown in Fig. 4(e). One can observe that each input signal has a unique output conductance such that the device can be used to separate different inputs. Fig. 4(c) demonstrates that the training accuracy reached 100%, and the loss decreased to 0 after 30 epochs using 48
000 training images (80% of the dataset). It is important to note that the training loss and accuracy were determined by sparse categorical cross entropy and accuracy, respectively. Fig. 4(d) illustrates that the testing accuracy for 12
000 images reached 95.43% using a pulse train with a pulse amplitude of 3 V and a pulse time of 100 μs.
Fig. 4(f) shows the benchmarking of MNIST classification task using not only RC but also other neural network-based devices based on 2D materials or ferroelectric materials.7,10,11,41–47 Our devices show rich reservoir states to deal with the task and superior performance with less training cost, which are attributed to the minimized network size.
When addressing temporal information, the utilization of a binary mask enhances cyclic RC capability through time multiplexing. Simultaneously, it elevates the demand for physical devices to generate varied responses to distinct input voltage amplitudes. A pulse train, encompassing non-identical pulse amplitudes, is applied to our device to assess its response. The pulse width is consistently set to 100 μs, and VD remains fixed at 0.5 V. The pulse amplitudes range from 0.94 to 3.46 V in increments of 0.1 V. The non-linear behavior depicted in Fig. 5(c) underscores the remarkable capability of our device in effectively handling pulse trains with varying amplitudes.
A benchmark waveform classification task is carried out to further evaluate the time-series signal performance of the FeFET-based RC system. Two parameters, R2 and NRMSE, are used to quantify the performance. An R2 value close to 1 indicates that the experimental results fit well with the expected results. NRMSE is used to measure the classification error, which is described as:14,48
![]() | (7) |
As exhibited in Fig. 5(b), the original input signal is a random combination of sine and square waves, and the purpose of the task is to use ‘0’ and ‘1’ to represent square and sine waves, respectively.
The optimal classification results are dependent on the number of parallel masks (N) and programmed voltage (VP) as elucidated in Fig. 5(d), with a fixed mask length (M) of 10. The programmed voltage is a constant derived from the multiplication of a binary matrix by a scaling factor, effectively augmenting the values within the matrix. Specifically, when VP is set to 2 V, the matrix elements assume the values of ‘−2’ and ‘2’. We aggregate masks in parallel as a unified entity to preprocess a consistent input signal with varying programmed voltages. It is noteworthy that each mask within the group is unique, yet those applied to different voltages remain consistent. Observations within the VP range of 2 V to 3 V unveil a notable trend: an increase in mask number corresponds to a simultaneous reduction in error rate R2 and an increase in NRMSE. Additionally, for specific N, a distinct pattern emerges wherein an elevation in VP results in a low R2 and a large NRMSE. This dependence is attributed to the system's fundamental capability in handling continuous signals. In other words, the overall change in Fe-FET conductance is constrained, and excessive VP can drive reservoir states to the upper limit, compromising further signal processing capabilities in subsequent durations.
Apart from the number of N and VP, M is a critical parameter. Here, the length of the mask matrix represents the number of virtual nodes, reflecting the state richness of the RC system determined by the binary combinations of the mask sequence. Consequently, we delve into the performance dependence of these two parameters, fixing VP at 2 V for comparison. The waveform classification task presents a notable challenge to the system when either a single mask is applied, or the length of each mask is only one. Expectedly, with the increase in values of N and M, we observe a significant improvement in R2, rising from the lowest value of 0.523 to a high accuracy value of 0.961 in Fig. 5(e). Simultaneously, NRMSE, depicted in Fig. 5(f), decreases to an exceptionally low value of 0.099. The optimal classification results, represented by the gray set points, for the three parameters mentioned above are illustrated in Fig. 5(g) with VP set to 2 V, M fixed at 10, and N at 10. The red set showcases the expected results, where ‘0’ and ‘1’ denote square and sine waves, respectively.
Finally, top W electrodes is deposited and an area of 100 × 100 μm2 are defined using the optical lithography. Subsequently, standard commercial tungsten etchant is used to do the wet etch.
000 training images and 10
000 testing images (sized 28 × 28 pixels) representing digits written by high-school students and employees of the United States Census Bureau.
The training algorithm employed is a logistic regression (LR) known as Scikit-learn package which can be found in Python program. All root means square errors were calculated by sklearn. metrics (
) tool in the package.
Footnotes |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3nh00524k |
| ‡ L. L. and H. X. contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2024 |