Sungjun
Kim‡
ab,
Keun
Heo‡
b,
Sunghun
Lee
b,
Seunghwan
Seo
b,
Hyeongjun
Kim
b,
Jeongick
Cho
b,
Hyunkyu
Lee
b,
Kyeong-Bae
Lee
b and
Jin-Hong
Park
*b
aFoundry Division, Samsung Electronics Co. Ltd., Youngin 17113, Korea
bDepartment of Electrical and Computer Engineering, Sungkyunkwan University, Suwon 16419, Korea. E-mail: jhpark9@skku.edu
First published on 24th December 2020
Recently, various efforts have been made to implement synaptic characteristics with a ferroelectric field-effect transistor (FeFET), but in-depth physical analyses have not been reported thus far. Here, we investigated the effects by (i) the formation temperature of the ferroelectric material, poly(vinylidene fluoride-trifluoroethylene) P(VDF-TrFE) and (ii) the nature of the contact metals (Ti, Cr, Pd) of the FeFET on the operating performance of a FeFET-based artificial synapse in terms of various synaptic performance indices. Excellent ferroelectric properties were induced by maximizing the size and coverage ratio of the β-phase domains by annealing the P(VDF-TrFE) film at 140 °C. A metal that forms a relatively high barrier improved the dynamic range and nonlinearity by suppressing the contribution of the tunneling current to the post-synaptic current. Subsequently, we studied the influence of the synaptic characteristics on the training and recognition tasks by using two MNIST datasets (fashion and handwritten digits) and the multi-layer perceptron concept of neural networks.
New conceptsThere have been various attempts to realize synaptic operation using FeFETs, but no in-depth physical study has conducted in accordance with the co-analysis of ferroelectric properties and contact barrier heights so far. For the first time, we investigated how (i) the formation temperature of the ferroelectric material (P(VDF-TrFE)) and (ii) the nature of the contact metals (Ti, Cr, Pd) affect the performance of a FeFET synapse, in terms of its updating energy, dynamic range, nonlinearity, symmetricity, cycle to cycle variation, and relative standard deviation in the LTP/D characteristics. Subsequently, we studied the influence of the ferroelectric properties and contact barrier heights in the FeFET synapse on the training and recognition performance of a multi-layer perceptron-based neural network by using two datasets from the MNIST database (fashion and handwritten digits). We expect (i) the optimizations of ferroelectric materials and contact metals, and (ii) the relevant studies performed with the proposed FeFET synaptic device to be a cornerstone for the development of high-performance synaptic devices in future. |
Here, we investigate the effects by (i) the formation temperature of a ferroelectric material and (ii) the nature of the contact metals on the operating performance of an FeFET-based artificial synapse, in terms of its updating energy, dynamic range (DR), nonlinearity (NL), symmetricity, cycle-to-cycle variation (CCV), and relative standard deviation (RSD) in the long-term potentiation and depression (LTP/D) characteristic curves. The synaptic device consists of poly(vinylidene fluoride-trifluoroethylene) (P(VDF-TrFE)) as a gate dielectric and molybdenum disulfide (MoS2) as a channel material. These materials have a very low amount of surface defects due to the absence of dangling bonds, thereby providing a gate-channel interface with very few defects. Subsequently, we also study the influence of the synapse characteristics on the training and recognition performance of a multi-layer perceptron (MLP)-based NN by using two datasets from the MNIST database (fashion and handwritten digits).
As shown in the lower panel of Fig. 1b, the application of negative voltage to the WCT causes the dipoles in the PVDF film to orientate in the direction normal to the surface of the film as a result of alignment with the applied electric field (E-field). This orientation of the dipoles is maintained even after removal of the external electric field, resulting in a nonvolatile memory characteristic. In contrast, the opposite polarization occurs under positive voltage bias conditions (downward polarization). To determine whether this polarization mechanism enables the operation of the synaptic device, we investigated the change in conductance for various pulses, as shown in Fig. 1c. Nonvolatile responses were observed for various input spiking pulses with amplitudes of ±1, ±3, ±5 V, and time duration (td) of 10 ms. The percentage of the “conductance remaining after 64 s” relative to the “initial conductance value immediately after pulse application” is 8%, 26%, and 29% for exhibitory (positive) pulses with amplitudes of 1, 3, and 5 V, and 33%, 62%, and 71% for inhibitory (negative) pulses with amplitudes of −1, −3, and −5 V, respectively. Further details on conductance decay phenomenon as a function of pulse width are provided in the ESI,† Fig. S1. The potentiation (by an excitatory pulse, top) and depression (by an inhibitory pulse, bottom) responses were explained with the energy-band diagrams for the synaptic device, as shown in Fig. 1d. The application of an exhibitory spike to the WCT caused the energy-band of the MoS2 semiconductor to instantly bend upward and electrons were attracted to the MoS2 channel region. Simultaneously, randomly oriented dipoles in the P(VDF-TrFE) were aligned with the direction of the induced E-field. Consequently, the channel potential increased because of the aligned dipole moment, and the decreased tunneling width at the source-MoS2 junction resulted in higher carrier injection (channel conductance↑ = potentiation). In contrast, the application of an inhibitory spike resulted in the MoS2 energy-band instantly bending downward and the amount of electrons accumulated at the MoS2 channel to decrease. As a result, the channel potential is decreased and the tunneling width at the source-MoS2 junction increases, lowering the carrier injection (channel conductance↓ = depression). As a higher voltage pulse is supplied, molecules in the P(VDF-TrFE) are polarized even more strongly, causing a much larger change in the channel conductance.
Then, as shown in Fig. 1e (upper panel), we measured the postsynaptic current (PSC) by applying three kinds of pulse sets with different amplitudes (±1, ±3, ±5 V) consecutively to evaluate the LTP/D characteristics. Each pulse set consists of 128 excitatory and 128 inhibitory pulses, where the duration and interval of the pulses were fixed at 10 and 490 ms, respectively. The PSC changed gradually according to the excitatory and inhibitory spikes, consequently presenting general LTP/D characteristics. Especially, as the pulse amplitude increased from +1 to +5 V, the increase in the PSC became greater with each pulse to raise the maximum PSC (after 128 excitatory pulses) from 0.87 to 2.98 nA. As a result, the DR (Gmax/Gmin) was increased from 2.7 (for a 1 V pulse) to 8.7 (for a 5 V pulse). We also examined the effect of the change in the pulse amplitude in terms of energy consumption for updating, as shown in Fig. 1e (lower panel). The estimated energy was below 9.5 pJ for the pulses that were considered in the aforementioned measurements, and was lower than or comparable to the updating energy of FET-type synapses reported thus far (10 pJ).6,34,35 When increasing the pulse amplitude to 7 and 9 V, the updating energy was raised to 37.1 and 75.6 pJ, respectively. Additional information on the change in the updating energy with respect to the amplitude and duration of the pulses is provided in ESI,† Fig. S2. The gradual change in the conductance of the proposed Fe-FET synapse can be explained by partial polarization switching in the ferroelectric domains, as shown in Fig. 1f.22,24 When an excitatory spike is applied, the polarization domains in the P(VDF-TrFE) film are influenced by the direction of the induced E-field, which is aligned downward. This change in the polarization domains gradually occurs as the spike pulse is applied consecutively, thereby increasing the number of electrons in the channel region. In contrast, when an inhibitory spike is applied consecutively, the opposite direction of the E-field gradually aligns the polarization domains upward, thereby decreasing the number of electrons in the channel.
Because the P(VDF-TrFE) typically exhibits a thermodynamically stable ferroelectric phase (β-phase) when annealed between the Curie and melting temperatures, the processing temperature for crystallization is an important parameter for the implementation of P(VDF-TrFE)-based synaptic devices.33Fig. 2 presents the results of our investigation of the ferroelectric characteristics of the P(VDF-TrFE) films, which were annealed at four different annealing temperatures (60, 100, 140, and 180 °C) for 2 hours. For the capacitance–voltage analysis in Fig. 2a, we prepared ferroelectric capacitors with a metal-ferroelectric-metal (MFM) structure (area: 1 mm2), where 150 nm-thick Pt, 50 nm-thick Au, and 395 nm-thick P(VDF/TrFE) films were used as the bottom electrode, the top electrode, and the ferroelectric material, respectively. An AC voltage signal with an amplitude of 0.1 V and a frequency of 100 kHz was used for the capacitance measurement. The “butterfly loop” observed in the C–V characteristic curves indicates that ferroelectric properties were successfully conferred upon the corresponding P(VDF/TrFE) films.36 As shown in Fig. 2b (upper panel), this ferroelectric characteristic appeared in all capacitor samples except for the sample annealed at the lowest temperature of 60 °C, i.e., below the Curie temperature; furthermore, the hysteresis window broadened as the temperature increased. The hysteresis window indicates the difference between two coercive voltages, which are defined as external voltage to lower the net polarization of a ferroelectric film to zero.37 Thus, a ferroelectric capacitor with a wide hysteresis window formed at a high temperature means that a larger amount of energy was required to switch the polarization. The capacitance per unit area also increased when the annealing temperature increased to 140 °C, which indicates that the remnant polarization was enhanced in this temperature range as shown in Fig. 2b (lower panel). This increasing trend in switching energy and remnant polarization can be explained by the growing size of the ferroelectric domains shown in Fig. 2c (upper panel).38,39 However, the capacitance per unit area was lowered slightly when the annealing temperature increased from 140 to 180 °C, which could not be simply explained by the ferroelectric domain size because the hysteresis window increased in the same range. This is because certain parts of the β-phase domains responsible for ferroelectricity are transformed to α- and γ-phase domains, which have nonpolar characteristics above the melting temperature (164.7 °C). The creation of α- and γ-phase domains increases the spacing among the β-phase domains and reduces the size of the β-phase domains as graphically illustrated in Fig. 2c (upper panel, 180 °C), consequently increasing the polarization switching energy and decreasing the remnant polarization, respectively. Additionally, polarization-voltage hysteresis loops of each annealing condition are presented in ESI,† Fig. S3. For a reference, as shown in Fig. 2c (lower panel), the molecular dipoles in the β-phase structure are completely aligned in the same direction perpendicular to the long axis of the polymer chain, thereby showing large spontaneous polarization.32 In contrast, the dipoles in α- and γ-phase structures are alternately arranged in opposite directions along the axis of the polymer chain, thereby canceling the polarities of the dipoles vectorially. The Curie and melting temperatures of the P(VDF-TrFE) film were also confirmed to be 88.3 and 164.7 °C, respectively, by differential scanning calorimetry (DSC) analysis in ESI,† Fig. S4.
The changes in the ferroelectric domains as a function of the annealing temperature were also confirmed by using X-ray diffraction (XRD) and atomic force microscopy (AFM) analyses, as shown in Fig. 2d and e. In each of the XRD patterns measured for the P(VDF-TrFE) samples annealed at four different temperatures, the peaks related to the β-phase domain were observed near 2θ = 20°.33 As the annealing temperature was increased from 60 to 140 °C (below the melting temperature), we observed a gradual increase in the peak intensity with a corresponding decrease in the full width at half maximum (FWHM) from 2.2 to 0.8°. These changes support the previous prediction that the size of the β-phase domains increased. The decrease in the peak intensity and the slight increase in FWHM that appeared at 180 °C indicate the reduced proportion of the β-phase domain relative to the entire P(VDF-TrFE) film and the decreased size of the β-phase domains, respectively. These changes experimentally support the previous explanation on the basis of the transformation from the β-phase to the α- and γ-phases above the melting temperature. The AFM images of the P(VDF-TrFE) surface also reflect these changes in the ferroelectric domains: (i) below the melting temperature, the β-phase domains continuously grew as the annealing temperature increased and (ii) above the melting temperature, the area occupied by the β-phase diminished because of the formation of additional α- and γ-phases above this temperature (Fig. 2e). To quantitatively evaluate the relationship between the annealing temperature and the size and distribution of ferroelectric domains, we then extracted the mean (μ) and RSD (standard deviation (σ)/mean (μ)) values for the domain sizes from the AFM images (Fig. 2f). The mean and RSD values for the ferroelectric domain sizes increased from 1.2 × 103 to 6.8 × 103 nm2 and from 1.0 to 16.6, respectively, as the temperature increased from 60 to 140 °C. At 180 °C, i.e., above the melting point, the mean value decreased by 13.2% (6.8 × 103 → 5.9 × 103 nm2) compared with the value at 140 °C, and the RSD value increased by 94.6% (16.6 → 32.3). These changes are different from the changing trend in the mean and RSD values in the temperature range below the melting point. This is because certain parts of β-phase domains are unevenly converted into α- and γ-phases, resulting in a larger size difference among the ferroelectric domains. The ID–VG characteristics of the P(VDF-TrFE) FeFET synapse samples annealed at the four different temperatures and the relevant device performance analysis are provided in ESI,† Fig. S5.
In addition to the characteristics of the ferroelectric layer, the carrier barrier height at the MoS2–metal junction is another important factor that affects the performance of the FeFET synapse. We thus investigated the extent to which different metal contacts on the MoS2 channel influence the operation of the FeFET synapse. Ti, Cr, and Pd contacts, respectively, were applied to the source/drain components of the FeFET synapse, where the P(VDF-TrFE) layer was formed at 140 °C to maximize the size and portion of the β-phase domains. The ID–VG characteristic curves measured for the devices with these three distinct contacts presented different shapes, as shown in the upper left panel of Fig. 3a. This is because the barrier height differed with respect to the metals and consequently the source metals provided a different number of carriers to the MoS2 channel (upper right panel of Fig. 3a). In particular, the different metal contacts affected the levels and ratio of the two current points (at VG = 0 V) that appeared because of the counter-clockwise (CCW) hysteresis resulting from the polarization of the ferroelectric layer. We extracted two conductance (GFeFET) values at VG = 0 from each ID–VG curve and then plotted the values in Fig. 3a (lower panel), where GFS (or GRS) denotes the forward (or reverse) sweep conductance and GRS/GFS means the ratio of the reverse conductance to the forward conductance. Replacement of the Ti contact by Pd lowered the GFS and GRS from 260 and 360 nS (Ti contact) to 0.27 and 3.6 nS (Pd contact), respectively, but the GRS/GFS ratio increased from 1.4 (Ti contact) to 13.0 (Pd contact). This is because of the increase in the barrier height for injecting carriers (for reference, the effective barrier heights for Ti, Cr, and Pd contacts on MoS2 were confirmed experimentally as 50, 80, and 400 meV).40–42 We then applied 128 excitatory and 128 inhibitory spikes (VWC = 5 and −5 V, td = 10 ms, 2 Hz) to investigate the LTP/D characteristics of the three kinds of FeFET synapses. As shown in Fig. 3b, the LTP/D curves were distributed in different current ranges and especially showed that the synaptic properties, such as the DR (GMAX/GMIN), NL, number of effective states, and symmetricity, were affected.
The DR was linearly dependent on the GRS/GFS ratio, where the GMAX/GMIN value of the Pd contact device (8.97) was much higher than the values of the Ti and Cr devices (2.15 and 4.86, respectively) owing to the larger GRS/GFS ratio, as shown in Fig. 3c. In addition, the three kinds of contacts affected the NL of the LTP/D characteristic curves. To compare the NL values, we normalized the LTP/D curves with the highest PSC values and then extracted the NL values, as shown in Fig. 3d. In the LTP region, the NL was observed to have improved in the Pd-contacted device (β = 2.02) compared with the Cr and Ti devices (β = 4.05 and 5.93, respectively). In the LTD region, the NL value was independent from the contact metals, and was slightly degraded from 3.25 (Ti) to 3.31 (Cr) and then to 3.42 (Pd). Details on the calculation of NL are provided in ESI,† Fig. S6. The application of excitatory pulses to the WCT in the FeFET synapse resulted in the dipoles in the P(VDF-TrFE) capacitor (CFE) aligning with the direction of the applied E-field, creating a depolarization E-field (EDP) in the direction opposite to the applied excitatory E-field (EEX), as shown in Fig. 3e.39 Because the EDP is negligible in the initial part of the LTP region, the EEX mostly contributes to the dipole switching E-field (ESW), resulting in a sufficient increase in the surface potential (ΨS) (stage 1 → 2). However, continued application of the excitatory pulses causes the EDP to increase such that it suppresses the dipole switching ability of the applied pulses. This consequently reduces the extent to which ΨS increases (stage 2 → 3 → 4), as shown in Fig. 3e (lower left panel). Owing to this decrease in the rate whereby ΨS increases, upward-bending LTP curves were observed in all of the FeFET synapses. Especially, for the devices with low barrier contact metals such as Ti and Cr, the upward-bending behavior appeared to be more severe. This is because the direct and/or Fowler–Nordheim (F–N) tunneling mechanism contributes to the increase in the PSC and the tunneling current is expected to dominate the thermionic current.43 In the case of the Pd-contacted device with its relatively high injection barrier, the upward-bending phenomenon was much less pronounced than the other devices because the thermionic emission mechanism is expected to contribute to the increase in the PSC dominantly.44 The application of inhibitory pulses to the P(VDF-TrFE) capacitor (CFE) with maximized depolarization results in an external inhibitory field (EIH) with the same direction as the EDP, instantly inducing the largest ESW to the ferroelectric capacitor. This has the effect of abrupt dipole switching in the ΨS curve, as shown in the lower right panel of Fig. 3e (stage 4 → 5). Immediately after the application of the first inhibitory pulse, because of the large residual EDP that still remains, the ΨS is predicted to decrease considerably (stage 5 → 6). Finally, the extent to which the ΨS decreases with respect to the subsequent inhibitory pulses is lowered because of the continuous decrease in the EDP (stage 6 → 7). Consequently, downward-bending LTD curves are observed in all of the FeFET synapses. The LTD characteristic curves of the low barrier metal-contacted devices (Ti, Cr) were bent similarly to that of the high barrier metal-contacted devices (Pd), resulting in analogous nonlinearities. We then investigated the effective conductance-state ratios with respect to the contact metals (Fig. 3f (upper panel)). The effective conductance-state ratio was defined as the ratio of the number of conductance states in which ΔG exceeded a certain percentage of Gmax/Gmin to the total number of conductance states (Fig. S7, ESI†). That is, insufficient conductance changes (ΔG) that do not affect the recognition rate were excluded. The effective conductance-state ratios of all of the FeFET synapses decreased as the ΔG threshold increased. The ratio of the Pd-contacted device was maintained at 82.4% even when ΔG increased to 0.2% because of its excellent linearity, whereas those of the Ti and Cr-contacted devices were as low as 65.2% and 69.9%, respectively. In addition, the symmetricity between the LTP and LTD curves was investigated with respect to the contact metals, as shown in Fig. 3f (lower panel). The method that was used to estimate the symmetricity is provided in ESI,† Fig. S8. The symmetricity value of the Pd-contacted device (5.5) was much higher than those of the Ti- and Cr-contacted devices (2.5 and 3.4, respectively) owing to the enhanced linearity in the LTP region. Overall, the synaptic performance of the Pd-contacted device was superior to that of the Ti- and Cr-contacted devices in terms of the DR, NL, effective conductance-state ratio, and symmetricity.
In an attempt to understand the way in which these synaptic characteristics affect the training and inferencing tasks of the multi-layer perceptron NNs (ML-NNs) composed of the FeFET synapses, we used the synaptic characteristics to construct an ML-NN and then conducted training/inferencing simulation with the MNIST fashion dataset, as shown in Fig. 4a.13,45 Our dataset included 10 categories of fashion items and consisted of a training set of 60000 example images and a test set of 10000 example images. A fashion MNIST image has 28 × 28 pixels, of which the grayscale values range from 0 to 255.46 As shown in Fig. 4b, the ML-NN consisted of one input layer (IL), one hidden layer (HL), and one output layer (OL), where the layers were composed of 784 input neurons, 100 hidden neurons, and 10 output neurons, respectively. Additionally, each neuron in one layer was connected to every neuron in the next layer via the FeFET synapses. The weight of the FeFET synapse used to connect the i-th neuron in the IL with the h-th neuron in the HL was indicated as WIHi,h. Similarly, the weight of the synapse between the h-th neuron in the HL and o-th neuron in the OL was WHOh,o. The 128 conductance states were normalized by the maximum state value and the normalized conductance values were used as the synaptic weights. The training process consisted of two key operations: feed-forward (FF) and backpropagation (BP). In the FF operation, 784 input voltage signals (VILi for the i-th signal in the IL) with values between 0 and 1 were obtained from the 28 × 28 pixels of the fashion images, and those input signals were applied to the corresponding input neurons. They were consequently transformed to 100 weighted sum currents ( for the h-th current in the HL). After that, the 100 hidden voltage signals (VHLh = f(IHLh) for the h-th signal in the HL) were obtained via the sigmoid activation function . The hidden signals were then converted to 10 output voltage signals by the weighted sum operation ( for the o-th current in the OL) and the sigmoid activation function (). Finally, the 10 output signals (VOLo for the o-th signal in the OL) were compared with each label value (Ko) to calculate the error (Eo = Ko − VOLo). In the BP operation, the errors propagated backward from the OL to the IL, subsequently updating the synaptic weights in a propagating way by minimizing the errors by the adaptive moment estimation (ADAM) algorithm.47 For a reference, we modified the distribution of the synaptic weights such that they were between −1 and 1 by using the following equation, WMOD = 2 × W − 1.
We then trained the ML-NN with the 60000 training images from the MNIST fashion dataset and extracted the recognition rates after every 10000 training step (1 epoch) with the 10000 images in the testing dataset. The left panel of Fig. 4c shows the recognition rate as a function of the number of epochs with respect to the annealing temperature for the P(VDF-TrFE) film. After sufficient training over 50 epochs, the ML-NN constructed with FeFET synapses annealed at 140 °C demonstrated a high recognition rate of 74.7%, which is slightly lower than that of the SW-NN (86.8%). In comparison, the recognition rates of the other ML-NNs did not even reach 50% (11.4%, 24.1%, and 40.9% for the FeFET synapses annealed at 60 °C, 100 °C, and 180 °C, respectively). This is because the relatively superior ferroelectric characteristics of the P(VDF-TrFE) layer annealed at 140 °C enabled synaptic operation in a wider DR. As evidence thereof, we confirmed the highest DR value of 8.7 in the 140 °C FeFET synapse sample after ten LTP characteristic measurements (Fig. 4d). The FeFET synapse based on the P(VDF-TrFE) annealed at 180 °C underwent considerable fluctuation at the PSC level and in its recognition rate. This is considered to be caused by the large variation in the size of the ferroelectric domains (see Fig. 2). We then plotted the maximum recognition rate according to the corresponding DR in Fig. 4d (lower panel), where we confirmed that a DR greater than 6.3 is required to achieve a recognition rate of over 50% in the proposed ML-NN. Similarly, the recognition rates of the ML-NN based on the FeFET synapses annealed at 140 °C were investigated to examine the effect of the contact metal (Fig. 4e). The maximum recognition rate for the Pd-contacted synapse was approximately 75.2%, which was slightly lower than that of the SW-NN (86.8%). However, the rates of the NNs formed by the Ti- and Cr-contacted synapses were 11.1% and 35.1% because of their lower DR and NLLTP (2.15 and 5.93 for the Ti-synapse; 4.86 and 4.05 for the Cr-synapse). In comparison, the corresponding values for the Pd-synapse device were 8.97 (DR) and 2.02 (NLLTP). We then conducted additional simulations with ML-NNs to examine the effects of the DR and NL on the recognition rate independently (Fig. 4f): (i) the NL of the Pd-synapse was applied to the other Ti-/Cr-synapses (upper panel). (ii) The DR of the Pd-synapse was used for the Ti-/Cr-synapses (lower panel). As shown in Fig. 4f (upper panel), the recognition rates for the Ti- and Cr-synapses increased slightly owing to the improved NL value; the rates increased by 10.0%p and 13.2%p, respectively. When the DR of the Pd-synapse was used for all of the synapses, the recognition rates were enhanced more markedly from 11.1 to 51.3% (40.2%p) and from 35.1 to 63.5% (28.4%p) for the Ti- and Cr-synapses, respectively. Additional simulation results are provided in ESI,† Fig. S9. Overall, we could confirm that the influence of the DR on the recognition rate of the ML-NNs we used was greater than that of the NL. In addition, we conducted simulations on ML-NNs using MNIST handwritten digits dataset as shown in ESI,† Fig. S10. The maximum recognition rate for the Pd-contacted synapse was 92.1%.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0nh00559b |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2021 |