Jingxian
Li
ab,
Yuchao
Yang
*a,
Minghui
Yin
a,
Xinhao
Sun
a,
Lidong
Li
*b and
Ru
Huang
*a
aKey Laboratory of Microelectronic Devices and Circuits (MOE), Institute of Microelectronics, Peking University, Beijing 100871, China. E-mail: yuchaoyang@pku.edu.cn; ruhuang@pku.edu.cn
bState Key Lab for Advanced Metals and Materials, School of Materials Science and Engineering, University of Science and Technology Beijing, Beijing 100083, China. E-mail: lidong@mater.ustb.edu.cn
First published on 30th August 2019
Artificial synapses and neurons are recognized as key elements in building bioinspired, neuromorphic computing systems. However, synaptic and neuronal elements that have compatible material systems with each other with high scalability and biorealistic dynamics are yet to be demonstrated. Here we report a two-terminal memristive synapse that can realize short-term and long-term plasticity in both potentiation and depression processes. The Ag nanoclusters introduced at the interface can move, connect and redistribute in response to applied pulses, where their electrochemical migration and thermodynamic relaxation in dielectrics compete with each other and faithfully emulate the synaptic and neuronal dynamics in biology, which in turn allows the same devices to exhibit various synaptic functions and neuronal spiking in a scalable manner. The evolution dynamics of Ag nanoclusters was verified using high resolution transmission electron microscopy and compositional analyses. Based on the inherent state modulator and timing mechanism offered by such dynamics, the devices were able to naturally implement complex functions including metaplasticity, asynchronous classical conditioning and spike-timing-dependent plasticity without needing intentionally designed overlapping pulses, thus paving the way for the construction of intelligent neuromorphic systems capable of encoding and processing spatiotemporal information.
New conceptsFor the first time, we report a two-terminal synapse that can realize short-term and long-term plasticity in both potentiation and depression processes. The Ag nanoclusters introduced at the interface can move, connect and redistribute in response to applied pulses, where their electrochemical migration and thermodynamic relaxation in dielectrics compete with each other, which in turn allows the same devices to exhibit various synaptic functions and neuronal spiking in a scalable manner. This is a significant and sufficient step forward compared with existing studies: (1) the present devices can realize short-term and long-term plasticity in both potentiation and depression, which has been proved crucial for tuning information transmission and network dynamics; (2) using a simple two-terminal and scalable structure, the present memristive synapses faithfully emulate the Ca2+ dynamics in biology and provide an inherent state modulator and timing mechanism, leading to complex synaptic functions including metaplasticity, asynchronous classical conditioning and STDP, without needing intentionally designed overlapping pulses; (3) the asynchronous classical conditioning does not require conditional and unconditional stimuli to be presented synchronously, in analogy to the neurobiological mechanism and allowing higher robustness; (4) synaptic and neuronal functions can be achieved in the same device in a scalable manner, ensuring great compatibility. |
Here we report a two-terminal memristive synapse that can realize STP and LTP in both potentiation and depression, where Ag nanoclusters (NCs) are introduced into the electrode/dielectric interface. The nanoclusters can move, connect and redistribute in response to applied pulses, where their electrochemical migration and thermodynamic relaxation in dielectrics compete with each other and faithfully emulate the calcium ion dynamics in biology, which in turn allows the devices to exhibit various synaptic functions. The evolution dynamics of Ag nanoclusters was verified using high resolution transmission electron microscopy (TEM) and compositional analyses. Based on the inherent state modulator and timing mechanism offered by such dynamics, the devices were able to naturally implement complex functions including metaplasticity, asynchronous classical conditioning and spike-timing-dependent plasticity (STDP) without needing intentionally designed overlapping pulses. These scalable yet powerful memristive synapses with rich and asynchronous plasticity in both potentiation and depression are crucial for robust learning with tunable information transmission and network dynamics, thus paving the way for the construction of intelligent neuromorphic systems capable of encoding and processing spatiotemporal information.
The synaptic behaviors of the devices were studied by pulse measurements, where a read voltage of 50 mV was used to detect the synaptic state, as shown in Fig. 1f. One can see an abrupt increase in current upon application of a voltage pulse with an amplitude of 0.5 V and duration of 100 ms, followed by a gradual decay of the current back to the resting state over a time course of ∼0.2 s after removing the pulse, coinciding with the excitatory postsynaptic current (EPSC) behavior in biological synapses.9 The time scale of ∼0.2 s implies that the synaptic behavior falls into short-term potentiation, which however can be manipulated and switched to long-term plasticity by varying the pulse amplitude (Fig. 1f–i). As shown in Fig. 1f–h, the relaxation time scale increases as the amplitude of the pulses increases (0.5, 0.6 and 0.7 V), with the pulse width fixed at 100 ms. When the pulse amplitude reaches 0.8 V, a transition from STP to LTP is observed, as shown in Fig. 1i. Interestingly, not only can short-term and long-term potentiation be achieved, but our devices also demonstrate capability in implementing short-term and long-term depression as well as a controlled transition between them. Fig. 1j clearly shows that application of a negative pulse of (−0.3 V, 100 ms) induced short-term depression (STD) in the device, where the conductance decreases first after the negative pulse arrives but recovers to the resting state over a time course of ∼2 s. The relaxation time scale of STD once again increases as the amplitude of the negative pulse increases (Fig. 1k), and a transition from STD to long-term depression (LTD) is observed when the pulse amplitude reaches −0.4 V (Fig. 1l). To the best of our knowledge, this is the first demonstration of artificial synapses with STP and LTP behaviors that can be achieved in both potentiation and depression processes. Such capability is deemed indispensable for the construction of neuromorphic systems with complex cognitive functions, such as working memory mediated reasoning and decision making in the human brain.26,27 These short-term effects are the key for enabling information transmission and network dynamics, including adaptation, temporal filtering, gain control, damped oscillation, state hopping with transient population spike, rotating bump state, robust self-organized critical activity, induction of mobility in network states, and enrichment of attractor dynamics, etc. For example, STD may generate a mechanism to hold sensory memory and a mechanism for memory searching.28
To shed light on the mechanism of the above STP and LTP behaviors, we performed detailed high resolution TEM and energy dispersive X-ray spectroscopy (EDS) characterization, as shown in Fig. 2. Fig. 2a–c exhibits cross sectional TEM images and corresponding EDS mapping at the Ag L edge from a pristine Ta/Ag-NCs/Ta2O5/Pt/Ti device, where the inclusion of Ag nanoclusters at the top interface can be confirmed (Fig. 2a–c), as schematically depicted in Fig. 2d. Fig. 2e–g further shows TEM images and EDS mapping of Ag from a device showing short-term potentiation. Compared with the Ag distribution solely at the top electrode interface in the pristine state (Fig. 2a–d), here in Ta/Ag-NCs/Ta2O5/Pt/Ti devices with STP, a low concentration of Ag at the bottom interface can be detected (Fig. 2f and g), implying that Ag has moved through the dielectric in response to the applied electric field and arrived at the bottom interface. Such movement of Ag nanoclusters in dielectrics has been studied in our previous in situ TEM experiments,32 revealing that individual metal nanoclusters in dielectrics behave as bipolar electrodes upon application of an electric field and effective cluster displacement along the field direction can be achieved through a sequence of ionization, Ag+ migration and reduction processes. Such electrochemical dynamics thus forms the basis of Ag cluster redistribution in the present devices. The occurrence of STP indicates that a complete conducting filament has been formed temporarily between the top and bottom electrodes during switching, while the existence of Ag at the bottom interface also suggests that the front of the filament has reached the cathode. However, in order to minimize the surface energy of the Ag filament and reach thermodynamic equilibrium after removing the electrical signals, the filament has spontaneously broken into discrete nanoclusters as depicted in Fig. 2h. This thermodynamic process, in combination with the preceding electrochemical redox reactions and ion migrations, accounts for the short-term or volatile effect in the memristive synapses. The robustness of the Ag filament is associated with the amount of Ag involved during the stimulus. In previous studies, Ag films were widely used as the top electrode, which requires an initial forming process (a strong stimulus) to drive a large quantity of Ag into the dielectric and the Ag filaments formed tend to be strong. Here, we limit the amount of Ag that participates in the resistance switching by replacing Ag films with Ag nanoclusters, thus forming a thin Ag filament. It should be noted that the incorporation of Ag-NCs actually leads to locally enhanced electric fields around the Ag-NCs, which may therefore promote the Ag cation injections therein and in turn confine the filament locations.33Fig. 2i–k further exhibits cross sectional TEM images and corresponding EDS mapping at the Ag L edge from a Ta/Ag-NCs/Ta2O5/Pt/Ti device showing long-term potentiation. Different from the pristine and STP states, herein the existence of high Ag concentration at the bottom interface can be clearly identified and a certain Ag concentration was also detected in the dielectric, as shown by the high resolution TEM image (Fig. 2i) and EDS mapping (Fig. 2k). Although the thermodynamic requirement of minimizing surface energy still breaks the continuous filament(s) into clusters, the high concentration of Ag at both the top and bottom interfaces and the noticeable concentration of Ag in the dielectric has reduced the effective thickness of the dielectric, which is ∼10 nm in the as-prepared state but ∼4 nm after the extensive Ag incorporations (Fig. 2i–l and Fig. S5, ESI†). Such gap distance allows electric tunneling to take place, as further verified by the weak temperature dependence of device conductance (Fig. S6, ESI†),30 which therefore leads to the observed transition from STP to LTP (Fig. 1i and l). We have also performed a control experiment where the Ag nanoclusters were replaced by Pd nanoclusters in a Ta/Pd-NCs/Ta2O5/Pt/Ti structure. The corresponding TEM and EDS results showed that the Pd nanoclusters were immobile under similar field intensity and thus still concentrated at the top electrode interface (Fig. S7, ESI†). The different behaviors of Ag-NCs and Pd-NCs highlights the role of mobile Ag-NCs in enabling the STP and LTP behaviors.
The above dynamics of Ag nanoclusters in the present memristors bears striking similarity with synaptic Ca2+ dynamics, as depicted in Fig. 1b–e. In biological synapses, Ca2+ dynamics is responsible for both STP and LTP, forming the basis of memory and learning.34–36 While STP is linked with transient enhancement of synaptic transmission caused by the Ca2+ influx through N-methyl-D-aspartate receptors (NMDAR) as shown in Fig. 1c, the transition from STP to LTP through repeated or strong stimulations originates from the Ca2+ accumulation inside the postsynaptic membrane by changing the number and/or conductance of α-amino-3-hydroxy-5-methyl-4-isoxazole-propionic acid receptors (AMPAR) (Fig. 1e). Similarly, in the Ta/Ag-NCs/Ta2O5/Pt/Ti memristor mediated by the redistribution and evolution of Ag nanoclusters, a single pulse with low amplitude can only excite a small amount of Ag into the dielectric (Fig. 1b), which resembles the influx process of Ca2+ ions, and a weak conduction channel might be formed consequently. After the electrochemical migration of Ag during the applied pulse, the clearance of the bridging Ag nanoclusters from the dielectric driven by the thermodynamic minimization of the surface energy of Ag filament is analogous to the extrusion process of Ca2+via plasma membrane Ca2+-ATPase (PMCA), hence leading to STP (Fig. 1f–h and 2e–h). However, if a stronger stimulation is applied or subsequent pulses arrive before the nanoclusters diffuse away, more Ag will be driven into and get accumulated in the dielectric through electrochemical processes, leading to further growth of the conduction channel (Fig. 1d), akin to the persistent influx of Ca2+ ions. Once enough Ag has entered the dielectric and arrived at the bottom interface, the conduction channel can retain its high conductance even after thermodynamic relaxation (Fig. 1i and 2i–l), which bears similarity with the accumulation of Ca2+ inside the postsynaptic membrane and the long-term modification of synaptic strengths by changing the number and/or conductance of AMPAR (Fig. 1d and e), accounting for LTP. If the device starts from a potentiated state instead, a negative voltage pulse applied on the device can drive the Ag nanoclusters already accumulated in the dielectric and at the bottom interface toward the top electrode, thus causing synaptic depression. However, since the Ag concentration at the top interface is still much higher (Fig. 2k), the overall concentration gradient will result in backward diffusion of Ag after removing the negative pulse, leading to recovery of the synaptic weight and thus the short-term depression (Fig. 1j and k). If a stronger negative pulse or a train of negative pulses is applied instead, a permanent reduction of Ag concentration in the dielectric and at the bottom interface can be triggered, giving rise to the transition from STD to LTD (Fig. 1l). All the above synaptic behaviors can therefore be consistently explained in the same physical picture, based on the competing electrochemical and thermodynamic processes and the resultant evolution of Ag nanoclusters.
In addition to the pulse amplitude modulated plasticity shown in Fig. 1f–l, the time scale of synaptic plasticity can also be manipulated by the number or rate of the applied pulses, which can be readily expected from the above physical mechanism. Taking the potentiation process as an example, Fig. 3a–c exhibits a transition from STP to LTP as the pulse number increases from 1 to 20, while the pulse amplitude, width and period are fixed at 0.45 V, 100 ms and 200 ms, respectively. In addition, we also show that EPSC and STP to LTP transition can be implemented by pulses within higher amplitude of 0.7 V and shorter width of 10 μs, as illustrated in Fig. S8 (ESI†). Furthermore, when a successive pulse train containing 10 pulses (0.2 V, 20 ms) is applied, the plasticity then largely depends on the stimulation rate or pulse interval. One can see that the amplitude of the EPSC current increases when the pulse rate increases from 0.2 to 20 Hz (Fig. 3d–f), once again bearing similarity with the spiking rate dependent plasticity (SRDP) or dynamic filtering characteristics in biological systems.37,38 The present devices can also mimic the paired-pulse facilitation (PPF) behavior,39 another important short-term phenomenon for temporal information processing. One can see from Fig. 3g and h that a larger pulse interval leads to a smaller conductance enhancement, and the red line represents fitting results using the double exponential decay function. All these behaviors in Fig. 3a–h can be consistently interpreted by the competing effects between inward electrochemical migration of Ag during the pulses and outward thermodynamic diffusion during the pulse intervals.
Such electrochemical and thermodynamic processes of Ag naturally constitute an integration and leaking mechanism, which can be further used to emulate the leaky integrate and fire dynamics in biological neurons, as shown in Fig. 4a,3 provided that a reset signal is applied after neuronal firing to return the artificial neuron back to its resting state. We have tested the neuronal activity using different pulse intervals (Fig. 4) and found that the number of stimulations required to fire the neuron monotonously increases (7, 10, 17, 22) as pulse interval increases (0.1, 0.2, 0.5, 1 ms), thus demonstrating the leaky characteristic of the neuron during the pulse intervals. Consecutive cycles of neuron activity also showed that a rest period of 13.5 ms after neuronal firing can return the artificial neuron back to its resting state (Fig. S9, ESI†), even without a reset operation. Since the synaptic and neuronal functions are achieved in the same device structure, a compatibility in material systems and processes is ensured for the construction of hardware neural networks.
Notably, the first pulse in Fig. 4a did not result in visible change in the current, although the migration of Ag as a result of this pulse contributes to the subsequent apparent increase in device conductance. Herein, the first voltage pulse is equivalent with an internal modulator that regulates the following synaptic efficacy, which is reminiscent of metaplasticity in biological systems, a higher-order form of plasticity highlighting that the previous history of synaptic activities, even without affecting the efficacy of normal synaptic transmission, also plays a significant role in subsequent synaptic plasticity.40,41 The existence of metaplasticity in the Ta/Ag-NCs/Ta2O5/Pt/Ti synapses is further validated in Fig. 5a–d. One can see from Fig. 5a that the application of a programming pulse (0.5 V, 100 ms) triggers an EPSC with a magnitude of 50 μA, which then gradually decays over time and returns to the resting state in ∼0.3 s. When a weaker voltage pulse of (0.2 V, 100 ms) was applied, no obvious response in the device current can be detected after removing the pulse (Fig. 5b). However, the seemingly weak pulse can be adopted as an internal modulator of synaptic efficacy, where an enhanced EPSC has been successfully triggered by an identical programming pulse of (0.5 V, 100 ms) with Fig. 5a, if a preceding modulator pulse of (0.2 V, 100 ms) is applied (Fig. 5b–d). Moreover, the enhancement of the EPSC is a function of the interval between the modulator pulse and the programming pulse (Δt), where a decrease in Δt leads to higher amplitude and longer relaxation time in EPSC (Fig. 5b–d). A relatively long-term potentiation is observed when Δt decreases to 0.5 s, thus highlighting the role of the modulator pulse in manipulating the internal device state.
The metaplasticity demonstrated in Fig. 5a–d exhibits two important characteristics, internal state modulation and inherent timing mechanism, and they can be further utilized to implement important learning rules for spiking neural networks. Pavlov's dog is a famous example of classical conditioning or associative learning, which can be realized even on the synaptic level without sophisticated neural hardware, as discovered in Aplysia.42–46 In general, classical conditioning involves associating a stimulus that evokes a measurable response with a second stimulus that normally does not evoke this response. The first type of stimulus that normally evokes the response is called the unconditional stimulus (US), because no training (conditioning) is required for it to yield a response. The second type of stimulus that normally does not evoke the same response is called the conditional stimulus (CS), where training (conditioning) is required before it can yield the response. Although classical conditioning has been demonstrated previously using memristors by synchronously presenting CS and US during training,47,48 it is important to point out that classical conditioning in biology is asynchronous and has a more relaxed timing requirement, that is, conditioning will occur not only when the US and CS are presented simultaneously but also when the CS precedes the US by a short interval (≤0.5 s),45,49 as illustrated in Fig. 5f. Such asynchronous learning is certainly more robust. To emulate such asynchronous classical conditioning, the synaptic elements must have an internal modulator that times the interval between CS and US, which can be readily implemented using our Ta/Ag-NCs/Ta2O5/Pt/Ti devices, as shown in Fig. 5g–i. Herein, a pulse of (0.2 V, 100 ms) was used as the CS, which cannot evoke the response (defined as a current exceeding 10 μA) by itself, while a pulse of (0.5 V, 100 ms) was set as the US, which can yield the response by itself but without causing a long-term change. Therefore, when US precedes CS by 0.5 s, no long-term change in the device conductance is triggered after training, so CS itself still cannot evoke the response (Fig. 5g). In stark contrast, when CS was ahead of US by 0.5 s (Fig. 5h), the experimental condition is identical with that in Fig. 4d and a long-term potentiation can be obtained. As a result, the CS pulse was able to evoke the response after training (Fig. 5h). When the interval between CS and US was reduced to zero, namely the CS and US were presented synchronously, successful conditioning can also be achieved (Fig. 5i). Such associative learning demonstrated in Fig. 5g–i once again originates from the internal state modulation and timing mechanism as a result of Ag cluster dynamics, which will be important for building robust hardware networks. For instance, the classical conditioning could be used to learn and detect temporal correlations in event-based data streams based on unsupervised learning (see the illustration in Fig. S10 and related discussions, ESI†).
Last but not the least, the internal state modulation and inherent timing mechanism can also naturally lead to STDP, another important timing-based learning rule in biological systems,50 where the relative timing between the pre- and postsynaptic spikes determines whether the synaptic weight will be potentiated or depressed and by how much. In neurobiology, the relative timing information between the spikes is natively embedded, e.g. by the natural decay of Ca2+ levels providing an internal timing mechanism. To achieve STDP in hardware with similarly simple, non-overlapping pre- and postsynaptic pulse pairs, a natural timing mechanism in the synaptic element is demanded as well, which can be readily achieved by the diffusion dynamics of Ag. Fig. 6a–e shows the implementation of STDP in the present devices, where the pulse pair used contains a positive pulse of (0.3 V, 100 ms) representing the effect of a presynaptic spike and a negative pulse of (−0.45 V, 100 ms) representing the postsynaptic spike, applied both on the top electrode (Fig. 6d). Since the preceding positive (negative) pulse will cause short-term potentiation (depression) when Δt > 0 (Δt < 0), it is expected that the net effect of the pulse pair is determined by the residue effect of the preceding pulse, which increases as the absolute value of Δt decreases. Taking the case of Δt > 0 as an example, the residue effect of the preceding pulse becomes negligible when Δt is large enough (e.g. 1 s), resulting in little conductance change after the pulse pair (Fig. 6a). However, when Δt decreases, the residue effect of the preceding pulse counteracts with the contribution of the second pulse, resulting in a net synaptic potentiation (Fig. 6b and c). This eventually gives rise to the implementation of the STDP learning rule, as shown in Fig. 6e. It is worthwhile pointing out that the asymmetric pulse pair (0.3 and −0.45 V) could increase the complexity of peripheral circuitry during STDP learning. However, this might be addressed by further engineering the interfacial property especially the bottom Ta2O5/Pt interface and forming a higher Schottky barrier therein. As a result, the Ta2O5/Pt interface will cause larger voltage drop when it is reversely biased, which can in turn increase the amplitude of the positive pulses required and lead to a symmetric pulse pair during STDP. It should also be noted that when the pulse width and amplitude are reduced, the accompanying relaxation time is expected to decrease accordingly,8 which can therefore lead to reduced time gap Δt during the STDP and increase the training speed.
Footnote |
† Electronic supplementary information (ESI) available: Fig. S1–S10. See DOI: 10.1039/c9mh01206k |
This journal is © The Royal Society of Chemistry 2020 |