Open Access Article
Jung Min Yun,
Yu Bin Kim,
Min Jung Choi and
Seong Jun Kang
*
Department of Materials Science and Engineering, Kyung Hee University, Yongin 17104, Republic of Korea. E-mail: junkang@khu.ac.kr; Tel: +82-31-201-3324
First published on 10th February 2026
Neuromorphic vision systems demand highly efficient optical signal acquisition and adaptable, energy-aware learning capabilities. Optical synaptic transistors have emerged as promising components for in-sensor computing by directly responding to visual stimuli and mimicking core synaptic functions such as excitatory postsynaptic current (EPSC), paired-pulse facilitation (PPF), and both short- and long-term plasticity. However, most devices demonstrate fixed synaptic gain, limiting their ability to adapt learning behavior in response to varying input conditions or computational tasks. Inspired by biological neuromodulation, we present a gate-tunable optical synaptic transistor based on an In–Ga–Zn–O (IGZO) phototransistor that supports both conventional synaptic behaviors and voltage-dependent modulation of learning sensitivity. The device allows pre-conditioning of EPSC amplitude via gate bias prior to optical stimulation, effectively mimicking neuromodulatory gain control. Convolutional Neural Network (CNN) training on the CIFAR-10 dataset shows that higher gate biases improve classification accuracy with higher energy use, while weaker biases reduce energy consumption with an adaptive accuracy tradeoff. Our device integrates core synaptic behaviors with gate-controlled gain modulation, effectively emulating neuromodulation and offering a practical and efficient approach to adaptive neuromorphic vision systems.
New conceptsThis work presents an oxide-based optical synaptic transistor that emulates biological neuromodulation by enabling pre-adjustment of synaptic gain via gate voltage before light stimulation. In biology, neuromodulators such as dopamine regulate synaptic strength, sensitivity, and learning rate without directly triggering activity. Gate-controlled carrier modulation combined with intrinsic oxygen-vacancy traps in IGZO enables tunable optical plasticity and supports both short- and long-term synaptic functions. The intrinsic oxygen-vacancy trap states extend the photoresponse into the visible range and induce persistent photoconductivity, providing the physical basis for adjustable optical gain. Identical optical stimuli yield different synaptic responses depending on the preset gate bias, allowing flexible control of learning behavior. When applied to a convolutional neural network (CNN), higher gate-induced EPSC improved classification accuracy, while lower bias reduced energy consumption. This work demonstrates visible-light-driven synaptic modulation that mimics neuromodulator function using gate control in a single oxide-based optical synaptic device. |
To emulate brain like visual learning, optoelectronic synaptic devices have attracted considerable attention.7,8 These devices combine light sensing and memory functions in a single platform, enabling direct processing of optical signals at the sensor level. This “in-sensor computing” approach removes the need for separate image sensing and processing units, improving speed and reducing energy use.7,9–15 However, despite notable progress, most reported optical synaptic transistors display fixed synaptic gain properties, meaning the strength of the excitatory postsynaptic current (EPSC) is determined only by the intensity or duration of the optical input.16 This limits their adaptability and learning flexibility, especially in systems that require variable learning speeds or stimulus sensitivity.17,18
In contrast, biological synapses show more adaptable and context-sensitive learning behavior through the effects of neuromodulators such as dopamine, acetylcholine, and oxytocin. These molecules do not directly trigger postsynaptic responses but instead regulate the sensitivity and plasticity of synapses either before or during stimulation.19–21 For example, dopamine release can lower the threshold for long-term potentiation (LTP), resulting in greater synaptic weight change in response to the same input. This process, known as neuromodulation, enables context dependent learning, where identical stimuli can produce different learning outcomes depending on the neuromodulatory state of the network.17,22–24 As illustrated in Fig. 1a, such neuromodulators play critical roles in adjusting synaptic gain in accordance with internal or external context.
In the context of neuromorphic computing, this concept can be directly mapped onto our device framework. Optical excitation in the IGZO channel increases carrier density, leading to an excitatory postsynaptic current (EPSC) that functions as the synaptic weight in a neural network. In this analogy, the gate voltage plays a role similar to biological neuromodulators such as dopamine or oxytocin, regulating the degree of carrier accumulation and thereby modulating the effective synaptic gain. As illustrated in Fig. 1b, the gate bias acts as an external modulatory input that tunes the channel state, analogous to how neuromodulators modulate synaptic efficacy in biological systems. Consequently, the amplitude of the EPSC determines the learning strength or weight update in the CNN, establishing a clear correspondence between material-level charge dynamics and algorithm-level learning behavior.
Inspired by this mechanism, we present a gate-tunable optical synaptic transistor enabling pre-adjustment of the postsynaptic response before optical stimulation. Our device uses an indium gallium zinc oxide (IGZO) channel in a phototransistor configuration, in which gate voltages modulate the carrier distribution and synaptic strength. Under the same optical input, the device produces distinct EPSC values depending on the gate bias. The device is sensitive to both blue and green light, enabling synaptic behavior under visible-light excitation. This spectral sensitivity ensures compatibility with practical light sources and supports wavelength-selective, energy-efficient neuromorphic vision operation.25,26 This pre-conditioning enables tunable learning rates, controllable energy–accuracy balance, and flexible synaptic response modulation. To examine its functional impact, we trained a convolutional neural network (CNN) using the CIFAR-10 dataset under different gate voltage conditions.27 Using experimentally measured EPSCs as synaptic weights, stronger gate biases improve learning accuracy, though at the cost of increased energy consumption. Lower gate biases reduce accuracy but enhance energy efficiency, showcasing a biologically inspired tradeoff.28,29 These findings highlight the potential of gate-tunable optical synaptic transistors for enabling adaptive, low-power neuromorphic vision hardware that closely mirrors human learning processes.30
:
1
:
2. The mixed solution was stirred at 80 °C for 1 hour.16
000 images) and test sets (10
000 images). EPSC values measured under different gate biases were normalized and assigned as synaptic weights in a convolutional neural network (CNN). To validate the impact of device-derived synaptic weights, three training scenarios were compared: a baseline model using standard Xavier initialization without device influence, an optimal device mode in which weights were scaled by the normalized EPSC obtained at VG = –3 V, and a low-power mode using EPSC values at VG = –18 V. The experimentally measured EPSC values were not directly assigned as CNN synaptic weights. Instead, the EPSC values obtained under different gate voltages were normalized and used as multiplicative scaling factors applied to the initial weights generated by standard Xavier initialization. Specifically, the initial weights were defined as:All models shared the same architecture (Conv2D: 32 → 64 → 128, Dense: 128 → 10) with tanh activation for all hidden layers, and were trained using stochastic gradient descent with a learning rate of 0.01 and a batch size of 32 for 20 epochs. The experimentally measured EPSC values under different gate voltages were normalized and applied as multiplicative scaling factors to the initial weights of all convolutional and fully connected layers prior to training. After this initialization step, all models were trained using identical backpropagation settings without further device involvement. This approach allows the device-derived EPSC to act as a neuromodulatory prior that biases the initial learning trajectory while preserving identical network architectures and training conditions.
Fig. 2d presents the PPF (%) results across 12 different conditions—six gate voltages under blue light and six under green light. In every case, PPF was observed. As the interval between pulses increased, retention decreased. This result reflects the recombination of photocarriers over time.40–42 On the other hand, Fig. 2e shows the EPSC enhancement measured with increasing pulse lengths.43–46 EPSC enhancement refers to the growth in EPSC area under repeated stimulation. It was calculated using the EPSC areas from single and repeated pulses, and, using the formula:
The data show that in all 12 conditions, EPSC increased steadily with longer pulse duration. This confirms that the device supports cumulative plasticity.43 Even under strong negative gate voltages like –18 V, the EPSC enhancement remained reliable. This suggests the device maintains high sensitivity and stable response. The raw EPSC measurements used in these calculations are shown in Fig. S1 and S2. We further explored gate-tunable learning behavior using repeated training and relearning tests. Fig. 2f and g show EPSC changes during a cycle of 50 light pulses (learning), followed by 50 seconds without stimulation (forgetting), and 10 more pulses (relearning). Blue (405 nm) and green (520 nm) light were used, respectively. In both cases, EPSC showed a clear cycle of increase, decrease, and increase again. This reflects long-term synaptic plasticity like that seen in the brain.46,47 Gate voltage clearly affected the EPSC: it was highest at –3 V and lowest at –18 V. This pattern was consistent under both wavelengths. It supports the idea that gate voltage works like a neuromodulator by setting synaptic sensitivity before the light input. Fig. S3 gives a closer view of the early response under different gate voltages. It shows that the timing and slope of EPSC growth shift with gate control. These observations demonstrate that our device can mimic short-term and long-term synaptic behaviors, such as facilitation, memory retention, and relearning. In addition, it offers adjustable gain control through gate voltage, enabling flexible light-driven synaptic response that can be fine-tuned ahead of stimulation that similar to biological neuromodulation.30,38,43 Furthermore, Fig. S6 shows that the device maintains stable EPSC generation even under periodic blue light pulses delivered at a short 25 ms interval, confirming that the channel responds reliably to high-frequency optical stimulation without signal degradation. Fig. S7 further demonstrates that the EPSC magnitude varies systematically with input light intensity under both blue (450 nm) and green (520 nm) illumination. Higher optical intensity produced larger EPSC increments, indicating that the synaptic strength is jointly determined by gate-controlled carrier density and the density of photogenerated carriers. These results indicate that the interplay between gate-induced carrier depletion and light-generated photocarriers governs the EPSC magnitude, suggesting that carrier dynamics rather than optical intensity alone dictate synaptic strength.
Fig. 3 presents a comprehensive spectroscopic analysis of the IGZO layer, which clarifies the material's composition, defect states, and electronic structure that influence its photoresponsive behavior. Fig. 3a–d show the XPS spectra of the four main elements: In, Ga, Zn, and O. The In 3d peaks are centered around 445 eV and 453 eV (Fig. 3a), Ga 2p peaks appear near 1119 eV and 1145 eV (Fig. 3b), and Zn 2p peaks are located at approximately 1023 eV and 1046 eV (Fig. 3c). These data confirm that the IGZO layer includes all the intended elements, with an atomic ratio of In
:
Ga
:
Zn approximately 6.1
:
1
:
2.48 The O 1s core-level spectrum (Fig. 3d) provides insights into the chemical environment of oxygen in the film.49 It contains three distinct peaks at 530.0 eV, 531.5 eV, and 533.1 eV, which are attributed to lattice oxygen (M–O), oxygen vacancies, and hydroxyl groups (O–OH), respectively. The presence of a oxygen vacancy peak at 531.5 eV indicates defect states within the IGZO.31 These vacancies are widely reported to introduce localized trap states inside the bandgap, which can play a critical role in modulating electrical conductivity and enabling light-induced persistent photoconductivity (PPC).50,51 In particular, trapped carriers can be slowly released after photoexcitation, exhibit a sustained synaptic response resembling biological short-term memory.52,53 The optical absorption properties of IGZO were evaluated using UV-vis spectroscopy. As shown in Fig. 3e, the Tauc plot yields an optical bandgap of approximately 3.37 eV. Notably, absorption starts to increase gradually below this value, suggesting the influence of sub-gap states. This deviation from ideal band-to-band absorption further supports the existence of oxygen-vacancy-induced states that extend the spectral response into the visible region.54 Fig. 3f presents the ultraviolet photoelectron spectroscopy (UPS) results. The secondary electron cutoff gives a work function of 3.83 eV, and the valence band maximum (VBM) is located 3.03 eV below the Fermi level.55 The inset image shows additional states detected at approximately 1.63 eV below the conduction band minimum (CBM), corresponding to oxygen-vacancy-related gap states.56 These states are critical in enabling sub-bandgap optical transitions.57 Based on the energy difference between the gap states and the CBM (∼2.43 eV), the device can absorb green light (∼520 nm, ∼2.38 eV) via defect-assisted transitions. In addition, blue light (∼450 nm, ∼2.76 eV) has sufficient photon energy to excite electrons directly across the bandgap, further contributing to the device's optical response.58 Fig. 3g schematically summarizes the band structure of IGZO constructed from the UPS and UV-vis data. The diagram illustrates the positions of the VBM, CBM, and mid-gap states associated with oxygen vacancies, clarifying the optical excitation pathways under different photon energies. Specifically, defect-assisted transitions enable sub-bandgap absorption of green light, whereas blue light excitation occurs via direct band-to-band transitions. This schematic highlights how oxygen-vacancy-related states expand the effective absorption window of IGZO into the visible region and provide a physical basis for the device's broadband photoresponsive and synaptic behavior. This demonstrates that oxygen vacancy–induced trap states extend the absorption window of IGZO into the visible region, enhancing its suitability for low-energy, broadband optical neuromorphic applications. The correlation between sub-gap optical absorption and oxygen-vacancy-related states confirms that intrinsic defect engineering is a viable route for tailoring optical synaptic gain in oxide semiconductors.
Fig. 4 explains the mechanism behind the gate-controlled synaptic behavior in the IGZO-based optical synaptic transistor. Fig. 4a–c show the band diagrams of the IGZO/SiO2 interface under different gate voltages, and Fig. 4d–f present the corresponding cross-sectional schematics indicating the depletion region. At VG = 0 (Fig. 4a and d), the CBM and VBM in the IGZO layer are nearly flat near the IGZO/SiO2 interface, indicating the absence of significant band bending. The schematic confirms that the depletion region is negligible, allowing easy carrier transport when light is applied. Under these conditions, photogenerated carriers move freely, resulting in the largest EPSC among the three cases. When a moderate negative gate voltage of –3 V is applied (Fig. 4b and e), a slight upward band bending occurs at the IGZO/SiO2 interface, producing a shallow depletion region that partially extends into the IGZO/SiO2 interface. This depletion region moderately restricts carrier movement, so photogenerated electrons are more difficult to transport than at 0 V.59–62 As a result, the EPSC amplitude decreases compared to 0 V, but remains considerable. At a stronger negative gate voltage of –18 V (Fig. 4c and f), the band bending becomes much more severe. The depletion region significantly expands. This large depletion width severely limits the photogenerated carrier movement, strongly suppressing the photocurrent under identical optical input. As a result, the EPSC amplitude becomes very small, even under the same optical stimulation. This gradual expansion of the depletion region explains the changes in EPSC seen earlier in Fig. 2f and g. In this way, the gate voltage can be used ahead of light input to adjust the sensitivity of the device. This analysis clarifies that gate-induced band bending serves as an external control parameter for neuromodulatory gain regulation, providing a direct physical mechanism for adaptive learning behavior.
Fig. 5 evaluates how the gate-tunable synaptic behavior of the proposed IGZO optical transistor affects high-level machine vision tasks using the CIFAR-10 dataset and a CNN model. In Fig. 5a, a schematic illustrates the integration of the device into a learning system. Identical visual input images from CIFAR-10 are processed, and the device's EPSC levels, which vary depending on the gate voltage, are used to set the synaptic weights delivered to the CNN model. The normalized EPSC values for each gate condition were directly applied as scaling factors to the synaptic connections in both the convolutional and dense layers, allowing the gate-dependent device behavior to influence feature extraction and classification processes simultaneously. This connection ensures that the physical characteristics of the device contribute to the learning efficiency throughout the network rather than only during initialization. Here, the CNN is not intended to replicate full hardware-in-the-loop learning. Instead, it serves as a functional-level validation tool to examine how gate-controlled EPSC modulation biases the learning trajectory under identical network architectures and training conditions.
For statistical evaluation, five independent CNN models were trained for each condition using different random seeds, and the resulting accuracies were analyzed in terms of their mean values and standard deviations, as summarized in Fig. S8. The results shown in Fig. S8 indicate that the overall performance trends are preserved despite variations in random initialization. In particular, the device-modulated CNNs under the same gate bias exhibit comparable accuracy distributions across repeated trials, while the baseline CNN trained with standard random weight initialization serves as a consistent reference across the same set of random seeds. Importantly, the relative performance relationships between the baseline model and the device-modulated conditions at −3 V and −18 V remain unchanged across independent runs. These observations confirm that the reported CNN performance reflects a reproducible and statistically consistent trend rather than being strongly influenced by a specific random initialization, thereby supporting the validity of the device-derived weight modulation scheme.
In this study, the device was not used to directly capture visual images. Instead, experimentally measured EPSC values obtained under different gate voltages and illumination conditions were normalized and used as synaptic weight-scaling factors in the CNN model. These normalized weights were then applied to the CIFAR-10 dataset during training and testing, enabling simulation of how the device's gate-dependent synaptic behavior influences learning performance. Therefore, the CNN training and inference were performed through a hardware-inspired simulation using experimentally derived device characteristics rather than direct hardware image sensing. This approach allows evaluation of how gate-dependent synaptic modulation influences learning accuracy and energy consumption in a neuromorphic vision system.10 The CNN model includes three convolutional layers (32, 64, and 128 filters), with max pooling applied after the second and third layers. These layers are followed by two dense layers and a final softmax output layer for classification. Higher EPSC values, induced by less negative gate voltages, result in larger weight values being passed to the CNN, while lower EPSC values lead to smaller weights. Fig. 5b and c show the resulting test accuracies and energy consumption values after 20 training epochs under blue (450 nm) and green (520 nm) light, respectively. In both figures, higher gate voltages lead to greater EPSC enhancement, as shown in Fig. 2, resulting in a clear improvement in CNN classification accuracy. Under blue light (405 nm), the test accuracy was about 96% at VG = −3 V and 60% at VG = −18 V, while under green light (520 nm), it was about 94% at VG = −3 V and 57% at VG = −18 V.38 However, this improvement in classification performance comes at the cost of increased energy consumption. We note that randomly initialized networks may eventually reach comparable accuracy given sufficient training. However, the purpose of this study is not to maximize final accuracy, but to demonstrate that gate-tunable EPSC presets learning sensitivity and enables controllable accuracy–energy tradeoffs under constrained training budgets.
The energy consumption for each optical stimulus was estimated using the following equation:28,29
| Ec = Ipeak × VDS × tpulse |
| This journal is © The Royal Society of Chemistry 2026 |