Eun Chong
Ju
a,
Dong Hwan
Byeon
a,
Jong Min
Lee
a,
Yu-Jung
Cha
bf,
Hyung Gon
Shin
c,
Seongil
Im
c,
Jeong-Wan
Jo
d,
Yong-Hoon
Kim
*e,
Sung Kyu
Park
*a and
Sung Woon
Cho
*b
aDepartment of Intelligent Semiconductor Engineering, Chung-Ang University, Seoul, 06974, Republic of Korea. E-mail: skpark@cau.ac.kr
bDepartment of Advanced Components and Materials Engineering, Sunchon National University, Suncheon, Jeonnam 57922, Republic of Korea. E-mail: swcho@scnu.ac.kr
cDepartment of Physics and Applied Physics, Yonsei University, Seoul 03722, Republic of Korea
dElectrical Engineering Division, Department of Engineering, University of Cambridge, 9 JJ Thomson Avenue, Cambridge CB3 0FA, UK
eSchool of Advanced Materials Science and Engineering, Sungkyunkwan University, Suwon 16419, Republic of Korea. E-mail: yhkim76@skku.edu
fDepartment of Energy Technology, Korea Institute of Energy Technology, Naju, Jeonnam 58330, Republic of Korea
First published on 23rd September 2025
Recently, with the rapid development of autonomous vehicles, intelligent robots, and mobile electronics, retina-inspired neuromorphic photosensors have attracted growing interest as color image processors for machine vision systems. These devices typically mimic essential functions of the human retina, such as multicolor detection and the processing of raw visual information. However, most neuromorphic photosensors have been implemented with heterojunction channel structures or complex circuit architectures, resulting in system complexity, low resolution, and inefficient energy consumption. Here, we propose a neuromorphic phototransistor with a homogeneous channel structure using subgap-engineered metal-oxide (MO) semiconductors. Despite the absence of conventional heterojunction channel architectures, subgap-engineered MO phototransistors, achieved through intentional doping with alkali metal ions, demonstrate broad spectral responsivity and analog conductance modulation in a compact device structure. Particularly, when doped with a small amount of alkali metal ions (Li 5 at%-MO), the device exhibits in-sensor color image processing capabilities, including full-color detection, enhanced analog conductivity, and distinct sensing performance based on the input color. By applying a 7 × 7 neuromorphic phototransistor array using the Li 5 at%-MO semiconductor as the frontend device, innovative refinement tasks of raw color images such as color character sharpness, noise reduction, and contrast enhancement were successfully achieved, significantly contributing to the overall performance enhancement of the machine vision system.
New conceptsWe present the first demonstration of a homostructure neuromorphic phototransistor based on subgap-modulated oxide semiconductors, enabling full-color in-sensor image processing without the need for complex heterojunction architectures. While conventional neuromorphic photosensors rely on multilayer heterostructures for color discrimination, our approach utilizes interstitial lithium doping to precisely engineer shallow subgap states within a singular oxide semiconductor channel, thereby achieving color-selective analog memory functionality. This dual-state modulation mechanism—combining Li-induced shallow traps and oxygen vacancy-induced deep traps—enables non-volatile analog conductance tuning across the entire visible spectrum. Our work replaces structural complexity with electronic subgap control, simultaneously enhancing spectral sensitivity and simplifying device fabrication, and thus introduces a paradigm shift in the design of neuromorphic optoelectronic devices. Furthermore, by integrating these devices into arrays, we demonstrate real-time, energy-efficient, and scalable in-sensor image refinement, offering a compact and low-power alternative for machine vision frontends. Ultimately, by leveraging this frontend device capable of in-sensor color image processing, we achieved a significant improvement in both recognition performance and processing speed in machine vision systems. |
In general, implementing retina-inspired neuromorphic phototransistors has often relied on the heterostructure channel architecture in phototransistors, which typically include a photoabsorber for color light reception, a semiconducting counter layer for photocharge transport, and a hetero-interface region for photocharge storage.10–13 For the visible light absorber and semiconducting counter layer, high-efficiency solar light absorbers with nanostructures and crystalline semiconductors with high electrical mobility have been adopted, resulting in excellent color response and low-power operation. Although neuromorphic color vision sensors with a heterojunction channel structure have been successfully demonstrated, they have suffered from several challenges associated with multilayer stacking and nanoscale material usage, which include complex fabrication processes, device instability, and non-uniformity, hindering the practical realization of neuromorphic image sensor chips. In contrast, metal-oxide (MO) semiconductors such as InGaZnO, InZnO, and ZnSnO enable neuromorphic photosensing in phototransistors without the need for a heterostructure channel, allowing the fabrication of neuromorphic homostructure phototransistors with a singular channel layer.14–16 These capabilities are possibly due to the persistent photoconductivity (PPC) effect, induced by the photoionization of point defects in MO semiconductors.15,17 However, the localized energy states associated with naturally formed oxygen vacancies in MO semiconductors are mainly located in the deep subgap energy levels within their wide bandgap, limiting the photoionization to the high-energy visible light spectrum.15,16 Consequently, MO-based homostructure phototransistors typically exhibit neuromorphic photosensing functions only within a limited spectrum range such as deep blue or ultraviolet (UV) light and often suffer from low sensitivity in most of the visible spectrum range. Due to the absence of shallow subgap states, MO-based phototransistors struggle to achieve key characteristics essential for neuromorphic image processing, such as high sensitivity, analog operation, and color-discriminative conductance changes for refined color image processing. Fortunately, interstitial elements such as alkali metals or protons are well-known to introduce shallow subgap states in MO semiconductors, enhancing their photoresponse in the red and green regions of the visible spectrum.18–22 Therefore, the strategy of doping interstitial elements into conventional MO semiconductors with oxygen defects has great potential to realize full-color neuromorphic phototransistors for in-sensor image processing without heterojunction channel architectures. While extended photoresponse properties in the visible spectrum have been reported in classical photosensor applications, their neuromorphic spectral characteristics and applications for in-sensor image processing have rarely been demonstrated.18
Here, we report the development of MO semiconductor-based neuromorphic phototransistors with full-color sensing and in-sensor image processing capabilities, replicating essential retinal functions. These devices work as frontend processors in machine vision systems, enabling efficient preprocessing of raw color images. To address the inherent spectral limitations of conventional MO semiconductor channels, which arise from the absence of shallow subgap states, we introduce a subgap-modulated homostructure MO semiconductor via alkali metal doping. The incorporation of interstitial alkali metal dopants induces shallow subgap states, thereby extending the photoresponse to the full visible spectrum. Notably, lithium (Li)-doped ZnSnO (ZTO) channels exhibit both shallow subgap states, generated by the interstitial Li atoms, and deep subgap states, associated with oxygen vacancies (VO), enabling enhanced spectral selectivity and neuromorphic functionality. The Li-doped ZnSnO (Li-ZTO) exhibits photo-induced non-volatile conductance switching and analog conductance updating, attributed to the photo-ionization of point defects. Leveraging these properties, we demonstrate a retina-inspired neuromorphic phototransistor including full-color detection with analog memory-like conductance updating functions for simultaneous image sensing and processing. To demonstrate the viability of the developed device for neuromorphic color image processing, a 7 × 7 phototransistor array based on subgap-modulated Li-ZTO was fabricated, enabling direct acquisition and in situ refinement of raw color images. Applying these refined images to pattern recognition tasks markedly enhanced recognition accuracy and computational efficiency in machine vision systems, highlighting the potential of the simplified MO-based neuromorphic phototransistors for advanced in-sensor processing.
![]() | ||
| Fig. 1 Schematics of biological and artificial vision systems for high-efficiency color image recognition. (a) Human visual system and (b) neuromorphic image recognition system. | ||
However, as shown in Fig. 2(a), typical MO-based phototransistors using InGaZnO, InZnO, and ZnSnO indicate limited color sensing, predominantly in the blue spectrum, despite their non-volatile storage characteristics. The narrow spectral response may be attributed to their subgap density-of-states (DOS), primarily determined by oxygen vacancy (VO)-related states. These deep subgap states, centered around 2.3 eV below the conduction band minimum (CBM) with a Gaussian distribution, can limitedly contribute to photo-electron generation under high-energy visible light illumination via the photo-ionization process of VO, following the reaction VO → V++O + 2e− (VO = oxygen vacancy; V++O = doubly ionized oxygen vacancy; e− = photo-electron), as illustrated in Fig. 2(b).23,24 Therefore, conventional optoelectronics utilizing homostructure MO channels with deep-state dominant subgap DOS show partial detection capabilities, mostly for high-energy blue light.15,17 In contrast, as shown in Fig. 2(c), the interstitial-metal-doped MO semiconductors achieve broadband light responsivity due to a modified subgap DOS structure that consists of both shallow subgap states from interstitial metal doping and Vo-related deep subgap states, enabling full-color sensing and photo-induced conductance switching. Specifically, interstitial elements with small atomic radii, such as alkali metals and protons, can introduce additional shallow subgap states near the CBM.18–22 These states facilitate photo-electron generation even under low-energy visible light by compulsory photo-ionization in response to light stimuli, following the reaction Mi → Min+ + ne− (Mi = interstitial metal; Min+ = metal ion in the interstitial site, e− = photo-electron), as illustrated in Fig. 2(d).18,21 Next, after the termination of light illumination, the photo-charges generated during light exposure may undergo a recombination process. In general, the band-to-band recombination process between delocalized photo-charges (i.e., photo-electron and photo-hole) shows spontaneous characteristics (Fig. S1a), characterized by a high recombination probability and low activation energy.25,26 As a result, the created photo-charges are rapidly annihilated upon the termination of light illumination, preventing memory-related behaviors. In contrast, the recombination between delocalized photo-electrons and localized ionized point defects in our device (i.e., V++O + 2e− → VO and Min+ + ne− → Mi) follows a non-spontaneous process, requiring relatively high activation energy and exhibiting a prolonged reaction time with low recombination probability (Fig. S1b).27 Consequently, the photo-induced (updated) conductivity states generated by the compulsory photoionization of point defects persist even after the termination of light exposure, demonstrating the memory-like PPC effects. These conductance switching characteristics, driven by the interstitial metal- and oxygen vacancy-related PPC phenomenon, enable non-volatile memory functionalities over a broad spectrum range, offering new opportunities for neuromorphic vision sensing applications.
Fig. 3(a) illustrates the device structure of a phototransistor employing a Li-doped ZTO (Li-ZTO) channel, fabricated to demonstrate a neuromorphic full-color image processor with a homostructure channel layer. Among various interstitial cation candidates, including alkali metals and alkaline earth metals, Li, owing to its smallest radius, is readily incorporated into ZTO films without significantly altering the microstructure. Furthermore, Li incorporation offers superior compositional control compared to the proton, facilitating precise modulation of the electronic properties of the films. Similar to undoped ZTO, the 10-nm-thick Li (5 at%)-ZTO film exhibits an amorphous phase, as evidenced by the halo ring-shaped selective-area diffraction pattern image, the smooth film surface, and the well-defined channel/gate-dielectric interface (Fig. 3(b) and Fig. S2). Meanwhile, due to theoretical limitations in energy dispersive X-ray spectroscopy using a transmission electron microscope (TEM), direct composition mapping of the Li element is not feasible. Instead, the presence of the Li dopant within the amorphous ZTO thin film was verified through the Li-related peaks in the X-ray photoelectron spectroscopy (XPS) spectrum (Fig. 3(c)). Additionally, photo-excited charge-collection spectroscopy (PECCS) analysis was performed to compare the subgap DOS profiles between undoped ZTO and Li (5 at%)-ZTO films. For the PECCS analysis, monochromatic photons with varying photonic energies, but with a constant light power intensity of 0.1 mW cm−2, were irradiated onto samples (Fig. S3).28,29 As shown in Fig. 3(d), undoped ZTO exhibited a negligible density of deep subgap states near the CBM, while a significant density of subgap states were observed in the energy range of 2.0 eV to 2.6 eV below the CBM. These deep subgap states in amorphous ZTO are attributed to VO defects. In contrast, the Li (5 at%)-ZTO film showed both deep subgap states and a clear emergence of shallow subgap states near the CBM, spanning from 1.2 eV to 1.8 eV below the CBM, demonstrating a successful modification of the electronic structure upon Li incorporation.
Subsequently, the optoelectronic properties of various MO phototransistors employing the homostructure channel layer were investigated to elucidate the impact of Li incorporation on the subgap states of ZTO films. Due to its small atomic radius, Li element can act as either an interstitial or a substitutional dopant in MO semiconductors, depending on the doping concentration.30,31 At low concentrations, as in the case of Li (5 at%)-ZTO, the Li atoms preferentially occupy interstitial sites rather than substitute metal cations (Zn2+ and Sn3+). These interstitial Li dopants (Lii) introduce shallow subgap states near the CBM and generate free electrons via the interstitial doping process, following the reaction Lii → Lii+ + e− (interstitial doping). In fact, as shown in Fig. 3(e) and (f) along with Fig. S4 and S5, the Li (5 at%)-ZTO transistor exhibited negatively shifted threshold voltage (VTh) and increased subthreshold swing (S.S.) values compared to the undoped ZTO transistor, indicating the increase of electron concentration and the generation of shallow subgap states, respectively. Energy band structure analysis using ultraviolet photoelectron spectroscopy and UV-vis spectroscopy further support these observations (Fig. S6). The Fermi level (EF) of the Li (5 at%)-ZTO film was measured at 4.10 eV, which is closer to the CBM than that of undoped ZTO (EF = 4.3. eV). As a result, the Li (5 at%)-ZTO films are expected to show broadband photo-sensing capability, attributed to the presence of a widely distributed subgap DOS structure, incorporating both interstitial Li-induced shallow states and Vo-induced deep states. As shown in Fig. 3(g) and (j), undoped ZTO exhibited a photo-response predominantly limited to the blue (B) light spectrum. This behavior is attributed to the deep state-dominant subgap DOS structure, resulting in photocurrent generation and a negative shift of VTh (ΔVTh = 6 V). Meanwhile, under red (R) and green (G) light illumination, the device indicates negligible photoresponse due to the absence of shallow subgap states. In comparison, the Li (5 at%)-ZTO phototransistor demonstrates broadband RGB detection, facilitated by the widely distributed subgap DOS structure (Fig. 3(h)), enabling high-performance and color-discriminative photo-sensing characteristics such as distinct VTh shifts (ΔVTh = 5.1 V, 7.5 V, and 14.3 V under R, G, and B illumination, respectively) and varying photocurrent responses (IPhoto = 5.68 × 10−11 A, 3.1 × 10−10 A, and 6.66 × 10−8 A under R, G, and B illumination, respectively) (Fig. 3(j) and (k)).
In contrast, excessive incorporation of Li elements [doping concentration > 5 at%] leads to a reduction of color-responsive subgap states. When the Li content exceeds the limit for interstitial doping, additional Li atoms act as substitutional dopants, forming hole carriers and lithium–oxygen (Li–O) bonds via the following reaction: Li → LiZn–O + h+ or LiSn–O + 2h+ (substitutional doping).31 As shown in Fig. 3(i), the EF of the Li(10%)-ZTO films shift further from the CBM (EF = 4.5 eV), indicating a significant reduction in electron carrier concentration. Furthermore, the formation of Li–O bonds suppresses the density of VO-induced deep states owing to the high oxidation potential of Li (Li: 3.04 V) compared to other metal cations such as Zn (0.76 V) and Sn (0.14 V).32 XPS analysis (Fig. S7) further confirms that Li (10 at%)-ZTO exhibits an increased density of metal–oxygen (M–O) bonds and a reduced VO concentration, consistent with the substitutional doping behaviors of excessive Li. Consequently, compared to Li (5 at%)-ZTO, Li (10 at%)-ZTO demonstrates a more positive VTh shift and suppression of S.S. (Fig. 3(f)), validating a reduction in color-responsive subgap states and electron concentration due to substitutional Li doping. These results suggest that, unlike Li (5 at%)-ZTO, the Li (10 at%)-ZTO film is less favorable for achieving broadband or full-color neuromorphic color sensors.
The machine vision system consists of a retina-like frontend processor for image acquisition and preprocessing, and a brain-like backend processor for image recognition. Phototransistors, which enable both color image detection and analog conductance updating, can be used as retina-inspired neuromorphic image processors, especially for key frontend components in machine vision architectures. To assess the potential for neuromorphic color image sensing, the photo-induced conductance state variations of the ZTO-based phototransistors were systematically examined by monitoring drain current variations under dynamic R/G/B-light pulse conditions. The conductance of phototransistors was measured in the subthreshold region (VGS − VTh = −1.0 V), where the initial current in the dark state remains low, while photosensitivity (i.e., the ratio of photocurrent to initial current) is maximized compared to other operating regions (Fig. S8).33 The conductance switching behaviors under a single pulse of R/G/B-light was initially evaluated. As shown in Fig. 4(a), the undoped ZTO phototransistor shows a rapid increase in conductance under B-light illumination due to the active generation of photocarriers from the deep subgap states associated with VO. Moreover, the elevated conductance can be maintained for an extended period following the termination of B-light illumination, indicating that the conductance state can be switched to a higher conducting state under the B-light pulse. The conductance retention behaviors can be attributed to the incomplete recombination of photo-electrons and photo-ionized deep subgap states, which is a kind of non-spontaneous process requiring high activation energy. Unlike the spontaneous and rapid recombination process between delocalized photo-electrons and delocalized photo-holes, which can freely move within the material, the relaxation between delocalized photo-electrons and localized photo-ionized deep subgap states is not easily activated and occurs at a significantly slower rate.15,26 In contrast, under R- and G-light pulse illumination, the device fails to exhibit clear conductance changes and non-volatile behavior owing to the absence of localized shallow subgap states induced by point defects. Instead, slight conductance transitions are observed under R and G light pulse illumination, which are believed to be caused by photo-electron generation from band-tail states near the conduction band, formed due to disordered MO bonding networks (i.e., disordered bonding angles and lengths) within the amorphous metal oxide. Such conductance changes induced by photoexcitation from band-tail states can exhibit volatile behavior due to rapid recombination phenomena between delocalized photo-electrons and the widely distributed band-tail states within the MO semiconductor (Fig. S9). Comparatively, the Li-ZTO phototransistor exhibited non-volatile switching behaviors for all R/G/B-light pulses (Fig. 4(b) and (c)), which is attributed to the presence of widely distributed subgap states in the bandgap. These non-volatile switching characteristics under R/G/B light illumination were also similarly observed in devices doped with other alkali metals, such as Na and K (Fig. S10). Meanwhile, the Li (5 at%)-ZTO phototransistor exhibits distinct conductance switching characteristics, featuring a high conductance and robust non-volatile retention performance compared to the Li (10 at%)-ZTO phototransistor. Particularly, the conductance increased in response to the photon energy of the incident light, yielding conductance values of 2.04 × 10−11 S, 4.35 × 10−11 S, and 1.29 × 10−9 S under R-, G-, and B-light pulses, respectively, at a light intensity of 7 mW cm−2. These observations are likely due to the increased involvement of deeper subgap states in photo-electron generation with higher incident photon energy.
Furthermore, the analog conductance updating capability of the phototransistors was evaluated under repeated R/G/B-light pulses (30 pulses). As shown in Fig. 4(d), the undoped ZTO transistor exhibits conductance updating behavior exclusively under B-light pulses. The conductance progressively increases corresponding to the number of B-light pulses (Fig. 4(g)), resulting in a cumulative conductance update. Moreover, even after the termination of the B-light pulses, the elevated conductance persists with slow decay characteristics, indicating long-term retention of the conductance state. On the other hand, under R/G-light pulses, the conductance (C) values show transient detection behavior with rapid decay characteristics. The increased conductance after 30 pulses (C30th) of R- and G-light rapidly decreased to 37% and 34% within 50 s after termination of R- and G-light pulses, respectively. In contrast, the Li-ZTO phototransistors exhibit analog conductance update behaviors under all R/G/B-light pulses (Fig. 4(e) and (f)), which provides a basis for obtaining preprocessed color image data with enhanced contrast and reduced noise. Notably, regardless of the light color, the conductance reveals incremental updates with each successive light pulse. This behavior is possibly due to the excessive generation and incomplete recombination of photocarriers within the Li-ZTO channel, leading to a gradual accumulation of photocarriers (Fig. 4(i)). Specifically, in the case of the Li (5 at%)-ZTO transistor, the conductance state could be clearly enhanced depending on the number of repeated light pulses, resulting in a dramatic increase in conductance (Fig. 4(h)). Compared to the conductance value obtained after a single pulse (C1st) of R/G/B-light, the conductance values after 30 pulses of R/G/B-light (C30th) increase by factors of 15.5, 22.1, and 70.6, respectively. Additionally, a linear change in the conductivity state is observed during the light pulse, with low non-linearity (NL) values close to zero (Fig. S11). An NL value of zero indicates an ideal straight-line response.34 Furthermore, the updated conductance state can be maintained, showing robust retention behavior with slow decay characteristics. The C/C30th values remained at 77%, 63%, and 59% after 50 s following the termination of R/G/B-light pulses, respectively, demonstrating the stability of the conductance state.
Fig. 5(a) shows the neuromorphic image processor developed as a frontend device, which can further enhance machine vision systems by simultaneously performing color image detection and data preprocessing on the same device based on contrast enhancement and noise reduction capabilities. To verify the viability of the developed device as a neuromorphic color image processor, a simple prototype of the neuromorphic color image processor was implemented using a 7 × 7 device array of Li (5 at%)-ZTO phototransistors (Fig. 5(b)). An individual phototransistor can be considered as one pixel of the refined output image. The device array demonstrates exceptional device-to-device uniformity, as evaluated through the electrical characteristics of 49 devices uniformly distributed across a 9.9 cm2 Si substrate (Fig. S12). To demonstrate simple image preprocessing tasks, various raw images of letter ‘A’ were prepared, maintaining a consistent main feature while introducing randomly distributed background noise. In all raw datasets, letter ‘A’ occupies the identical 17 pixels within the 7 × 7 pixel array, while the remaining 32 pixels contain stochastic noise. As illustrated in Fig. 5(c), image processing was performed using the neuromorphic color image processor by mapping the light intensity of each pixel in the raw image. Strong (P = 7 mW cm−2) and weak (P = 3 mW cm−2) color light pulses were employed as input signals for the main feature and noise pixels, respectively. During sequential input of the noisy character image, strong light pulses (7 mW cm−2 and a pulse duration of 10 s) were constantly irradiated to the pixel locations corresponding to letter ‘A’, whereas weak pulses were randomly distributed across the background pixel. As a representative case, the detailed processing results for the red-colored letter ‘A’ are provided in Fig. S13.
As shown in Fig. 5(d)–(f), increasing the number of preprocessing cycles leads to a gradual enhancement in the conduction state of the phototransistors corresponding to letter ‘A’, resulting in an increased C/C0. Here, C0 and C denote the initial conductance and the light-stimuli updated conductance, respectively. These preprocessing operations can be achieved for letter ‘A’ with RGB colors owing to the full color response characteristics of the Li (5 at%)-ZTO phototransistor. Meanwhile, phototransistors corresponding to the background region of the raw image exhibited minimal changes in conductance, as the weak light pulses representing noise were intermittently incident and transient. Consequently, repeated input of the raw image resulted in progressively higher output signals (C/C0) in the phototransistors, leading to a more refined output image. This can be clearly observed in the output images shown in Fig. 5(c), where the main feature, letter ‘A’, becomes increasingly distinct compared to the initial output image, while the background noise is gradually suppressed. Additionally, despite an identical number of preprocessing operations, the neuromorphic image process can provide distinct output signals depending on the color of the input image. When the blue letter ‘A’ with high photon energy was utilized, the conductance state of the phototransistor indicated a more pronounced increase compared to the red letter ‘A’, owing to a greater generation of photocarriers. This resulted in a significant enhancement in the conductance compared to the initial state with C/C0 = 15
145, 779, and 184 for letter ‘A’ positions after 20 light pulses of B-, G- and R-light, respectively. The results presented here suggest that a neuromorphic image sensor consisting of the Li (5 at%)-ZTO phototransistor array can be viable as the frontend processor of a machine vision system, capable of transformation of raw color images into refined image data with enhanced contrast. Meanwhile, to process new image data with different colors or shapes, it is essential to reset the updated conductance states of the phototransistors, restoring them to their initial state. As shown in Fig. S14 and S15, the conductance states, updated by light stimulation, can be fully reset by applying a gate bias pulse (VGS for the reset operation = +2.5 V for 2 s).
As shown in Fig. 6(a), a machine vision system for machine recognition of color images was established using a neuromorphic phototransistor array (frontend part for image preprocessing) and AI algorithms (backend part for image recognition). For the raw image data, two types of 7 × 7 pixel character images (‘A’ and ‘B’) with six RGB color combinations for the main features and background areas were used (Fig. 6(b)). For each color configuration, multiple variations of letters ‘A’ and ‘B’ were extracted from a dataset of 100 images (Fig. S16). Therefore, a total of 1200 color character images [2 characters × 6 RGB color combinations × 100 shape variations] were prepared for training and testing. The raw images were repeatedly processed using the image preprocessor to enhance contrast through iterative image refinement. In this process, visible light signals corresponding to each pixel color in the raw image were applied to individual phototransistors within the image preprocessor at a constant light intensity of 7 mW cm−2. Following repeated image iteration, the IPhoto/I0 values of each phototransistor were extracted and converted into grayscale values from 0 to 255. By mapping these grayscale values, a refined image was reconstructed through image preprocessing. The image preprocessing was conducted for all 1200 raw images. Depending on the number of preprocessing cycles and the specific neuromorphic image processor used, refined images with varying grayscale quality were obtained. As a representative example, Fig. 6(c)–(g) present the grayscale images obtained by preprocessing the raw color images of letters ‘A’ and ‘B’ in Fig. 6(b). As demonstrated in Fig. 6(c)–(f), iterative preprocessing progressively enhances image contrast, producing grayscale character images with more distinct differences between the key features and the background. It is worth noting that the enhancement of image contrast through image preprocessing is a unique capability of neuromorphic image sensors, which is generally unattainable with conventional sensors. During the convolutional neural network (CNN) model training and pattern recognition using refined data, it is observed that multiple preprocessing iterations improved learning efficiency validated by a reduction in the number of training epochs and enhanced recognition rates compared to the single preprocessing case (Fig. 6(h)). However, increasing the number of preprocessing steps also results in higher power and time consumption. Notably, beyond ten times of image preprocessing, no significant improvement in learning efficiency or recognition rates is observed. These findings highlight the importance of optimizing the number of preprocessing cycles to balance performance gains with recognition efficiency.
Additionally, the performance of the Li (5 at%)-ZTO-based neuromorphic processor was compared with that of the undoped ZTO-based neuromorphic processor. As shown in Fig. 6(e) and (g), with an identical preprocessing number of 10, the Li (5 at%)-ZTO-based neuromorphic processor produces high-quality refined images with distinct contrast, outperforming the ZTO-based processor. In the ZTO phototransistors, extracting clear character patterns from raw images with R/G color combinations is challenging owing to the low photoresponse to R- and G-light. As a result, during CNN-based learning and pattern recognition, the Li (5 at%)-ZTO-based image processor exhibits a high recognition accuracy of 92% and enhanced learning efficiency (saturation of recognition accuracy at 22 epochs) compared to the ZTO-based image processor, which achieves a recognition accuracy of 65% with a saturation point at 30 epochs (Fig. 6(i)). For a more comprehensive discussion, the color pattern recognition performance of the image processor was compared between the homostructure Li (5 at%)-ZTO and heterojunction channel structures, such as our previously developed heterostructure MO phototransistor with a mixed quantum dot light-absorbing layer.9 The Li (5 at%)-ZTO processor exhibits a comparable recognition accuracy of more than 90%, similar to that of the heterojunction channel structure (Fig. S17). These results show that, even in the absence of a heterostructure, the MO semiconductor-based optoelectronic synapse array with a subgap-engineered homostructure channel layer can possess excellent color image preprocessing capabilities for both monochromatic RGB and mixed-color images (Table S1), thereby providing critical insights into the development of compact and low-power neuromorphic machine vision systems (Table S2).
:
Sn = 5
:
5). Next, Li-ZTO precursor solutions for Li-ZTO films (Li = 5 at% and 10 at%) were prepared by adding the appropriate amount of Li precursor solution to the ZTO precursor solution according to their composition ratios [Li
:
(Zn + Sn) = 5
:
100 and 10
:
100]. Then homogeneous solutions were prepared via magnetic stirring at 400 rpm for 12 h at room temperature and filtered using a 0.2-μm syringe before sol–gel spin-coating.
Next, all optoelectronic characteristics of MO phototransistors were evaluated using a semiconductor parameter analyzer (Agilent 4156C, Keysight) at room temperature in ambient air. The drain current–gate voltage (IDS–VGS) transfer curve with VGS sweep from −20 to +20 V was measured at a specific drain voltage (VDS) of 10 V. The VTh was determined as the specific VGS at IDS = L/W × 10 nA. The subthreshold swing (S.S.) value was extracted from the transfer curve (VDS = 10 V) using the following equation: S.S. = dVGS/dlog(IDS). The density of charge trap states (Nit) can be calculated from the following equation: Nit = [(S.S. × log
e)/(kT/q) − 1] × (Ci/q), where k and T are Boltzmann's constant and absolute temperature, respectively. The static and dynamic photoresponses of phototransistors under primary visible-light illumination were evaluated using light-emitting diodes (LEDs) with peak wavelengths at 454 (B), 519 (G), and 635 nm (R) being used as color light sources. The optical input-spike signals were generated by controlling visible LEDs using arbitrary function generators (Tektronix AFG 3252C).
| This journal is © The Royal Society of Chemistry 2026 |