Open Access Article
Seong-Yun Yuna,
Joon-Kyu Hanb and
Yang-Kyu Choi
*a
aSchool of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. E-mail: ykchoi@ee.kaist.ac.kr
bDepartment of Materials Science and Engineering and Inter-University Semiconductor Research Center, College of Engineering, Seoul National University, Gwanak-ro 1, Gwanak-gu, Seoul, 08826, South Korea
First published on 2nd February 2026
To enable in-sensor computing that emulates the biological brain's ability to process visual information with minimal energy consumption, we demonstrated brain-inspired coupled twin photo-responsive oscillators (CT-PROs) that integrate sensing and signal processing within a single device. The CT-PROs utilize a single-transistor latch (STL) phenomenon to generate iterative oscillations modulated by incident light. Each PRO, structurally identical to a metal–oxide-semiconductor field-effect transistor (MOSFET), is capable of sensing optical stimuli and producing corresponding oscillatory outputs governed by the STL mechanism. Notably, the PRO obviates the need for an external photodetector because it inherently operates as both an oscillator and a photodetector, functioning as a Janus-like device. The oscillation characteristics vary with input light intensity and wavelength, enabling detection of both parameters by observing whether synchronization or desynchronization occurs between the CT-PROs under different illumination conditions. The CT-PROs can be used for a coupled oscillator network (CON) with in-sensor computing capabilities. As an application of the PRO-based CON, edge detection was conducted. Then, edge maps generated by the proposed CON were used to aid in classification. These results highlight the potential of PRO-based CONs for in-sensor vision systems.
New conceptsThis work introduces a novel concept by redefining the traditional transistor's role beyond switching and amplification to function as a photo-responsive oscillator (PRO). Leveraging the single-transistor latch, the PRO generates oscillatory signals modulated by incident light intensity and wavelength. This dual-function device integrates sensing and signal processing within a single unit, eliminating the need for separate photodetectors and oscillators. Coupled twin PROs form a network capable of synchronized oscillations that process optical information for in-sensor computing. This enables efficient edge detection and image classification, with significant potential in low-energy vision systems. The PRO's compatibility with CMOS fabrication processes underscores its practicality for large-scale integration. Importantly, the concept represents a shift from traditional digital and static signal processing to dynamic, wave-based computation using coupled oscillators. Such a paradigm enhances noise robustness, energy efficiency, and scalability, which align with emerging neuromorphic and brain-inspired computing trends. This new framework fundamentally redefines sensor and processor integration, offering a promising direction for next-generation sensory and artificial intelligence systems. |
However, conventional vision systems based on the von Neumann architecture, which are structurally integrated with external photodetectors, are fundamentally limited by the inherent separation of photo-sensing, computation, and memory units.7–10 This separation leads to significant energy consumption and latency resulting from frequent data transfers, which poses substantial challenges for applications that require real-time operation and low energy consumption, particularly when deployed in mobile devices, edge devices, and Internet of Things (IoT) environments. In-sensor computing provides an innovative approach to overcome these bottlenecks by enabling sensors to perform both signal acquisition and data pre-processing within a single unit. By integrating computation directly into the photodetector, in-sensor computing reduces redundant data transfers, increases computational speed, and enhances energy efficiency.11,12
Various forms of brain-inspired networks have been proposed as alternatives to the von Neumann architecture. Inspired by the synchronized oscillatory behaviour of biological neurons rather than discrete spikes in the brain, networks composed of coupled oscillators, known as coupled oscillator networks (CONs), offer smoother signal representation and simpler dynamics.13–17 They are well suited for low-energy and real-time sensory systems given their inherently high energy efficiency, parallelism, and compatibility with analogue hardware.18,19 Since CONs process information through phase relationships rather than complex arithmetic operations, unlike conventional networks that rely on deterministic functions, they are additionally more robust to noise. Therefore, if properly implemented, vision systems based on CONs can achieve simpler computation and lower energy consumption compared to conventional vision systems.
It is worth noting that signal processing based on wave-based oscillatory systems generally provides greater noise robustness, compared to traditional magnitude-centric linear approaches. This increased robustness is attributed to the inherent wave nature, where information is represented not only by amplitude but also by phase and frequency, enabling the system to selectively filter signals and utilize resonance effects to amplify relevant components while overcoming inevitable noise. Additionally, nonlinear interactions in coupled oscillators help them synchronize their phases collectively, making the system more resistant to disturbances. Wave-based systems can also encode information across both space and time, which helps protect against the effects of noise or damage in specific areas.
In this work, we propose coupled twin photo-responsive oscillators (CT-PROs) as a core element for a CON with in-sensor computing capabilities. The overall research concept is illustrated in Fig. 1a. The proposed PRO is equivalent to a metal–oxide-semiconductor field-effect transistor (MOSFET) and can thus be fabricated using mature complementary metal–oxide-semiconductor (CMOS) technologies developed over the past six decades. This MOSFET comprises a source (S), a drain (D), a gate (G), and a floating body (FB). Traditionally, the body has been conceptualized as a resistive element, exhibiting high resistance in the off-state (Roff) and low resistance in the on-state (Ron). This conventional interpretation forms the basis for the term ‘transistor’, a combination of ‘transfer’ and ‘resistor’. Within this framework, the transistor maintains a stable low current in the off-state and a high current in the on-state, independent of time. However, the present study offers a reinterpretation of the transistor's functional role by treating the FB not as a resistive but as a capacitive component. We experimentally demonstrate that a transistor incorporating an FB can function as an oscillator. The FB exhibits capacitive behaviour, undergoing continuous cycles of charging and discharging when a constant current is supplied to D. This process yields an oscillatory output, marking a functional transformation wherein the transistor operates not as a traditional electronic switch but as an active oscillator.
The MOSFET used in this work has a vertical pillar structure with the S/D positioned at the top and bottom of the pillar-shaped FB, respectively. The p-type single-crystalline silicon (Si) channel within the vertical pillar is physically isolated by a circular-shaped gate oxide and an n+ in situ doped polycrystalline-silicon (poly-Si) gate, forming a gate-all-around (GAA) structure surrounding the channel. The channel (p-type Si) shares the same body region with the n+ S and n+ D, forming a FB due to the absence of an external electrical contact to the body. Since this FB can directly respond to external light stimuli for various light intensities (IL) and wavelengths (λ), the PRO can function as a photodetector and a conventional transistor. Additionally, because the FB facilitates iterative oscillations through the single-transistor latch (STL) phenomenon, the PRO can also serve as the smallest unit of oscillatory processing.20–22 Consequently, the PRO is capable of performing both sensing and processing functions for in-sensor computing. A building block of the CT-PROs consists of two photo-responsive oscillators (PRO1 and PRO2) connected via a coupling capacitor (CC). An array of such CT-PROs collectively constitutes the coupled oscillator network (CON).
When the PRO is exposed to visible light, photogenerated electron–hole pairs (EHPs) accumulate in the FB, modulating the oscillation characteristics. After capacitively coupling the twin PROs, their synchronization behaviours were investigated by varying the IL and λ of the illuminating light (Fig. 1b). As an application, edge detection was demonstrated based on measured synchronization behaviours. Furthermore, to evaluate the effectiveness of edge extraction across diverse images and its potential to enhance image recognition, software-based simulations employing convolutional neural networks (CNNs) were conducted.
As a Janus-like device, the PRO functions in two distinct operating modes. The first mode, corresponding to conventional transistor operation, involves a voltage input (Vin) and current output (Iout); in this mode, a stationary drain current (ID) without time dependence flows from D when the gate voltage (VG) exceeds the threshold voltage (VT). The second mode, functioning as an oscillator, is characterized by a current input (Iin) and a voltage output (Vout); here, an oscillating drain voltage (VD) is generated at the D when VG is below VT.
When a positive VD is applied along with a negative VG, the PRO with an n-channel initially remains in a high-resistance state (HRS), with a small number of electrons overcoming the potential barrier between S and FB (Vbi,S_FB). When VD increases, EHPs are generated by impact ionization (II) near the junction between the FB and D. The generated holes accumulate in the FB, while the electrons move toward D. These accumulated holes reduce Vbi,S_FB, enabling more electrons to flow from S to D, thereby amplifying the II through a positive feedback loop. This positive feedback loop induces a STL phenomenon, characterized by an abrupt transition from the HRS to a low-resistance state (LRS). When VD reaches the latch-up voltage (VLU), ID increases suddenly by more than six orders of magnitude.
When a constant Iin, as a direct current (DC) input signal, is supplied to the D, the PRO initially in the HRS begins to integrate positive charges in a parasitic capacitor (Cpar), which is parallelly connected at the D. This is a charging process. Here, Cpar comprises the device capacitances and external capacitances. The former includes the capacitance of the large-sized probing pad, as well as the capacitances associated with G, S, D, and FB. The latter includes the capacitance of the measurement equipment and the cable capacitance used to connect the PRO to the measurement setup. This charging process causes Vout to increase until it reaches the top voltage (Vtop), equivalent to the aforementioned VLU. When Vout reaches Vtop, the HRS switches to the LRS via the STL. Consequently, the integrated positive charges are abruptly discharged from Cpar, resulting in a sudden drop in Vout to the bottom voltage (Vbottom). This is a discharging process. The voltage difference between Vtop and Vbottom is defined as the peak-to-peak voltage (Vpp), i.e., Vtop − Vbottom. As long as Iin is forced, repetitive charging and discharging processes generate wedge-shaped oscillatory waveforms between Vtop and Vbottom without requiring complex circuits.
Fig. S2 illustrates a detailed mechanism of the oscillation with energy band diagrams at each step. It is essential to investigate the oscillation characteristics of the PRO for various Iin, Cpar, and Dp values. Fig. S3 shows its measured oscillation frequency (f) and energy consumption per oscillation (E/osc) for different Iin Cpar, and Dp values. The f of the PRO can be simply modelled using the following equation:
Both increasing Iin and the reduction of Cpar increase f due to a higher speed of charge integration. As Cpar decreases, f tends to increase, while E/osc tends to decrease. This result suggests that computing speed and energy efficiency can be further improved when Cpar approaches the intrinsic capacitance of the PRO, by eliminating the extrinsic capacitance introduced by the measurement setup. As shown in Fig. S4, the measured switching speeds of the PRO from the HRS to the LRS and vice versa were 50 ns and 60 ns, respectively. Therefore, the maximum f of the PRO is expected to be 9.1 MHz, which is the reciprocal of the total switching delay. In addition, the oscillation characteristics also depend on Dp, which directly affects the effective channel width of the PRO. As Dp increases, the larger FB volume provides a higher effective capacity for hole storage, leading to an increased f and reduced E/osc.
As degradation of the oscillation characteristics can adversely affect system performance, it is worth evaluating the endurance of the PRO. Oscillating operations were continuously performed for 5 × 105 cycles. Fig. S5a–c show that there was no noticeable degradation in Vtop and f during long-term operation. Also, Fig. S5d compares the transfer characteristics (ID − VG) of the PRO before and after this endurance test, showing a negligible change in threshold voltage, subthreshold swing, and gate current. These results confirm that the performance of the PRO can be maintained under repetitive operation.
When the electrically activated PRO, which was biased by the applied Iin, is exposed to visible light, additional EHPs are generated by incident photons, as illustrated in Fig. 2a. Although the FB is partially shielded by the S electrode, visible light can still effectively influence the FB region because the interlayer dielectric (ILD) is transparent and the thickness of the S region is sufficiently small compared with the penetration depth of visible light in silicon, particularly at longer λ. Consequently, photo-carriers generated in the S region can diffuse and drift into the FB. The optically generated holes additionally reduce Vbi,S_FB, resulting in a decrease in VLU with increasing IL, as shown in Fig. 2b. In this proof-of-concept demonstration, white light was used as the illumination source with the fixed λ. As IL increases, more EHPs are optically generated, further lowering Vbi,S_FB. This, in turn, leads to reductions in both Vtop and Vpp, while simultaneously increasing f, as shown in Fig. 2c–e. Through this mechanism, the PRO functions dually as a photodetector and an oscillator.
The electrical characteristics of the PRO also depend on λ. Fig. 3a shows the energy band diagrams with different λ values of the illuminated light with the fixed IL. The PRO was exposed to light in the visible spectrum (red, green, and blue) using a white light source and a colour filter. As λ increases, the penetration depth of photons into silicon also increases.25,26 In the PRO, the key factor governing the λ dependence is the spatial distribution of photogenerated EHPs, particularly the distance between their peak generation point and the junction boundary between the n+ S and the p-type FB. Shorter λ is strongly absorbed near the n+ S, where carrier recombination is significant and their influence on the FB is limited. In contrast, longer λ, such as red light in this work, penetrates deeper into the p-type FB region, thereby more effectively modulating the hole population in the FB. Therefore, the values of Vtop, Vpp, and f in the oscillation, as well as VLU in the STL, are influenced by λ (Fig. 3b–e). In this way, the PRO simultaneously detects light and encodes light information into oscillations.
Fig. S6c presents the measured Vout,1 and Vout,2, while Fig. S6d shows the averaged XOR (AVXOR) value, which is adopted to quantify synchronization behaviours. The AVXOR measures the degree of similarity between waveforms, and this method has been reliably used to assess synchronization.27–29 To calculate AVXOR, Vout,1 and Vout,2 are converted into binary values (i.e., 0 or 1) using a thresholding process, after which the XOR operation is performed on them. The result of this operation is then averaged over time. When CC is less than 20 pF, Vout,1 and Vout,2 fluctuate independently. The coupling is too weak to enable effective charge exchange between the twin PROs, resulting in a lack of correlation. Consequently, AVXOR fluctuates randomly over time and converges to 0.5, indicating no synchronization.
In contrast, when CC exceeds 5.6 nF, the twin PROs synchronize with a phase difference (Δϕ) of 0°, corresponding to in-phase synchronization and an AVXOR value approaching 0. At intermediate CC values between these two extremes, out-of-phase synchronization with a Δϕ of 180° (π) is observed. In this range, AVXOR gradually decreases with increasing CC, indicating a transition from uncorrelated to correlated behaviour characterized by in-phase synchronization. It is important to note that under practical operating conditions, a dedicated readout circuit can be used for signal input and output, avoiding the need for external instruments such as a parameter analyser. This configuration helps eliminate the influence of external capacitance, whose magnitude is comparable to that of CC, and therefore leads to a significant reduction in the total capacitance.
To examine the robustness of spontaneous synchronization against noise in the CT-PROs, a controlled noisy input was intentionally applied to one of the PROs, as shown in Fig. S7a. A constant direct current (IDC) was supplied to PRO1, while a time-varying alternating current (IAC) superimposed on the same IDC was applied to PRO2. Fig. S7b presents Vout,1 and Vout,2 for various IAC amplitudes. Out-of-phase synchronization was consistently observed regardless of the magnitude of IAC. As shown in Fig. S7c and d, a comparison of the fast Fourier transform (FFT) spectra with and without IAC injection indicates that the f of the CT-PROs was not significantly altered. To better emulate real environments, semi-empirical simulations were conducted using the SPICE circuit simulator. As depicted in Fig. S8a, the PRO was modelled as a voltage-controlled threshold switch connected in parallel with a Cpar. The simulated output, shown in Fig. S8b, closely matched the experimental results. In the simulation setup (Fig. S8c and d), a constant IDC was supplied to PRO1 as a reference, while an IAC with randomly fluctuating noise superimposed on the same IDC was applied to PRO2. As presented in Fig. S8e and f, the waveform and FFT spectra exhibited a minimal change despite the presence of noise. This high level of noise immunity demonstrates the suitability of CT-PROs for integration into noise-resilient CONs.
Next, the synchronization behaviours of the CT-PROs under the different combinations of light illumination were investigated. Two light sources were independently controlled to apply various IL and λ values to the CT-PROs. In Fig. 4a–c, synchronization behaviours are investigated by adjusting only IL of the white light incident on each PRO, with IL levels categorized as low, middle, and high. In contrast, Fig. 4d–f explore synchronization behaviours while adjusting only λ of the light incident on each PRO, with λ classified as red, green, and blue.
Fig. 4a illustrates a circuit configuration where PRO1 and PRO2 are connected through a CC of 200 pF and subjected to illumination with different IL values. When low, middle, and high IL levels are applied to PRO1 and PRO2, respectively, the maximum number of possible IL combinations is nine, forming a 3-by-3 matrix. As shown in Fig. S9a, when CT-PROs receive light of the same IL (e.g., low, middle, or high), they oscillate with similar Vpp values in Vout,1 and Vout,2, readily exhibiting out-of-phase synchronization. For example, as shown in Fig. 4b, when CT-PROs are exposed to low IL, strong coupling occurs, resulting in effective synchronization. In contrast, when one PRO is illuminated with low IL and the other with high IL, the coupling weakens, leading to less coherent synchronization. This variation in synchronization behaviour is also reflected in AVXOR values, as shown in Fig. 4c. A higher AVXOR value indicates more coherent synchronization. The AVXOR increases as the IL levels of the CT-PROs become more similar, indicating that synchronization becomes more pronounced when the CT-PROs are illuminated with comparable IL.
Similarly, synchronization behaviour also depends on the λ of the illuminating light. Fig. 4d presents another circuit diagram in which PRO1 and PRO2 are coupled through the same CC of 200 pF and are exposed to light of various λ values. The twin PROs are independently illuminated with nine λ combinations, corresponding to a 3-by-3 matrix of red/green/blue light applied to PRO1 and PRO2, respectively. As shown in Fig. 4e, f, and S9b when the CT-PROs receive light with the same λ (e.g., red, green, or blue), strong coupling facilitates coherent out-of-phase synchronization with higher AVXOR values. However, when one PRO receives red light, while the other receives blue light, the coupling weakens, leading to deteriorated out-of-phase synchronization and lower AVXOR values.
Unlike the previous cases, where only IL or λ was varied, light information under diverse illumination conditions becomes more complex. Independently adjusting both IL and λ for each oscillator results in 81 possible combinations, corresponding to a 9-by-9 matrix comprising IL values of low/middle/high and λ values of red/green/blue, applied to PRO1 and PRO2, respectively. Fig. S10a presents the circuit diagram for 81 combinations, covering three values of IL and three values of λ for the CT-PROs. As shown in Fig. S10b, the results indicate that similar conditions of IL and λ for CT-PROs produce higher AVXOR values. In contrast, mismatched conditions generally result in lower AVXOR values.
The edge detection was performed under three scenarios: (1) when only IL was considered, (2) when only λ was considered, and (3) when both IL and λ were considered. Fig. S11 categorizes the three scenarios and provides procedural details. When only IL was considered, an image was converted into a grayscale image. When only λ was considered, the image was converted into an image composed solely of red/green/blue pixels. In the combined scenario, both of them were used for edge detection, identifying edges where either IL or λ of two neighbouring pixels differ.
Edge detection simulations were performed using MATLAB by scanning the image and observing the AVXOR values between adjacent pixel pairs, as shown in Fig. 5. The scanning process involved both vertical and horizontal movements. Horizontal scanning involved sequential movements from left to right, while vertical scanning moved sequentially from top to bottom. As shown in Fig. S12, for vertical edge detection, the CT-PROs were placed vertically—one above the other—and an AVXOR value from the top and bottom pixels was extracted using a monotone grayscale. Similarly, for lateral edge detection, the CT-PROs were placed side-by-side, and an AVXOR value from left and right pixels was obtained in the same manner. As shown in Fig. S13, the scenario that considered both IL and λ resulted in the highest number of detected edges, demonstrating that simultaneous scanning of both parameters enhances edge detection performance.
To investigate the benefits of integrating edge detection in classification tasks, we constructed two convolutional neural networks (CNNs): a conventional one-stream CNN using only original images and a two-stream CNN that integrates original images and edge maps. To assess the utility of edge information as a complementary feature in visual recognition, we used the Canadian Institute for Advanced Research (CIFAR)-10 dataset, which consists of 50
000 training and 10
000 test images (Table S1).32
Edge maps were generated in advance using the CON based on the CT-PROs, where a predefined threshold was applied to binarize the AVXOR output (Fig. S14). The resulting edge maps were compared with those obtained using the Canny and Sobel edge detection algorithms.33–35 Under the scenario considering both IL and λ, the CT-PROs exhibited edge detection performance comparable to these conventional methods, while significantly reducing hardware resources and energy consumption by eliminating multiple kernel matrices.
Fig. 6a depicts the one-stream and two-stream CNN architectures. The two-stream CNN comprises two parallel pathways for original images and edge maps, each including convolutional layers, ReLU activation layers, and max pooling layers. An element-wise attention mechanism is applied prior to fusion, where attention weights derived from the edge stream modulate the original image features.36 The fused features are then concatenated and passed through a dropout layer and a fully connected layer for final classification. For a fair comparison, the one-stream CNN shares the same convolutional structure and both models were trained under identical conditions. Detailed information of the network is provided in Table S2.
As shown in Fig. 6b, the two-stream CNN that incorporated edge maps achieved higher classification accuracy than the one-stream CNN. In addition, the corresponding confusion matrices in Fig. 6c and S15 showed enhanced classification performance. This classification performance can be further improved through techniques such as employing pretrained networks, optimizing data structures, and applying learning rate scheduling. To further validate generalizability, the two-stream CNN was evaluated on the International Skin Imaging Collaboration (ISIC) 2018 skin lesion dataset (Fig. S16), where it consistently outperformed the one-stream CNN.37
The results from these semi-empirical simulations demonstrate that edge maps function as valuable structural priors, allowing the network to effectively capture boundary and contour information that cannot be readily inferred from the original images alone. By explicitly integrating edge representations, the model is better equipped to learn both visual and structural features in a complementary manner. Furthermore, the inclusion of an attention mechanism amplifies the contribution of edge features. Collectively, the integration of edge maps significantly enhances classification performance, particularly in categories where structural information is critical.
| This journal is © The Royal Society of Chemistry 2026 |