File Download

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

최민호

Choi, Minho
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Transferable polychromatic optical encoder for neural networks

Author(s)
Choi, MinhoXiang, JinlinWirth-Singh, AnnaBaek, Seung-HwanShlizerman, EliMajumdar, Arka
Issued Date
2025-07
DOI
10.1038/s41467-025-61338-4
URI
https://scholarworks.unist.ac.kr/handle/201301/88788
Citation
NATURE COMMUNICATIONS, v.16, no.1, pp.5623
Abstract
Artificial neural networks have fundamentally transformed the field of computer vision, providing unprecedented performance. However, these neural networks for image processing demand substantial computational resources, often hindering real-time operation. In this work, we demonstrate an optical encoder that can perform convolution simultaneously in three color channels during the image capture, effectively implementing several initial convolutional layers of the network. Such an optical encoding results in similar to 24, 000 x reduction in computational operations, with a state-of-the-art classification accuracy (similar to 73.2%) in free-space optical system. In addition, our analog optical encoder, trained for CIFAR-10 data, can be transferred to the ImageNet subset, High-10, without any modifications, and still exhibits moderate accuracy. The proposed method can decrease total system-level energy more than two orders of magnitude per a single object classification. Our results evidence the potential of hybrid optical/digital computer vision system in which the optical frontend can pre-process an ambient scene to reduce the energy and latency of the whole computer vision system
Publisher
NATURE PORTFOLIO
ISSN
2041-1723

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.