Solving computer vision tasks with diffractive neural networks

Tao Yan, Jiamin Wu, Tiankuang Zhou, Hao Xie, Feng Xu, Jingtao Fan, Lu Fang, Xing Lin, Qionghai Dai

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

3 Citations (Scopus)

Abstract

Modern computer vision tasks are achieved by first capturing and storing large-scale images and then performing the processing electronically, the paradigm of which has the fundamentally limited speed and power efficiency with the continuous increase of the data throughput and computational complexity. We propose to build the all-optical artificial intelligent for light-speed computing, which performs advanced computer vision tasks during the imaging so that the detector can directly measure the computed results. The proposed method uses light diffraction property to build the optical neural network, where the neuron function is achieved by tuning the optical diffraction with a nonlinear threshold. Since every target scene has different frequency components, the proposed diffractive neural network is trained to perform various filtering on different frequency components and achieves different transform functions for the target scenes. We demonstrate the proposed approach can be used for high-speed detecting and segmenting visual saliency objects of the microscopic samples and macroscopic scenes as well as performing the task of object classification. The low power consumption, light-speed processing, and high-throughput capability of the proposed approach can serve as significant support for high-performance computing and will find applications in self-driving automobile, video monitoring, and intelligent microscopy, etc.

Original languageEnglish
Title of host publicationOptoelectronic Imaging and Multimedia Technology VI
EditorsQionghai Dai, Tsutomu Shimura, Zhenrong Zheng
PublisherSPIE
ISBN (Electronic)9781510630918
DOIs
Publication statusPublished - 2019
Externally publishedYes
EventOptoelectronic Imaging and Multimedia Technology VI 2019 - Hangzhou, China
Duration: 21 Oct 201923 Oct 2019

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume11187
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

ConferenceOptoelectronic Imaging and Multimedia Technology VI 2019
Country/TerritoryChina
CityHangzhou
Period21/10/1923/10/19

Bibliographical note

Publisher Copyright:
© 2019 SPIE.

Keywords

  • All-optical artificial intelligent
  • Computer vision
  • Diffractive neural network
  • Object classification
  • Saliency detection

Fingerprint

Dive into the research topics of 'Solving computer vision tasks with diffractive neural networks'. Together they form a unique fingerprint.

Cite this