Abstract
Artificial Neural Networks (ANNs) are typically trained via the back-propagation (BP) algorithm. This approach has been extremely successful: current models like GPT-3 have O(10^11) parameters, are trained on O(10^11) words, and produce awe-inspiring results. However, there are good reasons to look for alternative training methods: with current algorithms and hardware, sometimes only half of the available computing power is actually utilized. This is due to a complicated interplay between the size of the ANN, the available memory, the throughput limitations of interconnects, the architecture of the network of computers, and the training algorithm. Training a model like the aforementioned GPT-3 takes months and costs millions of dollars. A different training paradigm, one that could make clever use of specialized hardware, may train large ANNs more efficiently.
© 2023 IEEE
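For concreteness, the following is a minimal sketch (not taken from the paper) of the back-propagation baseline the abstract refers to: one gradient-descent step for a two-layer network in plain NumPy. The network size, tanh activation, mean-squared-error loss, and learning rate are all illustrative assumptions.

```python
import numpy as np

# Illustrative sketch: one back-propagation step for a two-layer network
# on a toy regression batch. All shapes and hyperparameters are assumed.
rng = np.random.default_rng(0)
x = rng.normal(size=(32, 4))        # batch of 32 inputs
y = rng.normal(size=(32, 1))        # matching targets
W1 = rng.normal(size=(4, 8))        # first-layer weights
W2 = rng.normal(size=(8, 1))        # second-layer weights
lr = 1e-2                           # learning rate

# Forward pass.
h = np.tanh(x @ W1)                 # hidden activations
y_hat = h @ W2                      # network output
loss = np.mean((y_hat - y) ** 2)    # mean-squared error

# Backward pass: propagate the error signal layer by layer.
d_y_hat = 2 * (y_hat - y) / y.size  # dL/dy_hat
d_W2 = h.T @ d_y_hat                # dL/dW2
d_h = d_y_hat @ W2.T                # error at the hidden layer
d_W1 = x.T @ (d_h * (1 - h ** 2))   # tanh'(z) = 1 - tanh(z)^2

# Parameter update (plain gradient descent).
W1 -= lr * d_W1
W2 -= lr * d_W2
```

Note how the backward pass reuses the forward activations (`h`, `y_hat`): this memory requirement, together with the strictly sequential layer-by-layer dependency, is one source of the hardware-utilization issues the abstract describes.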