Training fully connected networks with resistive memories: impact of device failures

Louis P. Romero; Stefano Ambrogio; Massimo Giordano; Giorgio Cristiano; Martina Bodini; Pritish Narayanan; Hsinyu Tsai; Robert M. Shelby; Geoffrey W. Burr

doi:10.1039/C8FD00107C

Training fully connected networks with resistive memories: impact of device failures

Louis P. Romero,^a Stefano Ambrogio,^a Massimo Giordano,^ab Giorgio Cristiano,^ab Martina Bodini,^ab Pritish Narayanan,^a Hsinyu Tsai,^a Robert M. Shelby^a and Geoffrey W. Burr

*^a

Author affiliations

* Corresponding authors

^a IBM Research AI, IBM Research – Almaden, 650 Harry Road, San Jose, CA, USA 95120
E-mail: gwburr@us.ibm.com
Fax: +1 408 927-2100
Tel: +1 408 927-1512

^b EPFL, Route Cantonale, 1015 Lausanne, Switzerland

Abstract

Hardware accelerators based on two-terminal non-volatile memories (NVMs) can potentially provide competitive speed and accuracy for the training of fully connected deep neural networks (FC-DNNs), with respect to GPUs and other digital accelerators. We recently proposed [S. Ambrogio et al., Nature, 2018] novel neuromorphic crossbar arrays, consisting of a pair of phase-change memory (PCM) devices combined with a pair of 3-Transistor 1-Capacitor (3T1C) circuit elements, so that each weight was implemented using multiple conductances of varying significance, and then showed that this weight element can train FC-DNNs to software-equivalent accuracies. Unfortunately, however, real arrays of emerging NVMs such as PCM typically include some failed devices (e.g., <100% yield), either due to fabrication issues or early endurance failures, which can degrade DNN training accuracy. This paper explores the impact of device failures, NVM conductances that may contribute read current but which cannot be programmed, on DNN training and test accuracy. Results show that “stuck-on” and “dead” devices, exhibiting high and low read conductances, respectively, do in fact degrade accuracy performance to some degree. We find that the presence of the CMOS-based and thus highly-reliable 3T1C devices greatly increase system robustness. After studying the inherent mechanisms, we study the dependence of DNN accuracy on the number of functional weights, the number of neurons in the hidden layer, and the number and type of damaged devices. Finally, we describe conditions under which making the network larger or adjusting the network hyperparameters can still improve the network accuracy, even in the presence of failed devices.

This article is part of the themed collection: New memory paradigms: memristive phenomena and neuromorphic applications

Associated articles

Article information

DOI: https://doi.org/10.1039/C8FD00107C
Article type: Paper
Submitted: 29 May 2018
Accepted: 19 Jul 2018
First published: 20 Jul 2018

Download Citation

Faraday Discuss., 2019,213, 371-391

Permissions

Request permissions

Training fully connected networks with resistive memories: impact of device failures

L. P. Romero, S. Ambrogio, M. Giordano, G. Cristiano, M. Bodini, P. Narayanan, H. Tsai, Robert M. Shelby and G. W. Burr, Faraday Discuss., 2019, 213, 371 DOI: 10.1039/C8FD00107C

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Faraday Discussions

Training fully connected networks with resistive memories: impact of device failures

Abstract

Associated articles

Article information

Download Citation

Permissions

Training fully connected networks with resistive memories: impact of device failures

Social activity

Search articles by author

Spotlight

Advertisements