1
|
Pei J, Song L, Liu P, Liu S, Liang Z, Wen Y, Liu Y, Wang S, Chen X, Ma T, Gao S, Hu G. Scalable Synaptic Transistor Memory from Solution-Processed Carbon Nanotubes for High-Speed Neuromorphic Data Processing. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024:e2312783. [PMID: 39468862 DOI: 10.1002/adma.202312783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 05/13/2024] [Indexed: 10/30/2024]
Abstract
Neural networks as a core information processing technology in machine learning and artificial intelligence demand substantial computational resources to deal with the extensive multiply-accumulate operations. Neuromorphic computing is an emergent solution to address this problem, allowing the computation performed in memory arrays in parallel with high efficiencies conforming to the neural networks. Here, scalable synaptic transistor memories are developed from solution-sorted carbon nanotubes. The transistors exhibit a large switching ratio of over 105, a significant memory window of ≈12 V arising from charge trapping, and low response delays down to tens of nanoseconds. These device characteristics endow highly stabilized reconfigurable conductance states, successful emulation of synaptic functions, and a high data processing speed. Importantly, the devices exhibit uniform characteristic metrics, e.g., with a 1.8% variation in the memory window, suggesting an industrial-scale manufacturing capability of the fabrication. Using the memories, a hardware convolution kernel is designed and parallel image processing is demonstrated at a speed of 1 M bit per second per input channel. Given the efficacy of the convolution kernel, a promising prospect of the memories in implementing neuromorphic computing is envisaged. To explore the potential, large-scale convolution kernels are simulated and high-speed video processing is realized for autonomous driving.
Collapse
Affiliation(s)
- Jingfang Pei
- Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
| | - Lekai Song
- Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
| | - Pengyu Liu
- Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
| | - Songwei Liu
- Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
| | - Zihan Liang
- Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
| | - Yingyi Wen
- Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
| | - Yang Liu
- Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
- Shun Hing Institute of Advanced Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
| | - Shengbo Wang
- School of Instrumentation and Optoelectronic Engineering, Beihang University, Beijing, 100191, China
| | - Xiaolong Chen
- Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
| | - Teng Ma
- Department of Applied Physics, Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong SAR, 999077, China
| | - Shuo Gao
- School of Instrumentation and Optoelectronic Engineering, Beihang University, Beijing, 100191, China
| | - Guohua Hu
- Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong SAR, 999077, China
| |
Collapse
|
2
|
Fritscher M, Singh S, Rizzi T, Baroni A, Reiser D, Mallah M, Hartmann D, Bende A, Kempen T, Uhlmann M, Kahmen G, Fey D, Rana V, Menzel S, Reichenbach M, Krstic M, Merchant F, Wenger C. A flexible and fast digital twin for RRAM systems applied for training resilient neural networks. Sci Rep 2024; 14:23695. [PMID: 39390001 PMCID: PMC11467404 DOI: 10.1038/s41598-024-73439-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 09/16/2024] [Indexed: 10/12/2024] Open
Abstract
Resistive Random Access Memory (RRAM) has gained considerable momentum due to its non-volatility and energy efficiency. Material and device scientists have been proposing novel material stacks that can mimic the "ideal memristor" which can deliver performance, energy efficiency, reliability and accuracy. However, designing RRAM-based systems is challenging. Engineering a new material stack, designing a device, and experimenting takes significant time for material and device researchers. Furthermore, the acceptability of the device is ultimately decided at the system level. We see a gap here where there is a need for facilitating material and device researchers with a "push button" modeling framework that allows to evaluate the efficacy of the device at system level during early device design stages. Speed, accuracy, and adaptability are the fundamental requirements of this modelling framework. In this paper, we propose a digital twin (DT)-like modeling framework that automatically creates RRAM device models from device measurement data. Furthermore, the model incorporates the peripheral circuit to ensure accurate energy and performance evaluations. We demonstrate the DT generation and DT usage for multiple RRAM technologies and applications and illustrate the achieved performance of our GPU implementation. We conclude with the application of our modeling approach to measurement data from two distinct fabricated devices, validating its effectiveness in a neural network processing an Electrocardiogram (ECG) dataset and incorporating Fault Aware Training (FAT).
Collapse
Affiliation(s)
- Markus Fritscher
- IHP - Leibniz Institut für innovative Mikroelektronik, Frankfurt (Oder), Germany.
- BTU Cottbus-Senftenberg, Cottbus, Germany.
| | | | - Tommaso Rizzi
- IHP - Leibniz Institut für innovative Mikroelektronik, Frankfurt (Oder), Germany
| | - Andrea Baroni
- IHP - Leibniz Institut für innovative Mikroelektronik, Frankfurt (Oder), Germany
| | | | | | | | - Ankit Bende
- Forschungszentrum Jülich GmbH, Jülich, Germany
| | - Tim Kempen
- Forschungszentrum Jülich GmbH, Jülich, Germany
| | - Max Uhlmann
- IHP - Leibniz Institut für innovative Mikroelektronik, Frankfurt (Oder), Germany
| | - Gerhard Kahmen
- IHP - Leibniz Institut für innovative Mikroelektronik, Frankfurt (Oder), Germany
- BTU Cottbus-Senftenberg, Cottbus, Germany
| | | | - Vikas Rana
- Indian Institute of Technology Bombay, Mumbai, India
| | | | | | - Milos Krstic
- IHP - Leibniz Institut für innovative Mikroelektronik, Frankfurt (Oder), Germany
- University of Potsdam, Potsdam, Germany
| | - Farhad Merchant
- Newcastle University, Newcastle upon Tyne, UK
- Cognigron and Bernoulli Institute, University of Groningen, Groningen, The Netherlands
| | - Christian Wenger
- IHP - Leibniz Institut für innovative Mikroelektronik, Frankfurt (Oder), Germany
- BTU Cottbus-Senftenberg, Cottbus, Germany
| |
Collapse
|
3
|
Li C, Lammie C, Dong X, Amirsoleimani A, Azghadi MR, Genov R. Seizure Detection and Prediction by Parallel Memristive Convolutional Neural Networks. IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS 2022; 16:609-625. [PMID: 35737626 DOI: 10.1109/tbcas.2022.3185584] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
During the past two decades, epileptic seizure detection and prediction algorithms have evolved rapidly. However, despite significant performance improvements, their hardware implementation using conventional technologies, such as Complementary Metal-Oxide-Semiconductor (CMOS), in power and area-constrained settings remains a challenging task; especially when many recording channels are used. In this paper, we propose a novel low-latency parallel Convolutional Neural Network (CNN) architecture that has between 2-2,800x fewer network parameters compared to State-Of-The-Art (SOTA) CNN architectures and achieves 5-fold cross validation accuracy of 99.84% for epileptic seizure detection, and 99.01% and 97.54% for epileptic seizure prediction, when evaluated using the University of Bonn Electroencephalogram (EEG), CHB-MIT and SWEC-ETHZ seizure datasets, respectively. We subsequently implement our network onto analog crossbar arrays comprising Resistive Random-Access Memory (RRAM) devices, and provide a comprehensive benchmark by simulating, laying out, and determining hardware requirements of the CNN component of our system. We parallelize the execution of convolution layer kernels on separate analog crossbars to enable 2 orders of magnitude reduction in latency compared to SOTA hybrid Memristive-CMOS Deep Learning (DL) accelerators. Furthermore, we investigate the effects of non-idealities on our system and investigate Quantization Aware Training (QAT) to mitigate the performance degradation due to low Analog-to-Digital Converter (ADC)/Digital-to-Analog Converter (DAC) resolution. Finally, we propose a stuck weight offsetting methodology to mitigate performance degradation due to stuck [Formula: see text] memristor weights, recovering up to 32% accuracy, without requiring retraining. The CNN component of our platform is estimated to consume approximately 2.791 W of power while occupying an area of 31.255 mm2 in a 22 nm FDSOI CMOS process.
Collapse
|