Memory Optimization Techniques in Neural Networks: A Review
Pratheeksha P1, Pranav B M2, Azra Nasreen3
1Pratheeksha P, Department of Computer Science, R. V College of Engineering, Bengaluru (Karnataka), India.
2Pranav B M*, Department of Computer Science, R. V College of Engineering, Bengaluru (Karnataka), India.
3Dr. Azra Nasreen, Assistant Professor, Department of Computer Science, R. V College of Engineering, Bengaluru (Karnataka), India.
Manuscript received on July 19, 2021. Revised Manuscript received on July 25, 2021. Manuscript published on August 30, 2021. | PP: 44-48 | Volume-10 Issue-6, August 2021 | Retrieval Number: 100.1/ijeat.F29910810621 | DOI: 10.35940/ijeat.F2991.0810621
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Deep neural networks have been continuously evolving towards larger and more complex models to solve challenging problems in the field of AI. The primary bottleneck that restricts new network architectures is memory consumption. Running or training DNNs heavily relies on the hardware (CPUs, GPUs, or FPGA) which are either inadequate in terms of memory or hard-to-extend. This would further make it difficult to scale. In this paper, we review some of the latest memory footprint reduction techniques which would enable faster low model complexity. Additionally, it improves accuracy by increasing the batch size and developing wider and deeper neural networks with the same set of hardware resources. The paper emphasizes on memory optimization methods specific to CNN and RNN training.
Keywords: Memory footprint reduction, Backpropagation through time (BPTT), CNN, RNN.