ReRAM-based in-memory computing (IMC) techniques are promising for ML inference, and many research papers propose them for accelerating neural network (NN) operations. However, these papers do not discuss how the ReRAM platform would be integrated into a complete system.
In contrast, consider hardware acceleration on a conventional platform such as an FPGA. The application runs on an ordinary CPU and offloads specific kernels to the FPGA to speed them up. Data is exchanged over PCIe, Ethernet, or a serial interface.
If ReRAM-based IMC is to be used in practical applications, how would software applications interact with the ReRAM-based IMC accelerator? For example, a software application might have an image and want to know whether it shows a cat or a dog, using the ReRAM-based NN processor.
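
To make the question concrete, here is a minimal sketch of what the host-side view of such an interaction might look like. Everything in it is hypothetical: the `Accelerator` interface, its `load_weights` and `matvec` methods, and the `SimulatedDevice` class are illustrative names, not an existing driver or library API. The sketch assumes the offloaded kernel is a matrix-vector product, and the "device" is a pure-software stand-in that runs on the CPU.

```python
from abc import ABC, abstractmethod
from typing import List, Sequence


class Accelerator(ABC):
    """Hypothetical host-side view of an offload device (not a real API)."""

    @abstractmethod
    def load_weights(self, weights: Sequence[Sequence[float]]) -> None:
        """Store the NN weights on the device."""

    @abstractmethod
    def matvec(self, x: Sequence[float]) -> List[float]:
        """Return weights @ x, computed by the device."""


class SimulatedDevice(Accelerator):
    """Software stand-in for the accelerator; it computes on the CPU."""

    def __init__(self) -> None:
        self._weights: List[List[float]] = []

    def load_weights(self, weights: Sequence[Sequence[float]]) -> None:
        self._weights = [list(row) for row in weights]

    def matvec(self, x: Sequence[float]) -> List[float]:
        return [sum(w * v for w, v in zip(row, x)) for row in self._weights]


def classify(device: Accelerator, pixels: Sequence[float],
             labels: Sequence[str]) -> str:
    """Offload the scoring step to the device and pick the best label."""
    scores = device.matvec(pixels)
    best = max(range(len(scores)), key=scores.__getitem__)
    return labels[best]


# Toy single-layer "network": one weight row per class (made-up values).
device = SimulatedDevice()
device.load_weights([[1.0, 0.0, -1.0],
                     [-1.0, 0.0, 1.0]])
print(classify(device, [0.9, 0.5, 0.1], ["cat", "dog"]))
```

The open question is what would sit behind `load_weights` and `matvec` on real hardware: a PCIe-attached device as in the FPGA case, a memory-mapped region, or something else.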