I once came across a paper about something similar (https://dl.acm.org/citation.cfm?id=3001164). I can't access it right now and don't remember all of it, but it was about using analog hardware to run a convolutional network for computer vision on the camera's output before that output is converted to digital.