[Discussion] Millions of different NN models at the same time?
Hi folks,
does anyone know of applications or use cases where one has to deal with hundreds, thousands or even millions of queries of different neural network models? Maybe cloud service providers with user-customised neural networks, or maybe some AR applications… And if there is, how do people handle this situation nowadays, wouldn’t the storage requirements blow up like crazy? And what about latency issues (off-chip loading)?
submitted by /u/swiedema
[link] [comments]