[P] Tool to build GPT-2 textgen APIs scalable and free using Google Cloud Run
https://github.com/minimaxir/gpt-2-cloud-run
There have been a few posts here w/ interactive GPT-2 textgen models. I’ve built an open-source tool to help build APIs with GPT-2 (specifically, fine-tuned models on a new dataset) via gpt-2-simple and deploy them to Cloud Run, where the pricing works out to be effectively free unless you have huge spikes or constant requests.
I have also included a mini-Cloud Build tutorial to limit model downloading/uploading.
submitted by /u/minimaxir
[link] [comments]