[P] Pretrained pytorch resnet models for anime tag estimation
Hi
I trained some Resnet models to estimate tags for anime images.
There has been a lot of cool anime-related projects recently, such as DeepDanbooru and some other cool work with anime face generation, however most use tensorflow and so I wanted a nice pretrained pytorch model to use for transfer learning with downstream tasks.
A TL;DR of the networks:
- Framework: Pytorch
- Network: adapted resnet50, resnet34, and resnet18 models
- Training time : about 12 days (using mixed-precision training and Nvidia V100)
- Dataset: Danbooru2018 dataset, with networks trained to predict top tags of each image
An example output of the resnet50 model
If you have pytorch, you can load up the model and predict on new images in only a few lines using pytorch hub. See this post for a quick getting started
I hope some people find this useful, and that it saves some people training time with other tasks.
For more info on the networks, training, data preparation, and performance metrics of each network, please have a look at the project page, or just ask :). Any comments or feedback is nice.
submitted by /u/m_baas
[link] [comments]