[P] Pretrained pytorch resnet models for anime tag estimation

Written by torontoai on July 7, 2019. Posted in Reddit MachineLearning.

I trained some ResNet models to estimate tags for anime images.

There have been a lot of cool anime-related projects recently, such as DeepDanbooru and some work on anime face generation. However, most of them use TensorFlow, so I wanted a nice pretrained PyTorch model to use for transfer learning on downstream tasks.

A TL;DR of the networks:

Framework: PyTorch
Network: adapted resnet50, resnet34, and resnet18 models
Training time: about 12 days (using mixed-precision training and Nvidia V100)
Dataset: Danbooru2018 dataset, with networks trained to predict top tags of each image
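
To give a rough idea of what "predict top tags of each image" can look like as a training target, here is a minimal sketch. It assumes that "top tags" means the most frequent tags across the dataset and that each image is encoded as a multi-hot vector over that vocabulary. The helper names, the vocabulary size, and the toy data are hypothetical and not taken from the project.

```python
from collections import Counter


def build_vocab(tag_lists, top_n):
    """Return the top_n most frequent tags, most common first."""
    counts = Counter(tag for tags in tag_lists for tag in tags)
    return [tag for tag, _ in counts.most_common(top_n)]


def multi_hot(tags, vocab):
    """Encode one image's tags as a 0/1 vector aligned with vocab."""
    present = set(tags)
    return [1.0 if tag in present else 0.0 for tag in vocab]


# Toy stand-in for per-image tag lists (hypothetical data).
images = [
    ["1girl", "solo", "long_hair"],
    ["1girl", "smile"],
    ["1girl", "solo"],
]

vocab = build_vocab(images, top_n=2)
targets = [multi_hot(tags, vocab) for tags in images]
```

Vectors like `targets` are the kind of label a multi-label classifier would typically be trained against, one output per tag in the vocabulary.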

An example output of the resnet50 model

If you have PyTorch installed, you can load the model and predict on new images in only a few lines using PyTorch Hub. See this post for a quick getting-started guide.
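
As a rough sketch of what that could look like: `torch.hub.load` is the standard PyTorch Hub call, but the repository name is not given here, so it is left as a parameter you would fill in from the project page. The entrypoint name `resnet50`, the assumption that the model returns one raw score (logit) per tag, and the 0.5 threshold are guesses for illustration, not confirmed details of these models.

```python
import math


def load_tagger(repo, entrypoint="resnet50"):
    """Load a pretrained model via PyTorch Hub (needs torch and network access).

    `repo` is the "owner/name" GitHub string from the project page;
    the default entrypoint name is an assumption.
    """
    import torch  # imported lazily so top_tags() works without torch installed

    model = torch.hub.load(repo, entrypoint)
    model.eval()  # inference mode: fixes batch-norm statistics
    return model


def _sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def top_tags(logits, tag_names, threshold=0.5):
    """Turn per-tag logits into (tag, probability) pairs, highest first.

    Assumes a multi-label setup: each tag gets an independent sigmoid.
    """
    probs = [_sigmoid(x) for x in logits]
    picked = [(t, p) for t, p in zip(tag_names, probs) if p >= threshold]
    return sorted(picked, key=lambda pair: pair[1], reverse=True)
```

With a loaded model, something like `top_tags(model(batch)[0].tolist(), tag_names)` would give the predicted tags for the first image in a batch, provided `tag_names` is the tag list that matches the model's outputs.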

I hope people find this useful and that it saves some training time on other tasks.

For more info on the networks, training, data preparation, and performance metrics of each network, please have a look at the project page, or just ask :). Any comments or feedback are welcome.

submitted by /u/m_baas