[D] CIFAR-10 equivalents in video classification (action recognition)
So I’ve been working on some stuff with CNNs trained on CIFAR-10, and I’m interested in seeing if what I’ve been working on scales to video, i.e. with 3D CNNs.
Unfortunately, I haven’t been able to find any CIFAR-10 equivalents for videos. Most of the well-recognized datasets (UCF-101, Kinetics, Moments in Time, etc) are absolutely massive and correspond more with ImageNet.
Does anyone know of recognized and (relatively) small-scale video datasets that are good for sanity-checking and won’t require a massive overhead to work with? I have looked at UCF-11, but it is labeled by the authors as an “incredibly challenging” dataset, which is not what I’m looking for. Thanks for your help!