[R] Low accuracy on small English dataset using XLNet

Written by torontoai on July 2, 2019. Posted in Reddit MachineLearning.

I compared accuracies on a small dataset with 2 classes, negative and positive; each class has 5k sentences. You can check it here: english-polarity. All notebooks fine-tune on a single Tesla V100 with a max sentence length of 100. The split is 80% train, 20% test. All notebooks use early stopping with a patience of 3, and I pick the highest test accuracy as the benchmark.
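To make the benchmark rule concrete, here is a minimal sketch of early stopping with a patience of 3 that reports the highest test accuracy seen. The function name and the per-epoch accuracies are hypothetical and made up for illustration; they are not taken from the notebooks, and the real training loops may differ.

```python
def best_accuracy_with_early_stopping(accuracies, patience=3):
    """Return (best_accuracy, epochs_run) for a sequence of per-epoch test accuracies.

    Training stops once `patience` consecutive epochs fail to beat the best
    accuracy seen so far. This is an assumed reading of "early stopping with
    3 patience", not the notebooks' actual code.
    """
    best = float("-inf")
    epochs_without_improvement = 0
    epochs_run = 0
    for acc in accuracies:
        epochs_run += 1
        if acc > best:
            # New best: remember it and reset the patience counter.
            best = acc
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break
    return best, epochs_run


# Hypothetical per-epoch test accuracies, for illustration only.
history = [0.70, 0.75, 0.74, 0.78, 0.77, 0.76, 0.75, 0.80]
best, epochs_run = best_accuracy_with_early_stopping(history)
print(best, epochs_run)
```

Note that with this rule the final 0.80 in the made-up history is never reached, because three epochs in a row fail to improve on 0.78 first. A run that recovers late can therefore be cut short by a small patience value.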

BERT English Base, test accuracy 81%.
GPT2 345M, test accuracy 79%.
XLNet Large, test accuracy 51%.
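For context, 51% on a two-class dataset with 5k sentences per class is close to what a classifier that always predicts one class would score. A minimal sketch of that majority-class baseline follows; it assumes the test split keeps roughly the same 50/50 balance as the full dataset, which the post does not state, and the helper name is hypothetical.

```python
from collections import Counter


def majority_baseline(labels):
    """Accuracy obtained by always predicting the most frequent label."""
    counts = Counter(labels)
    most_common_count = counts.most_common(1)[0][1]
    return most_common_count / len(labels)


# Label counts from the post: 5k negative and 5k positive sentences.
labels = ["negative"] * 5000 + ["positive"] * 5000
baseline = majority_baseline(labels)
print(baseline)
```

A score this near the baseline usually points at the pipeline (labels, tokenization, learning rate, or checkpoint loading) rather than at the model's capacity, so those are reasonable places to look first.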

I am not sure what is wrong here; maybe I did not follow the exact batch size and max length? I double-checked all the code I converted into notebooks, and everything looks the same, with just a few tweaks to make it notebook-able. Great model and repository, https://github.com/zihangdai/xlnet!

submitted by /u/huseinzol05