[D] Which SOTA authorship attribution / text classification model to use?
I’m currently doing research for my thesis project, and was wondering which models to experiment with. I have a large dataset of political speeches (around 180.000) annotated with the respective party (10 parties total), and would like a model to learn to classify each party given the speeches.
My question is, which model is currently best for this type of task? I have some experience with Bi-LSTM models, and also CNN with LSTM – however I’m very interested if other models would perform better at this task, or if you any experience with the architecture of these type of models?