[D] What’s the best explanation of accuracy_per_sequence when evaluating a Transformer model in TensorFlow?
I’m looking for a good, detailed explanation of the evaluation metric accuracy_per_sequence, as displayed when training the official Transformer model: https://github.com/tensorflow/models/tree/master/official/transformer
Here is the TensorBoard output:
I’m assuming it refers to the accuracy of sequences per step?
I’m familiar with top-5 accuracy and similar metrics, but I haven’t found a good explanation of accuracy per sequence.
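To make the question concrete, here is a minimal sketch of what I would guess the metric computes, next to ordinary per-token accuracy. This is only my assumption, not something confirmed from the repo: a sequence counts as correct only if every non-padding token matches the label. The function names, the toy data, and the choice of id 0 as padding are all hypothetical.

```python
# Hypothetical helpers for illustration only; not taken from the TensorFlow repo.
PAD_ID = 0  # assumption: token id 0 marks padding in the labels


def token_accuracy(predictions, labels, pad_id=PAD_ID):
    """Fraction of non-padding tokens that were predicted correctly."""
    correct = 0
    total = 0
    for pred_seq, label_seq in zip(predictions, labels):
        for pred, label in zip(pred_seq, label_seq):
            if label != pad_id:
                total += 1
                correct += int(pred == label)
    return correct / total if total else 0.0


def sequence_accuracy(predictions, labels, pad_id=PAD_ID):
    """Fraction of sequences whose non-padding tokens are ALL correct."""
    if not labels:
        return 0.0
    exact = 0
    for pred_seq, label_seq in zip(predictions, labels):
        # Padding positions are ignored; one wrong real token fails the sequence.
        if all(p == t for p, t in zip(pred_seq, label_seq) if t != pad_id):
            exact += 1
    return exact / len(labels)


# Toy batch of three padded sequences (made-up token ids).
labels = [[5, 7, 9, 0], [4, 4, 0, 0], [8, 2, 6, 3]]
predictions = [[5, 7, 9, 1], [4, 3, 0, 0], [8, 2, 6, 3]]

print(token_accuracy(predictions, labels))     # 8 of 9 real tokens correct
print(sequence_accuracy(predictions, labels))  # 2 of 3 sequences fully correct
```

If that reading is right, the per-sequence number would always be at or below the per-token accuracy, since a single wrong token fails the whole sequence. Is that what the plot is showing?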
Appreciate the help, thanks!