[D] BERT Large Fine-tune Benchmarks with NVIDIA Quadro RTX 6000 & RTX 8000 GPUs
Hey ML community,
We recently ran a series of benchmark tests showing the capabilities of NVIDIA Quadro RTX 6000 and RTX 8000 GPUs on BERT Large with different batch sizes, sequence lengths, and FP32 and FP16 precision. These were ran using the NVIDIA benchmark script found on their github, and show 1, 2, and 4 GPU configs in a workstation.
What types of tests/benchmarks would you like to see ran on these GPUs? What are your thoughts?