Blog

Learn About Our Meetup

5000+ Members

GO >

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community.

JOIN

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

JOBS

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

CONTACT

[D] Tensorflow GPU C API performance in C++

Written by torontoai on September 24, 2019. Posted in Reddit MachineLearning.

I recently wrote a wrapper for the Tensorflow GPU C API to run in a C++ project I’m working on. Since the library is in C, it can’t throw, and the only STL function I call is std::vector’s “push back”. Based on Herb Sutter’s recent talk, I thought, “hey, I might as well make this function noexcept”. Much to my surprise, the function (which took 40ms to run my CNN before) sped up to running in 19ms. Can anyone help me speculate why it’s that big of a performance difference? (Using Visual Studio 19, C++17, default optimization options)

submitted by /u/WalkingAFI
[link] [comments]