Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

[D] Tensorflow GPU C API performance in C++

I recently wrote a wrapper for the Tensorflow GPU C API to run in a C++ project I’m working on. Since the library is in C, it can’t throw, and the only STL function I call is std::vector’s “push back”. Based on Herb Sutter’s recent talk, I thought, “hey, I might as well make this function noexcept”. Much to my surprise, the function (which took 40ms to run my CNN before) sped up to running in 19ms. Can anyone help me speculate why it’s that big of a performance difference? (Using Visual Studio 19, C++17, default optimization options)

submitted by /u/WalkingAFI
[link] [comments]