[D] Speech-to-text adversarial examples to slow YouTube censorship
According to a Google whistleblower, much of the demonetization/censorship on YouTube is driven by Google’s speech-to-text. If so, it seems that perturbing a video’s audio into an adversarial example before uploading it could slow this down.
Is it possible to reliably generate adversarial examples against a model you don’t have direct access to (Google’s Cloud Speech-to-Text is behind a paywall)? I’ve heard Lex Fridman mention that adversarial examples often transfer across networks, even when their architectures differ.
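
To make the transfer question concrete, here is a toy sketch of what "transfer" would mean. It is not an attack on any real speech system: the data is synthetic, and every name in it (`make_data`, `train_logreg`, `fgsm`, `transfer_demo`) is made up for illustration. The perturbation is crafted with FGSM using only a surrogate logistic regression, then evaluated on a structurally different target (a nearest-centroid classifier trained on separate data) whose parameters the attack never reads. Whether anything like this carries over to a large speech-to-text model is exactly the open question.

```python
import numpy as np


def make_data(n, rng, dim=20, shift=0.5):
    # Two Gaussian blobs: class 1 centered at +shift, class 0 at -shift.
    y = rng.integers(0, 2, size=n)
    X = rng.normal(size=(n, dim)) + shift * (2 * y[:, None] - 1)
    return X, y


def train_logreg(X, y, steps=300, lr=0.1):
    # Surrogate model: logistic regression fit by plain gradient descent.
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        g = p - y
        w -= lr * (X.T @ g) / len(y)
        b -= lr * g.mean()
    return w, b


def fgsm(w, b, X, y, eps):
    # Fast gradient sign method against the surrogate only.
    # For logistic loss, d(loss)/dx = (p - y) * w.
    p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
    grad = (p - y)[:, None] * w[None, :]
    return X + eps * np.sign(grad)


def train_centroids(X, y):
    # Target model: nearest-centroid classifier (a different structure).
    return X[y == 0].mean(axis=0), X[y == 1].mean(axis=0)


def centroid_predict(c0, c1, X):
    d0 = ((X - c0) ** 2).sum(axis=1)
    d1 = ((X - c1) ** 2).sum(axis=1)
    return (d1 < d0).astype(int)


def transfer_demo(eps=0.5, seed=0):
    rng = np.random.default_rng(seed)
    Xs, ys = make_data(2000, rng)  # surrogate's training data
    Xt, yt = make_data(2000, rng)  # target's separate training data
    Xe, ye = make_data(1000, rng)  # held-out inputs to perturb

    w, b = train_logreg(Xs, ys)
    c0, c1 = train_centroids(Xt, yt)

    X_adv = fgsm(w, b, Xe, ye, eps)  # never touches c0 or c1

    clean_acc = float((centroid_predict(c0, c1, Xe) == ye).mean())
    adv_acc = float((centroid_predict(c0, c1, X_adv) == ye).mean())
    return clean_acc, adv_acc


if __name__ == "__main__":
    clean_acc, adv_acc = transfer_demo()
    print(f"target accuracy, clean inputs:     {clean_acc:.3f}")
    print(f"target accuracy, perturbed inputs: {adv_acc:.3f}")
```

In this toy setup the two models learn nearly the same decision boundary, so the perturbation transfers easily. Real speech pipelines add compression, resampling, and much larger models, so this only shows the mechanism, not that it would work in practice.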