[Discussion] How to ‘convert’ pyaudio’s results into useful audio data ?
I’m sorry if the question is too vague (or too stupid). I’m trying to build a very basic speech recognition (for my own learning). I’m using pyaudio to record from the microphone and I’ve managed to convert the bytes to 16bit representations. But I do not know how to move forward with my little project.
I did find bits and pieces of code that seems to do what I’m trying to achieve but doesn’t explain clearly why it does what it does.
For context I’ve previously worked on Image recognition, object detection and similar computer vision projects, but I’m a newb when it comes to audio.
Any help is appreciated.