Converting real-time audio to phonemes - Stack Overflow
The way I would approach this is to get the words from the audio using Whisper or a similar STT service (the Python SpeechRecognition library is the go-to at the moment), then use the CMU Pronouncing Dictionary to provide phonemes for each word. The phonemes are given in the CMU dictionary's ARPABET notation - for example DH for the ð phoneme, the th sound in "this" and "that".
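As a rough illustration of that pipeline, the sketch below uses the openai-whisper and cmudict packages (an assumption about which specific libraries the answer means); the file name is a placeholder and Whisper needs ffmpeg installed.

```python
# A minimal sketch, assuming the openai-whisper and cmudict packages
# (pip install openai-whisper cmudict); "speech.wav" is a placeholder.
import whisper
import cmudict

arpabet = cmudict.dict()                 # word -> list of ARPABET pronunciations

model = whisper.load_model("base")       # any Whisper model size works
text = model.transcribe("speech.wav")["text"]

for word in text.lower().split():
    word = word.strip(".,!?")
    pronunciations = arpabet.get(word, [])
    if pronunciations:
        print(word, pronunciations[0])   # e.g. "that" -> ['DH', 'AE1', 'T']
    else:
        print(word, "(not in CMU dict)")
```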
Extracts phoneme sequences from speech audio files using PyTorch
Use the 'audio_to_phoneme.py' file to train a feature extractor, tokenizer, and model from scratch for converting wav audio files into phoneme sequences. Make sure Python 3.8.7 is installed, then run the following two shell commands:
GitHub - liukuangxiangzi/audio2viseme: The code generates phonemes from ...
You can run audio_feature_extractor.py to extract audio features from audio files. The arguments are as follows:
-i --- Input folder containing audio files (if your audio file types are different from .wav, please modify the script accordingly)
-d --- Delay in terms of frames, where one frame is 40 ms
-c --- Number of context frames
GitHub - crim-ca/speech_to_phonemes: Example data to use the speech_to ...
The speech_to_phonemes service is a phonetic transcriber that can be trained to transcribe an audio speech recording into a time-aligned list of IPA phonemes. Its goal is to allow someone to automatically do phonetic transcription of large amounts of audio recordings from a single speaker, starting with only a small amount of already ...
Wav2Vec2Phoneme - Hugging Face
Overview. The Wav2Vec2Phoneme model was proposed in Simple and Effective Zero-shot Cross-lingual Phoneme Recognition (Xu et al., 2021) by Qiantong Xu, Alexei Baevski, and Michael Auli. The abstract from the paper is the following: Recent progress in self-training, self-supervised pretraining and unsupervised learning enabled well-performing speech recognition systems without any labeled data.
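For context, a minimal inference sketch with the transformers library is shown below, using the facebook/wav2vec2-lv-60-espeak-cv-ft checkpoint referenced in the Hugging Face docs for Wav2Vec2Phoneme; the audio path is a placeholder and the recording must be resampled to 16 kHz.

```python
# A minimal sketch of phoneme recognition with a Wav2Vec2Phoneme checkpoint;
# "speech.wav" is a placeholder.
import torch
import librosa
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

checkpoint = "facebook/wav2vec2-lv-60-espeak-cv-ft"
processor = Wav2Vec2Processor.from_pretrained(checkpoint)
model = Wav2Vec2ForCTC.from_pretrained(checkpoint)

audio, _ = librosa.load("speech.wav", sr=16000)   # model expects 16 kHz mono
inputs = processor(audio, sampling_rate=16000, return_tensors="pt")

with torch.no_grad():
    logits = model(inputs.input_values).logits

ids = torch.argmax(logits, dim=-1)
print(processor.batch_decode(ids)[0])             # space-separated phoneme symbols
```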
Converting real-time audio to phonemes - CSDN Blog
Fetch the real-time audio using a microphone; detect the current phoneme that is being pronounced from the audio. I have tried looking everywhere for an example or library that could solve this type of problem. Most libraries don't seem to output phonemes from audio.
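A sketch of the real-time half of the question is below, using the sounddevice package for microphone capture and the same phoneme checkpoint as in the Wav2Vec2Phoneme entry above; the chunk length is an illustrative choice, and true streaming would need overlapping windows rather than blocking records.

```python
# A rough sketch only: capture short microphone chunks and decode phonemes per chunk.
import sounddevice as sd
import torch
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

SAMPLE_RATE = 16000       # the model expects 16 kHz mono audio
CHUNK_SECONDS = 2         # illustrative window length

processor = Wav2Vec2Processor.from_pretrained("facebook/wav2vec2-lv-60-espeak-cv-ft")
model = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-lv-60-espeak-cv-ft")

while True:
    # blocking record of one chunk from the default microphone
    chunk = sd.rec(int(CHUNK_SECONDS * SAMPLE_RATE), samplerate=SAMPLE_RATE,
                   channels=1, dtype="float32")
    sd.wait()
    inputs = processor(chunk.squeeze(), sampling_rate=SAMPLE_RATE, return_tensors="pt")
    with torch.no_grad():
        logits = model(inputs.input_values).logits
    ids = torch.argmax(logits, dim=-1)
    print(processor.batch_decode(ids)[0])   # phonemes heard in this chunk
```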
speechbrain/soundchoice-g2p - Hugging Face
SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation. This repository provides all the necessary tools to perform English grapheme-to-phoneme conversion with a pretrained SoundChoice G2P model using SpeechBrain. It is trained on LibriG2P training data derived from LibriSpeech Alignments and Google Wikipedia. To use it, first install SpeechBrain.
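The model card's usage boils down to a few lines; a sketch is below (the import path moved from speechbrain.pretrained to speechbrain.inference.text in newer SpeechBrain releases, so adjust to your version).

```python
# A minimal sketch following the soundchoice-g2p model card; pip install speechbrain.
# In older SpeechBrain releases the import is:
#   from speechbrain.pretrained import GraphemeToPhoneme
from speechbrain.inference.text import GraphemeToPhoneme

g2p = GraphemeToPhoneme.from_hparams(
    source="speechbrain/soundchoice-g2p",
    savedir="pretrained_models/soundchoice-g2p",
)
print(g2p("To be or not to be, that is the question"))   # list of phoneme symbols
```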
Software to translate audio to phonemic transcription
Note that "phoneme" in these instances is more akin to how we define phones or allophones in linguistics. Praat's version of this will suffer two main limitations: 1) the grapheme to phoneme system used to create the speech may not match the phones present in your recording and 2) the synthetic speech may not be a very good match.
Any software that labels a WAV file with phonemes
I have a WAV file containing a subject's speech. The subject speaks one sentence at a time, followed by a short period of silence. I'm interested in analyzing the phonemes of that speech and the time at which each phoneme occurs. For instance, I am looking for something like this: 6.5-6.8 'AE' 6.8-7.0 'NG'. Is there any software that supports such a thing?
GitHub - m-bain/whisperX: WhisperX: Automatic Speech Recognition with ...
Phoneme-Based ASR: a suite of models finetuned to recognise the smallest unit of speech distinguishing one word from another, e.g. the element p in "tap". A popular example model is wav2vec 2.0. Forced Alignment refers to the process by which orthographic transcriptions are aligned to audio recordings to automatically generate phone level ...
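A rough sketch of that two-stage pipeline (Whisper transcription, then forced alignment against a wav2vec2 phoneme model) is below, based on the whisperX README; the file name is a placeholder and argument names may differ slightly between versions.

```python
import whisperx

device = "cpu"                                    # or "cuda"
audio = whisperx.load_audio("speech.wav")

# stage 1: Whisper transcription
model = whisperx.load_model("base", device, compute_type="int8")
result = model.transcribe(audio)

# stage 2: forced alignment with a phoneme ASR model for precise timestamps
align_model, metadata = whisperx.load_align_model(
    language_code=result["language"], device=device
)
aligned = whisperx.align(result["segments"], align_model, metadata, audio, device)
print(aligned["segments"])                        # segments with per-word timings
```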
Phonemes Conversion - DeepSpeech - Mozilla Discourse
Oh sorry, yes, Gentle gives the timing of words and phonemes. I meant that it doesn’t tell you what phonemes were actually said. It also outputs how certain it was. But @naveen is asking about taking two audio files, converting them to phonemes, and then comparing the output. Gentle needs a transcript and an audio file.
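For reference, Gentle is typically driven over HTTP once its server is running (port 8765 by default); the sketch below mirrors the curl example in Gentle's README using the requests package, with placeholder file names.

```python
# A minimal sketch: POST an audio file plus its transcript to a running Gentle server.
import requests

with open("transcript.txt") as f:
    transcript_text = f.read()

with open("speech.wav", "rb") as audio:
    resp = requests.post(
        "http://localhost:8765/transcriptions?async=false",
        data={"transcript": transcript_text},   # transcript sent as a form field
        files={"audio": audio},                 # audio sent as a file upload
    )

print(resp.json())   # word timings, each with its aligned phones
```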
Speech to Text (ASR) Free Online - Voicv
Convert audio files, record live speech, and get highly accurate transcriptions in multiple languages. ... breaking them down into phonemes (the smallest units of speech), and matching these patterns to their corresponding text using sophisticated neural networks. ... Our speech recognition technology processes audio faster than real-time ...
Extracting Phonemes - Asymptotic Labs
Extracting Phonemes From Speech Samples. My best single model for the recent speech recognition Kaggle competition was based on the idea of extracting a probabilistic map of the phonemes present in a particular speech sample and then using that phoneme map as a feature set to predict the word.
Automatic Phoneme Recognition - Papers With Code
Automatic Phoneme Recognition (APR) involves converting spoken language into a sequence of phonemes, which are the distinct units of sound that distinguish one word from another in a given language. It is designed to transcribe spoken words into their textual phonetic representations in real-time, enabling detailed analysis of speech patterns, pronunciation, and linguistic nuances.
GitHub - ASR-project/Multilingual-PR: Phoneme Recognition using pre ...
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we specifically compared three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021), and WavLM (2022), pretrained on a corpus of English speech, which we use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC).
Real-time Speech Transcription with GPT-4o-transcribe and GPT-4o-mini ...
Azure OpenAI has expanded its speech recognition capabilities with two powerful models: GPT-4o-transcribe and GPT-4o-mini-transcribe. These models also leverage WebSocket connections to enable real-time transcription of audio streams, providing developers with cutting-edge tools for speech-to-text applications.
How to easily convert English audio files to IPA (phonetics) with time ...
I want to get from my audio files, typically wavs, the phonemes associated with each sound and a timestamp for those spoken phonemes. I am doing this to make it easier for me to model and rig 3D actors. An example would be the word "Hospital" becoming ...
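The text-to-IPA half of this can be sketched with the phonemizer package and its espeak backend (an assumption about tooling, not something the question specifies); it gives IPA for a transcript but no timestamps, which would still require a forced aligner like the ones discussed above.

```python
# A minimal sketch: grapheme-to-IPA conversion with phonemizer.
# pip install phonemizer   (also requires the espeak-ng system package)
from phonemizer import phonemize

print(phonemize("hospital", language="en-us", backend="espeak", strip=True))
```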
Free online text-to-voice service with realistic voices
Using our free text-to-voice generator, you will have several benefits, including: Enhanced Accessibility: you can transform text into voice in a matter of moments to make your content accessible for audiences with visual impairments or reading challenges. Efficiency in Content Creation: you can significantly save time on manual recordings by quickly converting your text to audio.
GitHub - andabi/deep-voice-conversion: Deep neural networks for voice ...
Convert phase: feed forward to Net2. Run convert.py to get result samples. Check Tensorboard's audio tab to listen to the samples. Take a look at the phoneme distribution visualization on Tensorboard's image tab. The x-axis represents phoneme classes and the y-axis represents timesteps; the first class on the x-axis represents silence.