I’ve been messing around with long form machine generated audio transcriptions. Amazon Transcribe seems to be able to detect multiple speakers but Google’s is more accurate. #AI is so weird.

πŸ”„ 0 πŸ‘ 0 Original URL