Yesterday I posted about using Azure Cognitive Services to convert text to speech. I also eluded that I’ve been leveraging Cognitive Services to do the conversion from Speech to Text. I detail that in this post.
Just as with the Text to Speech we will need an API key to use Cognitive Services. You can get one from Azure Cognitive Services here.
I created an audio file in Audacity for testing purposes. In my real application it is direct spoken text, but that’s a topic for another time. I set the project rate to 16000hz for the conversion source file then exported the file as a .wav file.
The Script below needs to be updated for your input file (line 2) and your API Key (line 6). Run it line by line in VSCode or PowerShell ISE.
That’s it. Pretty simple once you have a reference script to work with. Enjoy.
Recently under the experimental Azure Functions build Microsoft Developer Advocates have shown enabling Azure Functions…
Today, I’m super excited to finally announce the Beta release of EntraPulse Lite – a…
I'm excited to share some significant authentication enhancements I've contributed to the Lokka MCP Server…
Last month I had the pleasure of speaking at the Sydney event for Global Azure.…
Model Context Protocol (MCP) is a powerful framework that extends AI clients like Claude and…
I've just completed participating in the Azure AI Developer Hackathon that was looking to provide…
This website uses cookies.
View Comments