Simple Example Of Speech To Text

A walk-through example of how you can apply Speech To Text in Python

Speech recognition (or Speech To Text) is still far from perfect. However, the SpeechRecognition library provides an easy way to interact with many speech-to-text APIs. In this post, we will show how to use the Python SpeechRecognition library to easily start converting the spoken language in our audio files to text.

SpeechRecognition is a library for performing speech recognition, with support for several engines and APIs, online and offline.

Speech recognition engine/API support:

For our example we will use the recognize_google, however there are also some other choices like recognize_bing(), recognize_wit(). The audio .wav file that we are going to use for this example can be found here. Note that the recognize_google allows 50 free calls per day.

# Importing the speech_recognition library
import speech_recognition as sr# Create an instance of the Recognizer class
recognizer = sr.Recognizer()# Set the energy threshold
recognizer.energy_threshold = 300# Convert audio to AudioFile
clean_support_call = sr.AudioFile("staytuned.wav")# Convert AudioFile to AudioData
with clean_support_call as source:
clean_support_call_audio = recognizer.record(source)# Transcribe AudioData to text
text = recognizer.recognize_google(clean_support_call_audio,
language="en-US")
print(text)

And the output we get is:

hello everybody today we are going to talk about speech-to-text stay tuned

1. 8 Proven Ways to Use Chatbots for Marketing (with Real Examples)

2. How to Use Texthero to Prepare a Text-based Dataset for Your NLP Project

3. 5 Top Tips For Human-Centred Chatbot Design

4. Chatbot Conference Online

Sometimes, we have to deal with noisy audio files. We can use the adjust_for_ambient_noise() function of Recognizer to negate the background noise. We will use this audio text for our example.

# Importing the speech_recognition library
import speech_recognition as srrecognizer = sr.Recognizer()# Convert audio to AudioFile
noisy_support_call = sr.AudioFile("2-noisy-support-call.wav")# Record the audio from the noisy support call
with noisy_support_call as source:
# Adjust the recognizer energy threshold for ambient noise
recognizer.adjust_for_ambient_noise(source, duration=0.5)
noisy_support_call_audio = recognizer.record(noisy_support_call)# Transcribe the speech from the noisy support call
text = recognizer.recognize_google(noisy_support_call_audio,
language="en-US")print(text)

And the output that we get is:

hello I'd like to get to help setting up my account please

That was a simple reproducible example of how you can easily convert Text-To-Speech. In the following posts, we will give more examples. Feel free to send us your preferences about the new posts.

Source: https://chatbotslife.com/simple-example-of-speech-to-text-a86fff4ab626?source=rss—-a49517e4c30b—4

Generative Data Intelligence

A walk-through example of how you can apply Speech To Text in Python

ESL Challenger Melbourne 2024: Scores, standings, and more – Snowball Esports

2024 NFL Draft Grades

Latest Intelligence

Ethereum, Solana and Altcoins Approaching ‘Banana Zone,’ According to Macro Guru Raoul Pal – Here’s His Outlook – The Daily Hodl

NASA astronauts arrive at Kennedy Space Center ahead of Boeing Starliner Crew Flight Test

UK Law Enforcement Agencies Can Now Seize Crypto More Easily as New Rules Take Effect

Bitcoin Bears Risk Losing $7.2 Billion If BTC Price Reaches This Level

Meme coin Dog Go To The Moon surpasses $500 million market cap

Why Many ‘Zombie Blockchains’ Still Have Market Caps in the Billions of Dollars – Unchained

Chat with us