Speech Recognition

Table of contents :

  • Introduction
  • Code

Introduction:

Speech recognition (SR) is the translation of spoken words into text. It is also known as “automatic speech recognition”, “ASR”, “computer speech recognition”, “speech to text”, or just “STT”. Some SR systems use “training” where an individual speaker reads sections of text into the SR system. These systems analyze the person’s specific voice and use it to fine tune the recognition of that person’s speech, resulting in more accurate transcription. Systems that do not use training are called “Speaker Independent” systems. Systems that use training are called “Speaker Dependent” systems.The term voice recognition refers to finding the identity of “who” is speaking, rather than what they are saying.

Code:

Untitleds

using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.Windows;
using System.Windows.Controls;
using System.Windows.Data;
using System.Windows.Documents;
using System.Windows.Input;
using System.Windows.Media;
using System.Windows.Media.Imaging;
using System.Windows.Navigation;
using System.Windows.Shapes;
using System.Speech.Recognition;

namespace VoiceRecognition_Application
{

public partial class MainWindow : Window
{
private SpeechRecognitionEngine _recognizer = new SpeechRecognitionEngine();
public MainWindow()
{
InitializeComponent();
}

private void buttonRec_Click(object sender, RoutedEventArgs e)

{
_recognizer.SetInputToDefaultAudioDevice();

_recognizer.LoadGrammar(new DictationGrammar());
_recognizer.SpeechRecognized += new EventHandler<SpeechRecognizedEventArgs>(_recognizer_SpeechRecognized);
_recognizer.RecognizeAsync(RecognizeMode.Multiple);

}

void _recognizer_SpeechRecognized(object sender, SpeechRecognizedEventArgs e)
{
foreach (RecognizedWordUnit word in e.Result.Words)
{
listBoxResults.Items.Add(word.Text);
}

}
}
}

Umer Butt(024-bscs-2010)

Comments are closed.