Runtime Speech Recognizer

Cross-platform offline speech recognition plugin for Unreal Engine. Convert speech to text with advanced Whisper AI technology across all platforms without internet connection.

UE 4.27 - 5.5
Blueprints & C++
All platforms supported
95+ languages supported

Advanced Speech Recognition Made Simple

Runtime Speech Recognizer provides a comprehensive system for real-time speech recognition, featuring offline processing powered by Whisper OpenAI technology. From recognizing commands to transcribing full conversations, handle all your speech recognition needs with a single solution that works entirely offline.

Offline Processing

Works entirely on-device without requiring internet connection

Multiple Language Models

Choose from Tiny, Base, Small, Medium, or Large models to balance accuracy and performance

Multilingual Support

Recognize speech in over 95 languages with automatic language detection

GPU Acceleration

Vulkan-based acceleration on Windows for significantly faster recognition

Video Tutorial
Complete tutorial for setting up lip sync

Key Features

Streaming Recognition

Process audio in real-time as it's being captured, ideal for interactive applications and voice commands.

Streaming recognition nodes
Basic streaming recognition setup

Non-Streaming Recognition

Process complete audio files or buffers in a single operation for maximum accuracy.

Non-streaming recognition nodes
Non-streaming recognition from audio file

Voice Activity Detection Integration

Combine with Voice Activity Detection for optimal recognition of speech segments and command-based interfaces.

VAD integration nodes
Voice-activated command recognition with VAD

Technical Details

Recognition Engine

Powered by Whisper OpenAI technology, specifically the optimized whisper.cpp implementation, providing state-of-the-art speech recognition with:

On-device processing with no data sent to external servers
Vulkan-based GPU acceleration on Windows
CPU + intrinsics acceleration on all other platforms
Multilingual support with 95+ languages
Translation capabilities to convert any supported language to English

Audio Input Requirements

PCM Format

Floating point 32-bit interleaved PCM audio format

Compatible Sources

Works with any audio source that can provide PCM data, including Runtime Audio Importer

Streaming Support

Process audio as it's being captured or in complete chunks

VAD Recommended

Voice Activity Detection recommended for streaming scenarios

Platform Support

Windows
Mac
Linux
Android
iOS
Meta Quest
PlayStation
Xbox
Nintendo Switch

Powerful Integrations

Runtime Speech Recognizer works seamlessly with other plugins to create complete voice interaction solutions for your Unreal Engine projects.

Runtime Audio Importer

Capture microphone input with Voice Activity Detection to provide clean audio segments for optimal speech recognition performance.

Learn more

Runtime MetaHuman Lip Sync

Create realistic lip sync for MetaHuman characters that speak the responses to recognized speech, enabling natural conversational interfaces.

Learn more

Runtime Text To Speech

Complete the voice interaction loop by generating spoken responses to recognized speech commands with offline TTS technology.

Learn more

Runtime AI Chatbot Integrator

Process recognized speech with AI models from OpenAI, Claude, or DeepSeek to create intelligent conversational agents in your applications.

Learn more

Complete Voice Interface Solution

Combine Runtime Audio Importer for voice capture, Runtime Speech Recognizer for speech-to-text, Runtime AI Chatbot Integrator for intelligent responses, and Runtime Text To Speech for voice output to create a fully functional voice assistant or NPC conversation system.

Documentation & Support

Get started quickly with our detailed documentation and receive support through multiple channels. From basic usage to advanced speech recognition techniques, we're here to help you succeed.

Comprehensive Documentation

Step-by-step guides for all features

Video Tutorial

Visual guide to get you started quickly

Discord Community

Get real-time help from developers and users

Email Support

Direct contact for custom development needs

Speech recognition with Voice Activity Detection
Example of voice-activated command recognition with VAD

Demo Project Included

Get started quickly with the included demo project that showcases all the key features of Runtime Speech Recognizer.

Demo Features

  • Real-time microphone capture with voice activity detection
  • Streaming speech recognition with visual feedback
  • Language selection from 95+ supported languages
  • Command recognition with fuzzy matching
  • Performance optimization controls
  • Complete Blueprint implementation
Demo Project Screenshot

Extensive Language Support

Runtime Speech Recognizer supports over 95 languages, making it suitable for global applications.

English
En
Chinese
Zh
Spanish
Es
Russian
Ru
French
Fr
German
De
Japanese
Ja
Korean
Ko
Arabic
Ar
Italian
It
Portuguese
Pt
Hindi
Hi

Additional Languages

Turkish, Polish, Dutch, Swedish, Finnish, Vietnamese, Ukrainian, Greek, Czech, Romanian, Danish, Hungarian, Norwegian, Thai, Croatian, Bulgarian, Lithuanian, Welsh, Slovak, Persian, Latvian, Bengali, Serbian, Slovenian, Estonian, and many more.

View complete list of 95+ supported languages

Add Speech Recognition to Your Project

Enable natural voice interaction in your applications with offline, cross-platform speech recognition. From voice commands to full conversation transcription, unlock new possibilities for user interaction.