Demo video: Link

Documentation: Link

Free Demo project (exe): Link


This plugin allows you to recognize speech in 99 languages, just by adding one component to your blueprint, without relying on any separate servers or subscriptions.


The machine learning model used in this plugin is based on OpenAI's Whisper, but has been optimized to run on the ONNX Runtime for best performance and to minimize dependencies.


Accuracy varies by language. See the original Whisper paper for per-language accuracy figures.


Prerequisites for GPU (CUDA) use

To run the plugin on a GPU, you need a supported NVIDIA GPU with the following versions of CUDA and cuDNN installed:

  • CUDA: 11.6
  • cuDNN: 8.5.0.96
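
As a quick sanity check before enabling GPU mode, the installed CUDA toolkit version can be read from the output of `nvcc --version`. The sketch below is not part of the plugin. The function names are hypothetical, and it assumes an exact match on 11.6 is what is needed, since the listing does not say whether other versions work. It does not check cuDNN.

```python
import re
import shutil
import subprocess

# Version stated in the prerequisites above.
REQUIRED_CUDA = (11, 6)


def parse_nvcc_release(nvcc_output):
    """Return (major, minor) parsed from `nvcc --version` text, or None."""
    match = re.search(r"release (\d+)\.(\d+)", nvcc_output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def cuda_matches(nvcc_output, required=REQUIRED_CUDA):
    """True if the reported toolkit release equals the required version.

    Assumption: an exact major.minor match is treated as the requirement.
    """
    return parse_nvcc_release(nvcc_output) == required


if __name__ == "__main__":
    if shutil.which("nvcc") is None:
        print("nvcc not found on PATH; the CUDA toolkit may not be installed.")
    else:
        result = subprocess.run(
            ["nvcc", "--version"], capture_output=True, text=True
        )
        found = parse_nvcc_release(result.stdout)
        if cuda_matches(result.stdout):
            print("CUDA 11.6 detected.")
        else:
            print(f"Expected CUDA 11.6, found: {found}")
```

Running the script from a command prompt prints whether the toolkit on PATH reports release 11.6.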

Technical Details

Features:

  • Real-time transcription from microphone input to text in 99 languages
  • Real-time translation from microphone input to English text
  • Real-time alignment from microphone input to user-specified text

Code Modules:

  • AudioInputSpectrumAnalysis (Runtime)
  • ByteLevelBpeTokenizer (Runtime)
  • CustomizedOnnxRuntime (Runtime)
  • WhisperOnnxModel (Runtime)

Number of Blueprints: 2

Number of C++ Classes: 13+

Network Replicated: No

Supported Development Platforms: Windows 64-bit

Supported Target Build Platforms: Windows 64-bit

Important/Additional Notes:

  • To use the plugin with a GPU, you need to install CUDA 11.6 and cuDNN 8.5.0.96.
ARI
Akiya Research Institute
98.69 
Platforms Windows 64-bit
UE Versions 4.27, 5.0 - 5.2
Tags DEEP LEARNING, AI, SPEECH RECOGNITION, BLUEPRINTS, SPEECH TO TEXT, NEURAL NETWORK, CPP, MACHINE LEARNING
Release date 04.02.2023
