Offline Speech Recognition

Ilgar Lunin - 2021/09/20

Accurate offline speech recognition

  • 지원 플랫폼
  • 지원 엔진 버전
    4.25 - 4.27
  • 다운로드 유형
    엔진 플러그인
    이 제품은 코드 플러그인과 함께 미리 빌드한 바이너리와 언리얼 엔진에 통합되는 모든 소스 코드를 포함하고 있으며 원하시는 엔진 버전에 설치하여 프로젝트에 따라 활성화할 수 있습니다.

Allows you to recognize speech from more than 15 languages, without relying on any cloud service or subscription. Instead, a language server is a separate process on your machine, which talks with your game. The language server app is public ( https://github.com/IlgarLunin/vosk-language-server ), you can fork it and customize, distribute with your game, run it without any user interface.


Unreal engine client is dead simple communication with language server. It connects to it, records, and feeds your voice to the language server, the server sends recognized voices as text back to unreal.


This is streaming voice recognition, and you can implement simple conversations with your NPC without any user input except voice. "Ok robot, do this", "Ok robot, do that" etc.


Download latest language server: https://github.com/IlgarLunin/vosk-language-server/releases


Video demonstration: https://youtu.be/iJVCsuuC5A4

Example project: here (Unreal Engine 4.27)

기술적 세부사항

Features:

  • No dependencies on other paid cloud services
  • One time payment
  • The server can handle multiple clients at the same time
  • Easy to setup
  • No internet required


Code Modules:

  • VoskPlugin (Runtime)


Number of Blueprints: 0

Number of C++ Classes: 1

Network Replicated: No

Supported Development Platforms: Windows, Mac, Linux

Supported Target Build Platforms: Windows, Mac, Linux

Documentation: https://github.com/IlgarLunin/VoskPlugin-docs

Discord: https://discord.gg/Tkf7xe2