Skip to content

vlasenkoalexey/human.ai

Repository files navigation

Human.ai

Experimental human like virtual assistant.

A real-time conversational AI system featuring lifelike avatars with synchronized speech, lip movements, and natural interactions.

Overview

Virtual Human is an interactive avatar system that combines text-to-speech (TTS), lip-sync animation, and conversational AI to create realistic digital human interactions. Perfect for customer service, virtual assistants, education, entertainment, and accessibility applications.

Demo

drawing

Core Capabilities

  • Real-time Conversation: Natural language processing for fluid, context-aware dialogue using OpenAI models
  • 3D Avatar Rendering: Customizable virtual human characters with realistic expressions using Ready Player Me
  • Lip Sync Animation: Accurate mouth movements synchronized with speech audio using SALSA LipSync Suite
  • Hybrid Speech-to-Text (STT): First voice it transcribed using lightweight Recognissimo model that also served as a reliable Voice Activity Detection (VAD). Then same audio clip is sent to OpenAI audio transcription API for high quality recogission.
  • Text-to-Speech (TTS): High-quality voice synthesis with multiple voice options using RTVoice (with custom Eleven labs provider)
  • Full emoji support ❤️

Paid components

For obvious reasons paid components are not included

Getting started

  • Purchase and install paid components to the default location
  • Set API key in Assets\OpenAIKey.cs
  • Open Assets\Scenes\Assistant.speech.unity

About

Experimental human like virtual assistant

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published