
Voice

OpenHuman supports voice interaction, allowing you to communicate with the agent when typing is inconvenient.

Voice Settings

Enable Voice

  1. Go to Settings → Voice
  2. Enable Voice Interaction
  3. Select input/output devices

Configuration Options

Option          Description
STT Language    Target language for speech recognition
TTS Voice       Voice selection for synthesized speech
Wake Word       Trigger word to activate voice (disabled by default)
Microphone      Select input device
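
As a rough illustration of how these options relate to each other, the sketch below models them as a small Python data class. This is not OpenHuman's actual configuration format: the class name `VoiceSettings`, its field names, and every default other than the disabled wake word are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class VoiceSettings:
    """Hypothetical container mirroring the options in the table above."""

    enabled: bool = False             # "Enable Voice Interaction" toggle (default assumed)
    stt_language: str = "en-US"       # STT Language (value format assumed)
    tts_voice: str = "default"        # TTS Voice
    wake_word: Optional[str] = None   # Wake Word; None means disabled (documented default)
    microphone: Optional[str] = None  # Microphone; None means system default (assumed)

    def wake_word_enabled(self) -> bool:
        # The wake word counts as enabled only when a phrase has been set.
        return self.wake_word is not None


# Example: voice turned on with a different TTS voice, wake word left off.
settings = VoiceSettings(enabled=True, tts_voice="friendly")
```

Representing the wake word as an optional value keeps "disabled by default" explicit: nothing listens for a trigger phrase until one is set.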

Usage

Voice Input

Press and hold the spacebar to speak, then release it when you are done. Your speech is converted to text and sent to the agent.
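
The press-and-hold flow can be pictured as a small state machine: audio is collected only while the key is down, and releasing the key triggers transcription and sending. The sketch below is a minimal model of that idea, not OpenHuman's implementation. `PushToTalk`, `transcribe`, and `send` are hypothetical names, and the speech-to-text step is passed in as a plain function.

```python
from typing import Callable, List, Optional


class PushToTalk:
    """Hypothetical push-to-talk controller: record only while the key is held."""

    def __init__(self, transcribe: Callable[[bytes], str],
                 send: Callable[[str], None]) -> None:
        self._transcribe = transcribe  # stand-in for the speech-to-text step
        self._send = send              # stand-in for delivering text to the agent
        self._chunks: Optional[List[bytes]] = None  # None means "not recording"

    @property
    def recording(self) -> bool:
        return self._chunks is not None

    def key_down(self) -> None:
        # Ignore auto-repeat key events while already recording.
        if self._chunks is None:
            self._chunks = []

    def feed(self, chunk: bytes) -> None:
        # Audio that arrives while the key is up is dropped.
        if self._chunks is not None:
            self._chunks.append(chunk)

    def key_up(self) -> None:
        if self._chunks is None:
            return
        audio = b"".join(self._chunks)
        self._chunks = None
        if audio:
            text = self._transcribe(audio).strip()
            if text:
                self._send(text)
```

A caller would wire `key_down` and `key_up` to the spacebar events and `feed` to the microphone stream. Empty recordings and empty transcripts are discarded so that an accidental tap sends nothing.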

Voice Output

The agent's responses are read aloud using text-to-speech (TTS). You can turn this feature on or off in Settings.

Wake Word (Optional)

When the wake word is enabled, you can say "Hey OpenHuman" to activate voice input without pressing any keys.
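
One simple way to think about wake-word activation is as a check on the start of a transcript: if it begins with the wake phrase, the rest is treated as the request. The sketch below illustrates that text-level check only. It is an assumption for illustration, since the source does not describe how OpenHuman detects the wake word, and `strip_wake_phrase` is a hypothetical helper.

```python
import re
from typing import Optional

WAKE_PHRASE = "Hey OpenHuman"


def _normalize(text: str) -> str:
    # Lowercase and drop everything except letters, digits and spaces.
    return re.sub(r"[^a-z0-9 ]+", "", text.lower()).strip()


def strip_wake_phrase(transcript: str,
                      wake_phrase: str = WAKE_PHRASE) -> Optional[str]:
    """Return the text after the wake phrase, or None if the phrase is absent."""
    words = _normalize(transcript).split()
    wake = _normalize(wake_phrase).split()
    if words[: len(wake)] != wake:
        return None
    return " ".join(words[len(wake):])
```

Normalizing case and punctuation makes the match tolerant of transcripts such as "Hey, OpenHuman!". A transcript that splits the name into two words would not match this simple version.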

Voice Models

OpenHuman provides multiple voice options:

Voice           Style
default         Neutral
friendly        Friendly, warm
professional    Professional, formal
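
The voice table above can be treated as a simple lookup. The following sketch shows one way to validate a requested voice name against it. The fallback to `default` for unknown names is an assumption, and `VOICE_STYLES` and `resolve_voice` are hypothetical names rather than part of OpenHuman.

```python
# Voice names and styles copied from the table above.
VOICE_STYLES = {
    "default": "Neutral",
    "friendly": "Friendly, warm",
    "professional": "Professional, formal",
}


def resolve_voice(name: str) -> str:
    """Return a known voice name, falling back to "default" for unknown input."""
    key = name.strip().lower()
    return key if key in VOICE_STYLES else "default"
```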

Privacy Note

  • Voice input is processed locally
  • Audio buffers are not stored
  • All processing goes through trusted backend services

Troubleshooting

Inaccurate Speech Recognition

  1. Check that the correct microphone is selected
  2. Confirm that your network connection is working
  3. Use voice input in a quiet environment

No Sound from TTS

  1. Check the output device volume
  2. Confirm that the TTS feature is enabled
  3. Try restarting the application

Next Steps