Essential Skills:
- Strong experience with speech technologies (e.g., Whisper, DeepSpeech, Tacotron, VITS).
- Familiarity with video synthesis and avatar animation (e.g., DeepMotion, NVIDIA Omniverse, D-ID, or similar).
- Proficiency in Python and ML frameworks (e.g., PyTorch, TensorFlow).
- Experience with real-time systems, streaming protocols, and GPU acceleration.
- A creative mindset and passion for building human-centric AI.
- Experience with LLMs and conversational AI frameworks (e.g., Rasa, LangChain).
- Knowledge of emotion detection, prosody modeling, or affective computing.
- Familiarity with Unity, Unreal Engine, or WebGL for avatar rendering.