AI

WellnessAI

A generative AI video/avatar wellness coaching generator

WellnessAI 1st image
Client

Confidential

Duration

10 months

Category

AI

Year

2024

Technology

AWS, Stable Diffusion, D-ID API, OpenAI API, Python, NodeJS, ReactJS, FFmpeg, Docker

Introduction

WellnessAI is a generative AI video/avatar wellness coaching generator that creates personalized wellness coaching videos and avatars. This platform delivers scalable, custom mental and physical health guidance directly to users.

Wellness AI 2nd image

Challenges

WellnessAI aims to deliver personalized wellness coaching videos and avatars to users. However, several challenges stand in the way of achieving this goal:

– Maintaining emotional and contextual consistency in avatar delivery: The system must accurately synchronize the avatar’s visual and emotional expression with the custom-generated coaching text. Since the user writes both the avatar’s description and the program’s content, the lack of fine-grained control over the avatar’s emotional tone or lip-sync accuracy can lead to a disconnect between the coaching message and the avatar’s empathetic delivery, breaking user trust.

– Managing high inference costs and resource consumption across dual prompting: Generating both a custom visual asset (avatar) and a custom program (text) before rendering the final video involves multiple, consecutive, computationally expensive service calls. This dual reliance on paid, high-resource services, especially when combined with video processing, creates high variable operational costs and limits the scalability of personalized content production.

– Managing complex, multi-step asynchronous workflow latency for custom assets: The video creation pipeline (avatar prompt → visual asset → program prompt → text → voice→ avatar animation → final video merge) involves chaining many high-latency services across different providers. Coordinating these sequential, asynchronous steps while ensuring low latency for content delivery is a significant engineering challenge.

Wellness AI 3rd image

Solutions

To overcome the challenges, Hola Tech adhered to the best practices. Key components of the solution included:

– Implement a Contextual Emotion Synthesis Layer (CESL): Integrate a custom logic layer (Python/NodeJS) to analyze the emotional sentiment and keywords within the user’s coaching program prompt output (from the OpenAI API). This layer should automatically map the sentiment to the optimal avatar emotional state before passing instructions to the D-ID API, ensuring consistency between the generated message and the avatar’s empathetic visual delivery.

– Optimize generative inference: Reduce high variable costs by deploying and fine-tuning lightweight, proprietary Stable Diffusion models on dedicated AWS infrastructure instead of relying heavily on high-cost external APIs for every prompt. Implement aggressive caching for reusable avatar assets and common video segments within AWS S3 to minimize redundant generation cycles and speed up video rendering (FFmpeg).

– Leverage serverless functions and container orchestration for pipeline reliability: Utilize AWS Step Functions or AWS Lambda to orchestrate and manage the complex, asynchronous workflow, providing built-in state tracking and robust retry logic between the sequence of external API calls (D-ID, OpenAI). Containerize the computationally heavy FFmpeg and subsequent Python/NodeJS processing steps using Docker and orchestrate them via AWS ECS or EKS for efficient resource scaling and improved throughput.

Wellness AI 4th image

Featured numbers

– Supported an average daily output of 5,000 personalized coaching videos, scaling up to 10,000 videos during peak utilization

– Increased user engagement by 35% thanks to the hyper-personalized avatar and content generated directly from user prompts

Results

The implemented system successfully met the core goal of providing highly personalized and scalable wellness coaching. The platform has significantly lowered the time and complexity required to create custom, dynamic video content, attracting a rapidly growing user base. WellnessAI has empowered these users to design their ideal coach and program, resulting in over 5,000 unique, personalized coaching avatars and the delivery of more than 100,000 minutes of generated wellness video content to date. This demonstrates WellnessAI’s success in providing an accessible, highly personalized, and efficient solution for scalable digital health and wellness guidance.

Other Projects