PreBuild is a software development company that aims to streamline the software planning and development process for businesses.
A generative AI video/avatar wellness coaching generator
Confidential
10 months
AI
2024
AWS, Stable Diffusion, D-ID API, OpenAI API, Python, NodeJS, ReactJS, FFmpeg, Docker
WellnessAI is a generative AI platform that creates personalized wellness coaching videos and avatars, delivering scalable, custom mental and physical health guidance directly to users.
Delivering personalized coaching videos and avatars at scale raised several challenges:
– Maintaining emotional and contextual consistency in avatar delivery: The system must accurately synchronize the avatar’s visual and emotional expression with the custom-generated coaching text. Since the user writes both the avatar’s description and the program’s content, the lack of fine-grained control over the avatar’s emotional tone or lip-sync accuracy can lead to a disconnect between the coaching message and the avatar’s empathetic delivery, breaking user trust.
– Managing high inference costs and resource consumption across dual prompting: Generating both a custom visual asset (avatar) and a custom program (text) before rendering the final video involves multiple, consecutive, computationally expensive service calls. This dual reliance on paid, high-resource services, especially when combined with video processing, creates high variable operational costs and limits the scalability of personalized content production.
– Managing complex, multi-step asynchronous workflow latency for custom assets: The video creation pipeline (avatar prompt → visual asset → program prompt → text → voice → avatar animation → final video merge) involves chaining many high-latency services across different providers. Coordinating these sequential, asynchronous steps while ensuring low latency for content delivery is a significant engineering challenge.
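To make the latency concern concrete, the chain above can be pictured as a series of awaited stages in which each stage consumes the previous stage’s output, so the end-to-end time is at least the sum of the individual stage times. The following is a minimal sketch under stated assumptions, not WellnessAI’s actual code: the stage names mirror the pipeline listed above, while the `fake_stage` helper and the delay values are hypothetical placeholders standing in for real provider calls.

```python
import asyncio
import time

# Stage order taken from the pipeline described above.
# The delays are made-up placeholders, not measured latencies.
STAGES = [
    ("avatar_prompt_to_visual_asset", 0.03),
    ("program_prompt_to_text", 0.02),
    ("text_to_voice", 0.02),
    ("avatar_animation", 0.04),
    ("final_video_merge", 0.01),
]


async def fake_stage(name: str, delay: float, payload: list[str]) -> list[str]:
    """Hypothetical stand-in for a remote call: wait, then append the stage name."""
    await asyncio.sleep(delay)
    return payload + [name]


async def run_pipeline() -> tuple[list[str], float]:
    """Run the stages strictly in order and return (trace, elapsed seconds)."""
    start = time.perf_counter()
    payload: list[str] = []
    for name, delay in STAGES:
        # Each stage needs the previous stage's output, so nothing overlaps.
        payload = await fake_stage(name, delay, payload)
    return payload, time.perf_counter() - start


if __name__ == "__main__":
    trace, elapsed = asyncio.run(run_pipeline())
    print(trace)
    print(f"elapsed: {elapsed:.2f}s")
```

Because every stage waits on the one before it, shortening any single stage (or skipping it through caching) reduces the total directly, which is why per-stage latency matters so much in a chain like this.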
To overcome these challenges, Hola Tech designed a solution with the following key components:
– Implement a Contextual Emotion Synthesis Layer (CESL): Integrate a custom logic layer (Python/NodeJS) that analyzes the emotional sentiment and keywords in the coaching program text returned by the OpenAI API. This layer automatically maps the detected sentiment to the most suitable avatar emotional state before passing instructions to the D-ID API, keeping the generated message consistent with the avatar’s empathetic visual delivery.
– Optimize generative inference: Reduce high variable costs by deploying and fine-tuning lightweight, proprietary Stable Diffusion models on dedicated AWS infrastructure instead of relying heavily on high-cost external APIs for every prompt. Implement aggressive caching for reusable avatar assets and common video segments within AWS S3 to minimize redundant generation cycles and speed up video rendering (FFmpeg).
– Leverage serverless functions and container orchestration for pipeline reliability: Utilize AWS Step Functions or AWS Lambda to orchestrate and manage the complex, asynchronous workflow, providing built-in state tracking and robust retry logic between the sequence of external API calls (D-ID, OpenAI). Containerize the computationally heavy FFmpeg and subsequent Python/NodeJS processing steps using Docker and orchestrate them via AWS ECS or EKS for efficient resource scaling and improved throughput.
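The sentiment-to-emotion mapping described for the CESL can be illustrated with a small keyword-based sketch. This is an assumption-heavy illustration, not the production layer: the emotion labels, the keyword lexicon, and the `pick_avatar_emotion` function are all hypothetical, and a real system would more likely use a trained sentiment model. How the chosen label is passed to the D-ID API is not specified in the source, so it is left out here.

```python
import re
from collections import Counter

# Hypothetical keyword lexicon: each avatar emotion label maps to cue words.
# Both the labels and the words are illustrative assumptions.
EMOTION_KEYWORDS = {
    "calm": {"breathe", "relax", "rest", "sleep", "gentle", "mindful"},
    "energetic": {"workout", "run", "push", "strong", "energy", "challenge"},
    "compassionate": {"stress", "anxious", "grief", "lonely", "overwhelmed", "sad"},
}
DEFAULT_EMOTION = "neutral"


def pick_avatar_emotion(coaching_text: str) -> str:
    """Return the emotion label whose cue words occur most often in the text."""
    words = re.findall(r"[a-z']+", coaching_text.lower())
    counts = Counter()
    for emotion, keywords in EMOTION_KEYWORDS.items():
        counts[emotion] = sum(1 for word in words if word in keywords)
    emotion, hits = counts.most_common(1)[0]
    # Fall back to a neutral delivery when no cue word matched at all.
    return emotion if hits > 0 else DEFAULT_EMOTION


if __name__ == "__main__":
    print(pick_avatar_emotion("Relax your shoulders and rest before you sleep."))
```

Keeping the mapping in a single function means the rest of the pipeline only ever sees one emotion label per coaching script, whichever analysis method sits behind it.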
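The caching idea for reusable avatar assets can be sketched as a content-addressed lookup: identical requests hash to the same key, so the expensive generation step runs only on a cache miss. This is a minimal sketch under assumptions, not the deployed design: `asset_cache_key`, `AvatarCache`, the key layout, and the `model_version` parameter are hypothetical names, and an in-memory dictionary stands in for the S3 bucket mentioned above.

```python
import hashlib
import json


def asset_cache_key(avatar_prompt: str, model_version: str) -> str:
    """Build a deterministic key from the normalized prompt and model version."""
    # Lowercase and collapse whitespace so trivially different prompts match.
    normalized = " ".join(avatar_prompt.lower().split())
    payload = json.dumps({"prompt": normalized, "model": model_version}, sort_keys=True)
    return "avatars/" + hashlib.sha256(payload.encode("utf-8")).hexdigest() + ".png"


class AvatarCache:
    """In-memory stand-in for an object store such as an S3 bucket."""

    def __init__(self, generate):
        self._generate = generate  # expensive generator, called only on a miss
        self._store: dict[str, bytes] = {}
        self.misses = 0

    def get(self, avatar_prompt: str, model_version: str) -> bytes:
        key = asset_cache_key(avatar_prompt, model_version)
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._generate(avatar_prompt)
        return self._store[key]


if __name__ == "__main__":
    cache = AvatarCache(lambda prompt: prompt.encode("utf-8"))
    cache.get("Calm  yoga coach", "v1")
    cache.get("calm yoga coach", "v1")  # same key after normalization
    print(cache.misses)
```

Including the model version in the key is a deliberate choice in this sketch: assets produced by an older model are not served after the model changes.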
– Supported an average daily output of 5,000 personalized coaching videos, scaling up to 10,000 videos during peak utilization
– Increased user engagement by 35% thanks to the hyper-personalized avatar and content generated directly from user prompts
The implemented system successfully met the core goal of providing highly personalized and scalable wellness coaching. The platform has significantly lowered the time and complexity required to create custom, dynamic video content, attracting a rapidly growing user base. WellnessAI has empowered these users to design their ideal coach and program, resulting in over 5,000 unique, personalized coaching avatars and the delivery of more than 100,000 minutes of generated wellness video content to date. This demonstrates WellnessAI’s success in providing an accessible, highly personalized, and efficient solution for scalable digital health and wellness guidance.
Levu is an internal C2C e-commerce platform that aims to facilitate seamless transactions and connections among buyers and sellers.
Verify Trusted is a service that aims to consolidate and showcase a business’s online reputation by combining reviews from various platforms into a single, optimized profile, thereby enhancing trust and online visibility.
MadMAX is an interactive game on Telegram that aims to deliver an exhilarating endless runner experience, challenging players to collect unique in-game currency in a fast-paced and unpredictable environment.