This episode reveals how AI-native characters are moving beyond viral memes to become a new primitive for media, creating transformative opportunities in enterprise, education, and personalized content.
From Viral Memes to Enterprise Adoption
- The conversation begins by exploring how consumer creativity on platforms like Hedra serves as a powerful signal for enterprise applications. Michael Lingelbach, founder of Hedra, notes that viral trends, such as the "talking baby podcast," quickly get adopted by businesses for advertising and content, demonstrating a rapid crossover from consumer experimentation to enterprise use.
- This trend highlights how memes and viral content act as a go-to-market strategy, allowing new technologies to gain significant mindshare quickly.
- Lingelbach argues that memes are a direct manifestation of accelerated creativity, where generative video allows anyone to turn an idea into short-form content in seconds.
- Strategic Insight: Investors should monitor viral consumer AI trends, as they often serve as leading indicators for high-value enterprise use cases. The speed of this adoption cycle is accelerating.
The Rise of Virtual Influencers and Digital Personas
- The discussion shifts to Hedra's core technology, which Lingelbach describes as the first major audio-to-video foundation model for expressive, dialogue-centric characters. This technology has powered the creation of millions of pieces of content, filling a critical gap in the creative stack by making character animation fast, cheap, and flexible.
- This has led to the emergence of "virtual influencers," a trend highlighted in a recent CNBC article. Examples include actor Jon Lajoie's character series (Moses, the baby podcast) and the "Monverse" created by Neural Viz, which centers on distinct character identities.
- Lingelbach emphasizes the concept of "generative acting," which involves embedding personality, consistency, and control into models, distinguishing it from simple media generation.
- The hosts observe two primary use cases: the creation of entirely new characters (like aliens) and real influencers scaling their digital presence without the constant need for filming.
- Quote: Michael Lingelbach explains the importance of this new capability: "There's a lot that goes into imbuing, you know, personality and consistency and control into these models that's specific to performance and video. And so I think that's why you're starting to see these like distinct personalities take off that like aren't actually real people."
Automating Content and New Educational Frontiers
- The conversation highlights the increasing accessibility of automation, where users combine tools like Hedra with research and scripting tools (e.g., DeepResearch) to create autonomous digital personas; a hedged sketch of this pipeline appears at the end of this section. This workflow is being applied to both real and fabricated personalities, with a notable application in education.
- History-focused videos, where historical figures narrate events, are cited as an example of making learning more engaging than static text.
- Lingelbach notes that while Hedra doesn't specifically target education, many education companies build on its API, leveraging the higher engagement rates of video.
- The recent launch of Hedra's real-time interactive video model, which provides a low-latency experience, is poised to fundamentally change how users interact with large language models (LLMs), the AI systems trained on vast text data to understand and generate human-like language.
- This technology brings the concept of an interactive book, as imagined in Neal Stephenson's sci-fi novel The Diamond Age, closer to reality. An interactive persona can read a book to you, answer clarifying questions, and make the experience richer than a static audiobook.
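To make the automation workflow above concrete, here is a minimal sketch of the three-stage pipeline (research/scripting, then voice synthesis, then audio-driven character video). All endpoints and function names are illustrative placeholders, not Hedra's or any vendor's actual API.

```python
import requests

# Illustrative endpoints only; not any vendor's real API.
LLM_URL = "https://llm.example.com/v1/complete"
TTS_URL = "https://tts.example.com/v1/speech"
VIDEO_URL = "https://video.example.com/v1/animate"


def write_script(topic: str) -> str:
    """Research/scripting stage (a DeepResearch-style agent would slot in here)."""
    resp = requests.post(
        LLM_URL,
        json={"prompt": f"Write a 60-second in-character monologue about: {topic}"},
    )
    resp.raise_for_status()
    return resp.json()["text"]


def voice_script(script: str, voice_id: str) -> bytes:
    """Text-to-speech stage (e.g., a partner voice model)."""
    resp = requests.post(TTS_URL, json={"text": script, "voice": voice_id})
    resp.raise_for_status()
    return resp.content


def animate_character(audio: bytes, portrait: bytes) -> bytes:
    """Audio-to-video stage: the audio track conditions lip sync and gesture."""
    resp = requests.post(VIDEO_URL, files={"audio": audio, "image": portrait})
    resp.raise_for_status()
    return resp.content


def produce_episode(topic: str, voice_id: str, portrait: bytes) -> bytes:
    """Chain the stages into one autonomous-persona pipeline."""
    return animate_character(voice_script(write_script(topic), voice_id), portrait)
```

Scheduling `produce_episode` on a timer is essentially all it takes to turn this into the kind of autonomous digital persona the episode describes.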
Hedra’s Technical Focus: Characters as a Primitive
- The discussion delves into Hedra's specific market positioning. While many companies are building general-purpose video models, Hedra has deliberately scoped its focus to dialogue-centric, controllable characters. Lingelbach explains that the "search space" for video is immense, making specialization crucial.
- He draws a parallel to the LLM space, where different models have specialized: Claude for coding, OpenAI for general assistance, and Gemini for enterprise. Hedra aims to be the leader in making characters feel alive and engaging.
- Quote: One of the hosts sums up Hedra's core unit of focus: "It sounds like you almost define your primitive as a character rather than a video."
- This "character-first" approach requires vertical integration, focusing not just on the core rendering model but also on the user experience (UX) for programming a personality and interacting with the character. This necessitates collecting rich user feedback on performance and feel, which guides model development.
The Founder’s Dilemma: Building a Unified Experience
- Lingelbach, despite his background as a computer science PhD student, advocates for a user-centric approach over a purely technology-driven one. He explains that the decision to integrate partner models, like ElevenLabs for voice synthesis, was driven by the need to solve the customer's core problem holistically.
- The initial pain point Hedra addressed was the difficulty of putting realistic dialogue into media. The goal was to create a holistic model where audio conditioning drives the animation, from lip movements to breathing and gestures; a toy illustration of this idea appears at the end of this section.
- Lingelbach emphasizes the importance of building "deep for people that really care," such as marketers and content creators who have a high-quality bar and need an intuitive interface for a complex task like defining a personality.
- Strategic Insight: In a fragmented generative media landscape, companies that curate the best end-to-end user experience, even by integrating third-party models, can build a stronger moat than those focused solely on their own technology stack. This user-centricity is a key differentiator.
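The audio-conditioning idea above can be illustrated with a toy example: deriving per-frame animation driving signals directly from a waveform. A real holistic model learns this mapping end to end; this sketch only shows which signals a single audio track can plausibly drive, and the thresholds are made up.

```python
import numpy as np


def animation_signals(audio: np.ndarray, sample_rate: int) -> dict:
    """Toy illustration of audio-conditioned animation: chop a mono waveform
    into 30 fps frames and derive driving signals from per-frame loudness."""
    frame = sample_rate // 30                     # samples per animation frame
    n = len(audio) // frame
    frames = audio[: n * frame].reshape(n, frame)
    energy = np.sqrt((frames ** 2).mean(axis=1))  # loudness per frame
    return {
        # louder speech -> wider mouth opening (crude lip-sync proxy)
        "mouth_open": energy / (energy.max() + 1e-8),
        # smoothed loudness -> broader gesture intensity
        "gesture_intensity": np.convolve(energy, np.ones(15) / 15, "same"),
        # near-silence -> opportunities for breaths and pauses
        "breath": energy < 0.05 * energy.max(),
    }
```

Nothing here resembles production quality; the point is that one audio track carries enough structure to drive mouth, gesture, and breathing simultaneously, which is why audio conditioning is a natural backbone for dialogue-centric animation.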
The Future of Modality: Monolithic vs. Composable
- The conversation explores whether AI characters will eventually be generated by a single, monolithic, omnimodal model. Lingelbach believes the trend is moving toward mixed-modality paradigms, primarily because users demand granular control.
- Unbundling inputs (e.g., separate controls for voice, timing, and visuals) allows creators to fine-tune specific elements, which is difficult with a single prompt in a monolithic model.
- He predicts the future is "omnimodal input, omnimodal output," where a model can handle any type of data for both input and output, but with guidance signals that allow the user to control how closely the model adheres to each input modality (a hedged sketch of such per-modality guidance follows the quote below).
- Quote: Lingelbach on the control trade-off: "The more control we surrender over to these very large data-driven models, the more priors we can bake in to make that easier... But at the same time, the more control the user has to kind of surrender."
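One way to read the "guidance signals" idea is as a multi-condition analogue of classifier-free guidance: the model predicts once unconditionally and once per input modality, and the user chooses how hard to push the output toward each condition. The sketch below is a hypothetical control surface under that assumption, not any shipping model's API.

```python
from dataclasses import dataclass

import numpy as np


@dataclass
class GuidanceWeights:
    """How strongly the output should adhere to each input modality.
    0.0 ignores a signal entirely; larger values follow it more literally."""
    text: float = 1.0    # prompt adherence
    audio: float = 1.0   # lip-sync / timing adherence
    image: float = 1.0   # identity / appearance adherence


def guided_prediction(cond: dict[str, np.ndarray],
                      uncond: np.ndarray,
                      w: GuidanceWeights) -> np.ndarray:
    """Classifier-free-guidance-style mixing: start from the unconditional
    prediction and push toward each single-condition prediction by its weight.
    `cond` maps a modality name ("text", "audio", "image") to the model's
    prediction conditioned on that input alone."""
    out = uncond.copy()
    for modality, pred in cond.items():
        out += getattr(w, modality) * (pred - uncond)
    return out
```

Under this framing, a creator who wants exact lip sync but a loose visual interpretation would set a high `audio` weight and a low `text` weight, which is precisely the unbundled control Lingelbach describes.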
The Challenge of Real AI Actors and Authenticity
- The hosts ask about the potential for "real AI actors" that a user could direct in real-time. Lingelbach believes this is inevitable but identifies the current bottleneck as the authenticity of LLMs, not the video models.
- He argues that while LLMs are powerful assistants, they often lack the nuanced, adaptable personality required for authentic character performance. The underlying "LLM feel" is still detectable.
- Significant work is needed on building more configurable and believable personalities within generative text models to achieve the vision of truly interactive digital actors.
- The conversation also touches on the rapid quality improvements in all modalities (video, audio, text), creating a continuous race where the bottleneck for realism is constantly shifting.
Personalization at Scale: The New Media Paradigm
- The discussion critiques the idea of hyper-personalization, like a "personal Netflix," arguing that media is an inherently social experience. The hosts and Lingelbach converge on the concept of "personalization at scale" as a more valuable paradigm.
- Instead of content for an audience of one, generative AI enables the creation of tailored content for targeted interest groups (e.g., a specific subreddit community, local news audiences).
- This allows for the revival of formats like local news, where one person can run an entire channel, or the creation of professional-looking content for niche global communities.
- Quote: Lingelbach on the social aspect of media: "I actually don't want a personal Netflix because I feel like there's something inherently social about a lot of consumption experiences... it's like the shared human experience that you lose a little bit."
The Founder’s Grind: Leadership and Prototyping
- Lingelbach offers a candid perspective on the founder's journey, describing it as "type two fun"—an experience that is grueling in the moment but ultimately fulfilling. He stresses that being a founder requires an obsessive drive and a willingness to lead by example, especially in a demanding environment.
- He shares his hands-on leadership style, which includes "vibe coding": rapid prototyping, often with AI assistance, to build functional mockups of new features.
- This practice serves two purposes: it helps him develop his own conviction about a product direction and provides a clear, functional artifact to communicate his vision to the team, short-circuiting ambiguity.
- He emphasizes that this approach only works if a leader has earned the team's respect by being "in the trenches" with them and is plugged in enough to know when such a prototype is helpful rather than disruptive.
Conclusion: The Future is Co-Creating with AI
- The episode underscores that the next frontier for AI media is not just generating content but co-creating it with intelligent systems. For investors and researchers, the key is to track companies that are building AI-native products focused on user-centric, character-driven primitives and rapid, iterative development.