a16z
August 13, 2025

This Week in AI: GPT-5 Ships, 4o Pulled Back, Grok Imagine Goes Social

This week, hosts Justine and Olivia dive into the collision of raw model power and consumer preference, exploring how social integration, user personality, and new "vibe coding" tools are shaping the next wave of AI applications.

Grok Goes Social

  • "I think what's really cool about it is it's sort of one of the first really truly social forays into AI image and video generation."
  • "It's not the most powerful model... But it's fast. So fast, which I think for a lot of people has been kind of the real barrier to doing image and video generation more seriously."
  • Grok’s new image model, Imagine, is winning not on power but on speed and social integration. It’s embedded directly into X, allowing users to instantly animate any photo on the platform with a long press, making creation seamless and viral.
  • Unlike other models, Grok is “uncensored” and generates images of real people, a key differentiator that unlocks meme generation and other personalized content.
  • This deep integration into an existing social graph feels different from Meta’s siloed AI character experiments, providing a powerful proof point for how AI creative tools can be baked into core social experiences.

The People vs. GPT-5

  • "Of course, classic consumer is like, ‘No, we don't want that. Give us the old toy back.’ Give us our fun friend who kind of mirrored the way we spoke to it and was over the top."
  • "I don't necessarily think the smartest model that scores the best on all these objective benchmarks of intelligence will be the model that people want to chat with."
  • OpenAI’s launch of the technically superior GPT-5 was overshadowed by the simultaneous deprecation of GPT-4o. Consumers quickly revolted, lamenting GPT-5’s lack of the “expressive, emotional, and fun” personality found in its predecessor.
  • The backlash, concentrated on Reddit, was so intense that Sam Altman announced GPT-4o would be brought back for paid users, highlighting a major disconnect between researcher-defined progress and consumer desire.
  • This saga exemplifies that for many users, personality and companionship are more valuable than raw intelligence scores, suggesting a future market for models optimized for engagement over pure performance.

The Vibe Coding Fragmentation

  • "The fact that a non-technical person can build something that thousands of people can use overnight is amazing and so exciting."
  • "There probably should be a platform that is like the training wheels version of vibe coding for non-developers... that does not allow you to make mistakes like exposing the API key."
  • A host’s experiment in “vibe coding” an app to create “selfies with Jensen Huang” demonstrated both the power and pitfalls of current tools. The app, built in hours on Lovable, went viral but exposed the creator’s API key—a risk the platform didn’t flag.
  • Today’s vibe coding platforms are powerful but assume a level of technical knowledge, creating a gap for a truly consumer-grade tool with built-in safety guardrails.
  • The market is set to fragment, moving beyond one-size-fits-all solutions. Expect to see specialized platforms emerge: some for enterprises needing deep integrations and others for consumers who prioritize speed, safety, and virality over technical flexibility.

Key Takeaways:

  • The podcast reveals a critical tension in the AI landscape: the smartest tool isn't always the winning tool. User experience, personality, and seamless integration are becoming decisive factors.
  • Personality Over Performance: For consumer-facing chatbots, an engaging, human-like personality can be more important than benchmark-topping intelligence. The GPT-4o backlash is a clear signal that users want companions, not just oracles.
  • Integration is the Ultimate Feature: The most successful AI tools will be those embedded into existing workflows. Grok’s deep integration into X makes creation frictionless, a model others will likely follow.
  • The AI Tooling Stack is Specializing: One-size-fits-all platforms are a temporary phase. The future of AI development tools, from LLMs to "vibe coders," lies in specialized solutions built for specific user segments and use cases.

For further insights and detailed discussions, watch the full podcast: Link

This episode dissects the critical tension between raw AI model power and user-centric design, revealing how social integration, user backlash, and no-code tools are shaping the next wave of AI investment opportunities.

Grok Imagine: The Social Future of AI Content Generation

  • The discussion begins with Grok's recent updates, focusing on its new image and video model, Imagine. Justine and Olivia note that while Imagine may not be the most powerful model available, its key differentiators are speed and deep integration within the X social platform. This allows users to instantly animate their own or others' photos, creating a seamless social-first AI experience.
    • Speed and Accessibility: Unlike competing models that can take minutes to generate content, Grok Imagine produces images almost instantly and videos quickly, encouraging rapid iteration. Olivia notes, "It's now become in less than a week like my go-to tool for image generation on mobile."
    • Uncensored Nature: The model's ability to generate images of real people, including public figures, unlocks use cases like meme generation that are often blocked by other platforms. This "uncensored" approach is a core part of its appeal.
    • Strategic Implication: Grok's strategy provides a powerful blueprint for how decentralized social (DeSo) platforms can embed AI tools. For investors, the success of this integrated, less-restricted model highlights a competitive advantage against more powerful but siloed AI tools, suggesting a market for AI that prioritizes social context and user freedom over centralized content policies.

The GPT-5 Launch and the Surprising Backlash Over GPT-4o

  • The conversation shifts to the release of GPT-5 and the simultaneous, controversial deprecation of GPT-4o. While GPT-5 demonstrates superior performance on technical benchmarks, particularly in coding, users quickly revolted against the loss of GPT-4o's more expressive and "fun" personality.
    • Performance vs. Personality: Justine breaks down the user backlash, distinguishing between the removal of excessive validation ("glazing") and the loss of the model's engaging, human-like personality. GPT-5 is seen as more sterile and less conversational, lacking the emojis and enthusiastic tone of its predecessor.
    • OpenAI's Reversal: The intense user reaction, especially on platforms like Reddit, led OpenAI's Sam Altman to announce the return of GPT-4o for paid users, acknowledging the community's preference.
    • Justine's Analysis: Her perspective highlights a critical insight for the market. She states, "I don't necessarily think the like smartest model that scores the best on sort of all these objective benchmarks of intelligence will be the model that people want to chat with."
    • Actionable Insight: This event proves that for consumer-facing AI, user experience can trump raw intelligence. This creates a significant market opportunity for specialized, personality-driven models. For Crypto AI researchers and investors, this signals a clear opening for decentralized platforms that can host a variety of fine-tuned models, giving users more control over their AI's "vibe" and behavior.

Navigating the Frontier of AI in Healthcare and Regulation

  • The episode explores the growing, and contentious, role of AI in providing medical advice. OpenAI is leaning into this use case with GPT-5, which was trained to be the top-performing model on Healthbench, a benchmark developed with over 250 physicians to measure an LLM's medical query proficiency.
    • Contrasting Approaches: OpenAI's endorsement of medical use cases is contrasted with recent regulatory pushback, such as an Illinois law banning AI for mental health therapy without professional supervision.
    • Enforceability Questions: Justine expresses skepticism about the law's practicality, questioning how regulators can effectively monitor private user chats with AI models.
    • Strategic Implication: The clash between centralized regulation and advancing AI capabilities highlights a key opportunity for Crypto AI. Privacy-preserving technologies like zkML (Zero-Knowledge Machine Learning), which allows for private verification of AI model inferences, could enable applications to offer secure, personalized health guidance while navigating complex regulatory and liability landscapes. This presents a distinct investment thesis for projects focused on decentralized and private AI.

Google's Genie 3: Generating Interactive Worlds from a Single Prompt

  • The hosts discuss Google's Genie 3, a new interactive world model—an AI system that generates a dynamic, 3D environment from a prompt (text, image, or video) that a user can then explore in real-time. While not yet publicly released, demos show its ability to turn paintings or videos into explorable scenes.
    • Key Use Cases: The discussion outlines three primary applications for this technology:
      1. Controllable Video: Users can "record" a video by moving through the generated 3D world, offering unprecedented control over the final output.
      2. Personalized Gaming: Enables the creation of unique, individual game worlds from a simple prompt, opening a new market for personal entertainment.
      3. Reinforcement Learning (RL) Environments: These dynamic worlds can serve as training grounds for AI agents, helping them learn to navigate and interact with complex environments.
    • Actionable Insight: Genie 3 points directly to the future of metaverse creation. For investors, the opportunity lies not just in world-generation models but in the decentralized infrastructure required to support them. This includes decentralized storage for persistent worlds, on-chain assets (NFTs) for in-world economies, and decentralized compute networks powerful enough to run these demanding models.

ElevenLabs Enters Music with a Fully Licensed Model

  • The conversation covers the launch of a new music generation model from ElevenLabs, a portfolio company of the hosts. The model's most significant feature is that it was trained exclusively on fully licensed music, a critical differentiator in the highly litigious music industry.
    • The IP Moat: Unlike image or text models that often scrape public data, training on licensed music protects enterprise users (media, gaming, advertising) from potential copyright liability.
    • Market Impact: This approach allows businesses to confidently use AI-generated music in commercial projects, from advertisements to films, representing a major step forward for the commercial viability of AI music.
    • Actionable Insight: The emphasis on licensed data underscores the massive intellectual property challenge facing the entire generative AI space. This creates a clear opening for crypto-based solutions, such as on-chain IP registries, automated royalty distribution via smart contracts, and transparent, decentralized marketplaces for training data. These systems can provide the auditable IP foundation that the AI industry needs.

The Vibe Coding Revolution and Its Pitfalls

  • The final segment focuses on vibe coding, a term for using no-code or low-code platforms to rapidly build and deploy AI applications. Olivia shares her firsthand experience building a "Selfie with Jensen" app, which allowed users to generate a photo of themselves with the NVIDIA CEO.
    • Democratization and Danger: The experiment was a success, attracting thousands of users overnight. However, it also exposed critical security flaws in the no-code platform, such as a publicly exposed API key and unprotected photo storage buckets. Olivia's experience demonstrates the power of these tools but also their current immaturity for non-technical users.
    • Market Fragmentation Thesis: Justine and her colleague Anish Acharya published a post arguing that the vibe coding market will fragment. Different platforms will emerge to serve specific user segments, from consumers needing simple, secure tools to enterprise developers requiring deep customization and control.
    • Actionable Insight: The security gaps in current vibe coding platforms represent a major opportunity. There is a pressing need for tools built for the Web3 ecosystem that integrate decentralized identity (DID) and secure key management by default. Investors should watch for specialized no-code platforms that enable users to safely build decentralized applications (dApps) and AI-powered front-ends without exposing sensitive credentials or user data.

Conclusion

This episode reveals a maturing AI market where user experience and specialized applications are beginning to outweigh raw performance. For investors and researchers, the most promising frontiers are the gaps this creates: niche models, secure no-code tools for Web3, and the decentralized infrastructure for IP, compute, and identity.

Others You May Like