Weights & Biases
December 16, 2025

Humanoid Robots: Hype vs Reality in 2026 | Rerun CEO

The single most important argument is that while AI in robotics (especially manipulation) has made incredible progress in the last year, moving from impossible to boring for tasks like laundry folding, the real challenge and opportunity lies not just in algorithmic breakthroughs but in building robust, scalable data infrastructure and practical product systems that can handle the long tail of weird small variations of the physical world. The gap between impressive demos and deployable products is a systems problem, not just an AI problem.

The State of Embodied AI: From Lab to (Near) Reality:

  • "We're just really seeing an incredible progress in sort of ability to do quite advanced manipulation. This has been a really difficult problem in robotics for a long time... folding laundry has kind of gone from being impossible to sort of boring over the last year basically."
  • "I think there are different like kinds of robotics companies. So I think there's a lot of that and the ones that are more maybe swinging for the fences directly... you see these incredible demos and then yeah it's actually quite far from a product."

The Data Infrastructure Gap in Robotics:

  • "For LLM data that's like yeah text and normal things that you can use like robust well-proven storage formats like parquet, iceberg and things like that and high-scale data processing engines like spark... But for robotics like none of that exists basically. So they're all the data formats are built for a world before machine learning."
  • "You end up having to put in these file formats that are optimized for saving logs... The way you answer all these questions as you write a huge parallel job that reads the file into memory... it's just like a huge amount of work where it should just be like a SQL query or something like that."

The Practicalities of Productizing Robotics:

  • "At the end of the day you're not it's not just a robot that can do a task. You need to build a product, right? So I think that distance and this whole systems aspect of like deploying robots is like is very difficult."
  • "The thing that humanish form factors have going for them is data collection... you could imagine maybe it's more efficient to have four arms, but it's really hard to have a human control four arms at the same time."

Key Takeaways:

  • Strategic Shift: The next frontier in robotics is less about pure algorithmic breakthroughs and more about building robust, scalable data infrastructure and full-stack product systems that can handle the messy physical world.
  • Builder/Investor Note: Prioritize companies solving the "boring" but critical data and systems problems. Look for practical, "scrappy" companies deploying robots in specific industrial niches, rather than just those with flashy, general-purpose demos.
  • The "So What?": The gap between impressive demos and deployable products will narrow over the next 6-12 months as data pipelines mature and product-focused companies gain traction. Expect to see more robust, self-correcting robots performing longer, more complex tasks in controlled environments.

For further insights and detailed discussions, watch the podcast: Podcast Link

This episode dissects the rapid advancements in AI-driven robotics manipulation against the stark reality of product deployment, exposing critical data infrastructure gaps hindering widespread adoption.

Rerun.AI: The Data Backbone for Embodied AI

  • Rerun.AI, a high-scale logging and data platform, serves as the system of record for physical AI applications, including augmented reality, spatial computing, and robotics. Founder Nico West highlights Rerun's evolution from a debugging tool to a comprehensive data lakehouse, enabling rapid data curation and training loops.
    • Rerun's open-source SDK logs, models, queries, and visualizes multimodal, time-varying data from diverse sensors (3D, RGB, motion, text logs, neural network outputs).
    • The platform's core innovation is a flexible data model, inspired by entity component systems (ECS) from game development, allowing dynamic composition of data components rather than rigid object structures.
    • Rerun's custom, Rust-built, in-memory database and sparse file format (akin to a sparse Parquet) efficiently handle multimodal, multi-rate, episodic "physical data" that traditional tabular models cannot accommodate.
    • West asserts that visualization is too critical and pervasive across the development lifecycle to be monetized directly, leading Rerun to open-source its client-side visualizer and build a commercial cloud backend for large-scale data management.
  • "We've actually redesigned the data model probably four times at this point... we kind of really cared about getting those core pieces right and been ready to redo." – Nico West

Robotics Breakthroughs: Manipulation and Learning Paradigms

  • The robotics field experiences "incredible progress" in advanced manipulation, solving long-standing challenges. This acceleration stems from the scalable machine learning demonstrated by LLMs, attracting investment and data collection efforts.
    • Advanced Manipulation: Tasks like laundry folding, once "impossible," are now "boring" due to end-to-end learning methods.
    • Imitation Learning: Robots learn by observing human teleoperation, recording full state data (joint angles, perception inputs) to train neural networks for task replication. This "robotics version of supervised learning" began showing serious results 2-3 years ago.
    • Reinforcement Learning (RL): Traditionally used for walking and motion, RL now combines with imitation learning, yielding robust manipulation capabilities.
    • Ecosystem Drivers: The LLM "ChatGPT moment" proved scalable ML's power, priming the robotics sector. Innovations in modeling robotics problems (e.g., using Transformers), cheaper hardware, and increased investment in data collection fuel this growth.

Hype vs. Reality: The Productization Gap

  • Despite impressive research demos, a significant gap exists between advanced capabilities and deployable products, particularly in consumer robotics.
    • Two Robotics Company Types:
      • "Swinging for the fences": Produces "incredible demos" but remains "quite far from a product."
      • "Practical and scrappy": Aggressively deploys working products (e.g., warehouse robots) using open-source Vision-Language-Action (VLA) models and teleoperation where needed.
    • Industrial Deployment: Pockets of success exist in manufacturing (e.g., pick and place tasks), with companies deploying tens to hundreds of robots. These are typically robot vendors, not factories themselves.
    • Consumer Robotics: Robot vacuums remain the dominant consumer application, with newer models (like Matiq) offering improved autonomy through better mapping and robustness. Full home autonomy for complex tasks is still distant due to the "fat tail" of real-world variations and the need for comprehensive product systems (servicing, onboarding).
    • Underrated Startups: West highlights Generalist AI (for its foresight in data collection via UMI—Universal Manipulation Interface—puppeteering) and Ultra (for practical, AI-first logistics pick-and-place) and Cyriak (for manufacturing).

Challenges: Benchmarks, ROS, and the Funding Model

  • The robotics industry grapples with fundamental infrastructure and evaluation challenges.
    • Lack of Benchmarks: No "great benchmarks" exist due to the necessity of co-designing and co-training for specific hardware, making standardized evaluation difficult. Internal, live robot testing is the current norm.
    • ROS's Enduring Power: The Robot Operating System (ROS) persists despite its flaws due to strong network effects, a vast ecosystem, and a common message-passing standard. Funding models for its replacement remain elusive, despite its widespread use by well-funded companies.
    • High Valuations: Pre-revenue robotics companies command astonishing valuations, justified by the pursuit of "astonishingly large markets" and the need for massive scale to achieve hardware price points and collect the necessary data.
    • Humanoid Form Factor: Humanoid robots are favored for data collection efficiency as human operators can more easily control human-like forms, even if other form factors might be more efficient for specific tasks.

Embodied AI Data: The Core Bottleneck

  • Training embodied AI systems fundamentally differs from LLMs due to the nature of physical data, creating a significant infrastructure gap.
    • Shared Best Practice: Top AI teams (LLM and physical AI) integrate researchers/modelers with data preprocessing and modeling decisions.
    • LLM Data Advantages: Text data is easily visualized and processed using mature, high-scale tools (Parquet, Iceberg, Spark, Databricks) with declarative, database-style APIs for flexible querying (e.g., "all night recordings where the left gripper failed").
    • Robotics Data Disadvantages: Physical data is multimodal, multi-rate, and episodic, making it incompatible with tabular models. Existing robotics data formats are optimized for fast logging, not flexible querying. This forces teams to write "huge parallel jobs" with "imperative custom code" for every data query, a process that should be a simple SQL query.
    • Converging Trends: LLMs are increasingly used for data annotation (e.g., describing robot actions, identifying tasks) and embedding computation for search and curation in robotics.

Investor & Researcher Alpha

  • Data Infrastructure as the New Bottleneck: The primary constraint for scaling embodied AI is not model capability but the lack of robust, flexible, and high-performance data pipelines and query engines for multimodal, multi-rate physical data. Investment in this layer is critical.
  • Shift from Pure Simulation: Companies building real-world products are not solely relying on simulation. Real-world data collection and its efficient management are paramount, indicating a potential shift in research focus and capital allocation towards hybrid or real-data-centric approaches.
  • Generalist Robotics Potential: The pursuit of general-purpose robots, capable of serving diverse use cases with the same hardware, is a key driver of high valuations. This strategy aims to achieve the scale necessary for viable hardware price points, demanding massive data and compute.

Strategic Conclusion

  • The robotics industry stands at an inflection point, with advanced AI models demonstrating unprecedented manipulation capabilities. However, widespread product deployment hinges on bridging the profound gap in data infrastructure, demanding new paradigms for managing, querying, and curating multimodal physical data. The next step for the industry is to build robust, scalable data pipelines that match the sophistication of its AI models.

Others You May Like