The People's AI
June 13, 2025

Why the Robotics Revolution will be Powered by Decentralized AI, w/ the Founder of PrismaX

This episode dives into the current state of robotics and why decentralized data, as championed by PrismaX, is crucial for its mainstream adoption. Bailey Wang, co-founder of PrismaX and an MIT robotics veteran, explains how user-provided data and teleoperation can bridge the gap to a robotic future.

Robotics: Hardware Ready, Software Lagging

  • "Robotics is an interesting field because it combines both difficult hardware challenges and difficult software challenges and uniquely among any of these fields... the hardware challenges got solved before the software challenges."
  • "Nowadays... you see these crazy YouTube videos of robots doing parkour and front flips, back flips, barrel rolls, but you don't really see the robots being in the wild because they don't really have the intelligence or planning to be able to complete real tasks."
  • The physical components of robots (actuation, force generation) have seen significant advancements, making complex movements possible.
  • The primary bottleneck is software: AI capable of robust, autonomous interaction with the unpredictable real world. Current AI struggles with the high accuracy and diversity needed for physical tasks.
  • Traditional rules-based AI approaches are brittle and fail when encountering novel situations, unlike the emergent capabilities of large-scale AI models.

The Insatiable Appetite for Real-World Data

  • "When you want to train a robotics model, step one is to collect data and then you... stop because it's really annoying to collect data."
  • "If I'm a centralized project and even if I had the money to go hire a million people to collect data for me, I still don't know who to hire... you will always wind up with... manager bias."
  • Training robust robotics AI requires vast, diverse datasets of real-world interactions, a stark contrast to text-based AI that can leverage existing internet data.
  • Centralized data collection is costly, slow, and often results in biased datasets reflecting the limited perspectives of its creators, hindering model generalization.
  • PrismaX posits that decentralized, community-driven data collection can organically create more representative and robust datasets.

PrismaX: Decentralizing Robot Intelligence

  • "Fundamentally what inspired PrismaX is this idea that diverse large data sets and diverse large operator... environments are what powers these models and makes them robust."
  • "Right now... robotics companies are paying $50 an hour for this kind of operator [teleoperation]."
  • PrismaX is building a platform for users to contribute real-world data (e.g., videos of folding laundry, warehousing tasks) and remotely operate robots, addressing robotics' core needs: data, teleoperators, and models.
  • This decentralized approach allows for scalable data collection and teleoperation, with users potentially earning rewards (via a future PXM token) for their contributions.
  • Billion-dollar robotics companies are currently struggling to scale their teleoperation efforts, creating an immediate market for platforms like PrismaX that can connect them with a global operator base.

Key Takeaways:

  • The robotics revolution hinges on cracking the AI software challenge, which is fundamentally a data problem. Decentralized platforms offer a promising path to acquire the necessary diverse, real-world data at scale.
  • Hardware Isn't the Holdup: Robot bodies are capable; their brains (AI) need smarter, real-world training.
  • Data is the New Differentiator: Access to diverse, large-scale, real-world interaction data will determine the winners in robotics AI.
  • Decentralization Unlocks Scale: PrismaX bets that a community-driven approach to data collection and teleoperation can overcome the biases and bottlenecks of centralized efforts, fueling the next wave of robotic intelligence.

For further insights, watch the episode here: Link

This episode reveals how decentralized AI and user-owned data are set to break the data logjam in robotics, charting a course for the next wave of innovation and investment in real-world AI applications.

Guest Introduction and the Robotics Landscape

  • Host Jeff Wilser introduces Bailey Wang, co-founder of PrismaX, a new project describing itself as a "base layer for real-world robotics."
  • PrismaX's mission is to connect real-world data, provided by users, with robotics companies that urgently need this data to train their AI models.
  • The project recently emerged from stealth at an A16Z demo day, highlighting its potential.
  • The core concept involves rewarding users for contributing valuable data, such as videos of everyday actions (e.g., folding laundry) or even by remotely operating actual robots.

The Current State of Robotics: Hardware Solved, Software Lags

  • Bailey Wang, drawing from his extensive robotics research background, explains that robotics uniquely combines difficult hardware and software challenges.
  • He asserts that, surprisingly, "uniquely among any of these fields robotics in robotics the hardware challenges got solved before the software challenges."
  • Significant advancements in actuation—the ability of robots to generate forces and move—occurred roughly between 2010 and 2019.
  • Consequently, modern robots can perform impressive physical feats like parkour and backflips, as seen in viral videos. However, they generally lack the sophisticated intelligence and planning capabilities required for autonomous operation in real-world tasks.
  • While owning hardware presents a scaling barrier, Wang considers the fundamental hardware design problem largely solved, with off-the-shelf or near off-the-shelf parts available to construct various robot forms.
  • Strategic Implication: For investors and researchers, the primary bottleneck and opportunity in robotics now lie in software, AI, and data solutions that can imbue advanced hardware with practical intelligence.

Key Bottlenecks: Automation and Real-World AI Reliability

  • The most significant hurdle preventing widespread robot adoption is achieving true automation—enabling robots to complete tasks independently without human intervention.
  • This necessitates more advanced AI that can reliably interact with the complexities and unpredictability of the physical world, a far more demanding task than AI operating in purely digital domains like ChatGPT.
  • Errors made by physical AI can have much more severe and tangible consequences. Bailey Wang illustrates this point: "If you go deploy a model like that on a robot like it's trying to clean your kitchen and you come back and 10% of the dishes have been thrown on the ground, you're going to be very angry."
  • The sheer diversity of real-world environments (e.g., countless variations in kitchen layouts, objects, and lighting) poses a formidable challenge for current AI models.
  • Actionable Insight: The robotics field urgently needs AI models demonstrating exceptionally high accuracy, robustness, and adaptability in unstructured physical settings. Reliability is non-negotiable for practical applications.

Limitations of Rules-Based Approaches in Robotics

  • Current attempts at robot intelligence often involve a rules-based approach. This might use machine vision to identify objects in a scene, then a VLM (Visual Language Model) – an AI model capable of processing and understanding both visual information and text – to plan tasks. However, these systems frequently depend on intricate rules written by human developers.
  • Such human-coded rules inherently struggle when encountering novel scenarios or environments that the developers did not anticipate.
  • Host Jeff Wilser draws a parallel to the evolution of general AI: before the advent of transformer architectures and LLMs (Large Language Models) – AI systems trained on vast quantities of text to understand and generate human language – AI development often relied on extensive manual data labeling. Transformers enabled models to learn complex patterns from massive, often unlabeled, datasets.
  • Bailey Wang acknowledges that modern rules-based systems are more sophisticated, often incorporating LLMs for high-level task planning (e.g., breaking down "clean the kitchen" into sequential steps).
  • Despite these advancements, the fundamental process remains reliant on human-defined rules and assumptions about the environment, which are prone to failure in unexpected situations. "It's one of those unfortunate things that even the best teams of developers are nowhere near omniscient. So they can't plan for everything and that's why the LLMs are so powerful, right?" notes Wang.
  • Strategic Consideration: The inherent brittleness of rules-based systems underscores the critical need for data-driven, emergent AI in robotics, mirroring the paradigm shift LLMs brought to language processing.

Bailey Wang's Journey: From MIT's Mini Cheetah to PrismaX

  • Bailey Wang shares his background as a robotics researcher at MIT, where he was a key member of the Mini Cheetah team.
  • The Mini Cheetah was a groundbreaking robot, described as the first "commercially manufacturable robot dog." It utilized advanced quasi-direct drive actuators – motors that provide high torque and precise force feedback, enabling dynamic movements – and was designed using mass production techniques. This small, agile robot could run, jump, and even perform backflips.
  • Despite its impressive hardware capabilities, Wang and the MIT team made a conscious decision not to commercialize the Mini Cheetah. He states, "We couldn't figure out how how to get the robot to do anything useful."
  • The most apparent application was as an expensive robotic pet for the wealthy, which they deemed not a scalable or venture-backed market. This experience highlighted the critical gap between hardware capabilities and practical software utility in robotics.
  • Insight: Wang's firsthand experience with state-of-the-art hardware (Mini Cheetah) failing to achieve practical application due to software and data limitations provides a compelling rationale for PrismaX's data-centric strategy.

The Data Scarcity Problem in Robotics

  • The emergence of models like GPT-3 powerfully demonstrated the efficacy of large-scale, unsupervised training on massive datasets in achieving emergent intelligent behaviors.
  • However, applying this successful paradigm to robotics immediately confronts a significant data problem: collecting diverse, real-world data for robot training is exceptionally difficult, time-consuming, and costly.
  • Jeff Wilser draws an analogy to Google Maps, which required an enormous physical effort to deploy cars with cameras worldwide to gather Street View data—a far more complex undertaking than indexing websites.
  • Bailey Wang concurs, emphasizing that robotics data presents even greater complexity due to the variety of sensor modalities involved (vision, touch, force) and the fine-grained nature of physical interactions (e.g., accounting for slight inclines, friction, material properties).
  • Actionable Insight: The "data moat" in robotics is substantial. Solutions like PrismaX, which aim to efficiently and scalably acquire diverse, high-quality, real-world data, are positioned to unlock immense value in the field.

Learning from Visual Data: A Path to Scalable Robot Training

  • Bailey Wang proposes that a significant portion of robot learning, especially for high-level tasks, can be achieved through visual data, much like humans learn many skills by watching others.
  • He suggests a two-tiered approach:
    • Low-level models, possibly trained using simulated data, could handle hardware-specific controls like grasping objects without crushing them or understanding basic physics like friction.
    • Higher-level models, which require vast amounts of data, would then learn complex task sequences and decision-making primarily from visual inputs (e.g., videos of tasks being performed).
  • Wang challenges the notion that constant force data is essential for all aspects of robot training. "When humans want to learn how to complete new tasks, you I don't have to like pick up your arms and like carry you through the motion... you're smart enough about the world... to learn how to do this task," he argues.
  • Large-scale training on visual data could then be fine-tuned with smaller amounts of robot-specific interaction data, potentially gathered through methods like teleoperation, to adapt the learned skills to a particular robot's embodiment.
  • Research Focus: A key area for Crypto AI researchers is investigating the extent to which visual data can train sophisticated robotic task planning and how to optimally integrate this with minimal, targeted embodied experience for different hardware.

PrismaX: Decentralized Data for Robust and Unbiased Robotics AI

  • The core inspiration behind PrismaX is the understanding that truly robust and generalizable AI models are powered by diverse, large-scale datasets and varied operator environments.
  • For robotics, Wang argues that decentralized data projects offer more than just cost savings; they are about "providing the data that works."
  • He critiques centralized data collection efforts, which, even with substantial funding, often suffer from "manager's bias"—where the collected data inadvertently reflects the limited perspectives, assumptions, and priorities of a small group of decision-makers.
  • "If you let the community involve organically you wind up with a sampling of the world which much more closely corresponds to what people care about in the real world and that gives you robustness," Wang explains.
  • Decentralization fosters emergent behavior in data collection, mirroring the emergent capabilities seen in LLMs trained on broad internet data. This approach naturally prioritizes data related to tasks with genuine real-world value and frequency.
  • Strategic Implication: Decentralized networks like PrismaX have the potential to surpass centralized data collection efforts in creating truly generalizable, adaptable, and unbiased AI for robotics by tapping into a global, diverse pool of data contributors and scenarios.

PrismaX in Practice: Data Modalities and User Participation

  • PrismaX is designed to address three fundamental needs in the robotics ecosystem: high-quality data, skilled teleoperators (human operators who remotely control robots), and effective AI models.
  • Users can contribute to the PrismaX network in several ways:
    • Submitting passive environmental videos, such as clips from their smartphones. The PrismaX protocol will then analyze and identify useful segments, rewarding contributors accordingly.
    • Providing first-person view (FPV) videos of themselves completing specific tasks, either those requested by robotics companies or tasks they perform routinely.
    • Actively participating in teleoperation, remotely controlling robots connected to the network. This method collects extremely high-quality, embodied data.
  • The platform allows for a spectrum of participation, from passive (uploading existing videos for potential rewards) to highly active (engaging in skilled robot teleoperation, which can offer more immediate and substantial incentives).
  • Bailey Wang highlights the existing demand: "Robotics companies are paying $50 an hour for this kind of operator."
  • Investor Insight: PrismaX's multi-modal data strategy caters to various user engagement levels and diverse data requirements of robotics companies. The teleoperation segment, in particular, represents an existing market with immediate revenue potential that decentralized platforms can tap into.

Use Case Deep Dive: The Sheet Folding Project

  • PrismaX is actively working on a sheet folding data collection project. This choice is driven by the significant interest in domestic robotics within the industry, with applications in hospitality (e.g., hotels) being a clear go-to-market.
  • Sheet folding is an accessible task for data contribution (most people have sheets or towels) and is relatively low-stakes if a robot makes an error, unlike safety-critical applications like autonomous driving. Host Jeff Wilser aptly calls it "a great low-hanging fruit."
  • The project aims to gather around 10,000 hours of diverse videos showing people folding sheets. This diversity is crucial for training a robust AI model.
  • The AI model learns by identifying repetitive patterns within these unlabeled videos, building a rich internal representation (often called "latents") of the task.
  • These learned latents are then decoded by a smaller, "embodiment-specific model" tailored to a particular robot's hardware. This smaller model translates the general understanding into specific commands, like gripper positions or joint forces, for that robot.
  • "The model will go sort of watch those videos and learn the repetitive patterns in those videos without ever having them labeled," Wang elaborates.
  • Research Consideration: This two-stage learning architecture—a large, general model for feature extraction (latents) and smaller, specialized models for specific robot embodiments—is a promising approach for creating AI that can generalize across different types of robot hardware.

Nuances of Data Collection: Camera Angles and Obscured Views

  • Jeff Wilser raises a practical question about the importance of camera angles and the potential for blind spots if most contributed videos are filmed from similar perspectives.
  • Bailey Wang acknowledges that for highly specific, isolated tasks like sheet folding (where the sheet itself might frequently obscure parts of the action), data from multiple viewpoints (e.g., front and back) can indeed be beneficial.
  • However, for training large-scale foundation models intended for more general robotic intelligence, the emphasis shifts. With enough diverse data covering a wide range of actions and environments, the AI model should ideally learn to infer occluded parts or understand actions from various perspectives.
  • Wang argues against a strict requirement for specialized sensors or exclusively egocentric (robot's-eye view) video for all data collection. "If you go set a camera on on a desk and then like do things that contains a lot of information already," he suggests.
  • The key is that the model should learn to extract the underlying actions and intentions regardless of the specific viewpoint, much like humans can learn by observing a task from different angles, provided the view is reasonably clear and of sufficient resolution.
  • Key Takeaway: While varied viewpoints are generally advantageous, for foundational robotics models, the sheer quantity and diversity of scenarios, actions, and environments in the training data are paramount, potentially outweighing rigidly prescribed recording conditions.

Expanding Use Cases: Warehousing and Restocking

  • Beyond domestic tasks, Bailey Wang identifies warehousing and restocking as a particularly compelling near-term use case for robotics AI trained with PrismaX data.
  • This application domain aligns well with the daily activities of many potential data contributors, such as gig economy workers or individuals employed in retail and convenience stores.
  • There is a strong and immediate industry demand for automation in areas like last-mile logistics, light warehousing, and in-store restocking.
  • Data collection for such use cases can be relatively passive; for example, workers could wear a simple chest-mounted camera while performing their regular duties, capturing valuable real-world interaction data.
  • Market Opportunity: Focusing on use cases where existing workforces can easily and unobtrusively integrate data collection into their routines offers a pragmatic and scalable pathway for acquiring specialized data for industrial and commercial robotics applications.

Incentives, Community, and the Role of Teleoperation

  • PrismaX is strategically building both the demand side (engaging with robotics companies needing data) and the supply side (attracting data contributors) of its network.
  • On the supply side, PrismaX plans to collaborate with existing communities of data enthusiasts (such as those involved with Vana, a partner project focused on user-owned data) and crypto-native gaming guilds (like Yield Guild Games - YGG) that are seeking opportunities with real-world value.
  • These communities are often already familiar with incentive structures involving tokens or points that can be converted to future value.
  • Robot teleoperation is highlighted as an especially interesting area for incentive alignment and value creation.
  • "There are customers that are willing to like they're paying $50 an hour for that service right now, but they can't scale," Wang reveals, pointing to a clear market inefficiency.
  • He describes how even billion-dollar robotics companies often struggle to scale their small, sometimes founder-led, teleoperation teams needed for data collection and model validation.
  • PrismaX aims to provide a scalable solution, encouraging the formation of operator guilds. These guilds could organize teleoperators and potentially even invest in robots to generate data and earn incentives, creating a self-sustaining ecosystem.
  • Crypto AI Angle: The synergy between crypto-native communities (adept at token-based economies and guild structures) and the tangible, real-world demand for teleoperation data creates a powerful value proposition for decentralized platforms like PrismaX.

PrismaX's Web3 Mechanics: Tokenomics and DAO Structure

  • PrismaX will incorporate a native token, to be named "PIX" (though not yet live at the time of recording).
  • The platform will utilize Data DAOs (Decentralized Autonomous Organizations). These are community-governed entities responsible for managing access to the collected data, curating datasets, and facilitating data sales.
  • The tokenomics are designed to incentivize participation:
    • New, valuable data contributed to the network will result in the generation of new PIX tokens, reflecting the value added to the ecosystem.
    • Transactions, such as robotics companies purchasing access to data, will likely involve a token buy-and-burn mechanism, where tokens are bought from the market and permanently removed from circulation, potentially increasing the value of remaining tokens.
  • DAOs play a crucial role in this model, particularly in managing the buy-and-burn process, which can offer a more decentralized and potentially regulatory-compliant approach compared to centralized buy-back programs.
  • Investor Note: The sustainability and effectiveness of PrismaX's tokenomic model will be critical. Investors should assess how well it aligns incentives for both high-quality data contribution and consistent data consumption, and the governance role of the DAOs in maintaining this balance.

PrismaX's Emergence: A16Z Demo Day and Platform Launch

  • PrismaX operated in stealth mode for approximately a year, primarily focusing on establishing relationships with robotics companies to understand and build for the demand side of their data marketplace.
  • The project is now publicly launching to cultivate its community of data contributors and users.
  • A significant milestone was PrismaX's participation in the A16Z CXX SF cohort. A16Z (Andreessen Horowitz) is a prominent venture capital firm, and CXX is their crypto startup school, indicating strong early backing and validation. PrismaX also recently closed a seed funding round.
  • With its public launch, PrismaX has unveiled an early version of its user portal. This portal allows users to begin interacting with the platform by uploading existing data, collecting new data based on initial guidelines, and will soon enable them to reserve and teleoperate robots connected to the network.
  • Bailey Wang emphasizes the unique nature of PrismaX's two-sided data collection model: "There's no two-sided data collection projects yet where you you engage with like someone else's hardware or someone else's product to collect data for that product."
  • Strategic Move: Launching with a functional portal, coupled with the credibility of A16Z's backing, signals strong initial momentum and a clear strategy for developing a robust two-sided marketplace for robotics data.

The Appeal and Mechanics of Remote Robot Operation

  • Host Jeff Wilser expresses genuine enthusiasm for the prospect of remotely operating robots, suggesting it could even be an enjoyable activity people might pay for.
  • Bailey Wang confirms this intrinsic appeal and, more importantly, the existing commercial demand: "There are people who will pay you $15 for you to operate that robot because right now the like whatever the CTO has to go operate the robot himself to collect the data."
  • He reiterates that major robotics companies currently possess robots that require remote human operation for data collection and refinement, yet they lack scalable fleets of skilled operators.
  • PrismaX aims to address this by providing a standardized operating system (OS), user interface (UI), and setup for remote robot operation, accessible via web browsers, VR devices, or even mobile phones.
  • This standardization benefits both operators (allowing skills learned on one robot platform to be transferable) and the robotics industry (eliminating the need for each company to develop its own duplicative teleoperation tech stack).
  • By offering these tools, potentially free of charge initially, PrismaX can drive adoption among robotics companies. Each company that integrates with the PrismaX stack adds further value and opportunity to the network for data contributors and operators.
  • Actionable Insight: The teleoperation market for robotics data collection is an underserved niche with immediate and quantifiable demand. PrismaX's platform approach, focused on standardization and scalability, is well-positioned to capture significant value by connecting this demand with a decentralized supply of operators.

Future Vision: Human-Robot Synergy and Enhanced Productivity

  • Looking ahead, Bailey Wang envisions a future characterized by human-robot collaboration rather than outright replacement. He sees people working "as the backend of robots," performing tasks like ensuring operational reliability, handling edge cases, and conducting teleoperation when full autonomy isn't feasible or desirable.
  • This synergy could lead to significant productivity gains, where "one person will be able to do the work of seven or 10 people." Such an increase in efficiency could translate into more leisure time for individuals or the ability to earn more while working less.
  • In the near term, Wang is particularly excited about the impact of robotics in environments like commercial kitchens and restaurants. Automation here is not solely about reducing costs but also about enabling greater culinary diversity, allowing human staff to focus on creativity and customer experience while robots handle repetitive preparation and cleanup tasks.
  • "You'll see it, you'll feel it, and your life will become better and more exciting because of this technology shift," Wang predicts regarding the near-term impact of robotics in sectors like food service.
  • The ultimate long-term "endgame vision," shared by many in the field, is a robot in every home, capable of performing a wide range of domestic chores.
  • Perspective: Wang’s vision of human augmentation, rather than replacement, offers a more optimistic and potentially more socially acceptable narrative for the integration of advanced robotics into society.

Overcoming the Main Obstacle: The Need for Vast Data

  • To realize the vision of mainstream robotics, such as a ubiquitous "toilet cleaner robot," Bailey Wang identifies the primary obstacle: an immense need for data.
  • He states unequivocally: "In order for the toilet cleaning robot to be mainstream, we need a heck of a lot more data."
  • Robots require vast and diverse datasets to learn how to navigate the myriad complexities of real-world environments like homes—each with unique layouts, objects, and conditions—and to perform tasks reliably and safely.
  • While a highly specialized robot capable of only cleaning toilets might be feasible with current technology, its limited utility would not justify its cost or complexity. True value comes from general-purpose capabilities, which demand exponentially more data.
  • Wang foresees a developmental progression for such tasks: starting with collecting videos of humans performing the task (e.g., cleaning toilets), moving to teleoperated robots performing the task under human guidance, and eventually culminating in fully autonomous robots.
  • Core Challenge: The sheer volume, diversity, and contextual richness of data required for general-purpose domestic robots highlight the absolutely critical role of scalable, efficient, and diverse data collection platforms like PrismaX.

Timelines and Five-Year Predictions for Robotics

  • While hesitant to provide a definitive timeline for the arrival of general-purpose domestic robots, Bailey Wang suggests it might be "about a decade" before the underlying technology and AI models mature sufficiently to handle such complex, open-ended tasks. He speculates that high-value, more constrained tasks (like specific cleaning functions) might be achievable sooner.
  • His five-year prediction focuses on a more subtle integration of robotics: robots will "silently like be part of your life and you won't you won't really notice it, but you'll find that your life has just gotten better."
  • Examples include improved efficiency in services like restaurants (more timely food delivery due to robotic kitchen assistance) and hotels (cleaner rooms, with human staff freed up to focus on guest experience while robots handle laundry and cleaning).
  • He draws a parallel to the early impact of AI, where advancements like improved Google Translate or reverse image search enhanced daily life without users necessarily being aware of the sophisticated AI powering them. This was before the "ChatGPT moment" where direct, explicit interaction with AI became mainstream.
  • "Maybe what you want is a tool that like silently works in the background to make your make your life better," Wang muses, suggesting that the most impactful robotics applications may not always be the most overtly visible ones.
  • Expectation Setting: Investors and researchers should anticipate a gradual, often behind-the-scenes, integration of robotics technology, delivering incremental improvements in efficiency and service quality across various sectors before highly visible, anthropomorphic robots become a common sight.

Reflective and Strategic Conclusion

This episode underscores that decentralized, user-sourced data is the linchpin for unlocking the robotics revolution. Crypto AI investors and researchers should monitor platforms like PrismaX, as they pioneer scalable solutions to the critical data bottleneck, potentially defining the next era of real-world AI.

Others You May Like