The People's AI
March 5, 2025

Who Owns AI? Fixing the Data Problem, w/ Vana's Creator Anna Kazlauskas

In this episode, Jeff Wilser dives into the critical issue of data ownership in AI with Anna Kazlauskas, co-founder of Vana. The discussion centers on how Vana aims to revolutionize data sovereignty, allowing users to own and monetize their data, thereby enhancing AI development.

Data Ownership and Sovereignty

  • “Most people don't conceive of their data as their property... In an AI native world, data really is this core asset.”
  • “You legally own all of your data today... It's the same as when you put your car in a parking lot; the parking lot doesn't own your car.”
  • Data is the lifeblood of AI, yet users often don't realize they legally own their data.
  • Vana's mission is to empower users to control and monetize their data, akin to land ownership.
  • Data sovereignty allows users to decide how their data is used, creating a fairer AI ecosystem.

The Data Problem in AI

  • “AI companies are running out of data... All of the low-hanging fruit has been plucked.”
  • “If you have good attribution... it's really just about scaling yourself.”
  • AI development is stalling due to a lack of new data sources.
  • Vana proposes a marketplace for user-owned data, ensuring privacy and fair compensation.
  • This model aligns user incentives with AI progress, turning data into a valuable asset.

Vana's Approach to Data Markets

  • “When you connect your data to Vana... you're putting your data in your wallet.”
  • “Data Dows are these applications that are possible to build on top of Vana.”
  • Vana uses blockchain to create a secure, user-controlled data marketplace.
  • Users can join data DAOs, receiving tokens that represent their data's value.
  • This system allows for the creation of high-quality AI models, leveraging diverse data sets.

The Future of Decentralized AI

  • “We can't just win on ideology; we have to win on actual capability.”
  • “If you have the most data and the most high-quality data... you can build the best AI model.”
  • Decentralized AI must outperform centralized models to succeed.
  • Vana aims to aggregate user data to build superior AI models.
  • The vision is a user-owned AI ecosystem, distributing economic benefits widely.

Key Takeaways:

  • Data sovereignty is crucial for a fair AI ecosystem, empowering users to control and monetize their data.
  • Vana's blockchain-based marketplace offers a solution to AI's data scarcity, aligning user incentives with AI development.
  • The future of AI lies in decentralized models that leverage user data to outperform centralized systems.

For further insights and detailed discussions, watch the full podcast: Link

Welcome to this episode titled "Understanding Data Sovereignty with Anna Koslowski: Building a Decentralized AI Future".

AI's Dependency on Data

  • Host Jeff Wilser introduces the critical role data plays in AI development, emphasizing that AI's quality hinges on data quality. The common adage "data is the new oil" is used to contextualize the current discourse on data sourcing and ethics.
  • Wilser highlights two significant issues in AI data: legal ambiguities in data acquisition by major tech firms and potential data shortages as accessible internet data is already extensively harvested.

AI's Data Conundrum

  • Discusses the dual challenges: untraceable sourcing of data by tech giants and the exhaustion of easily available data, which could hinder AI progress.
  • Personal anecdote: Wilser, an author, recounts discovering his books in an AI training dataset without his consent, reflecting broader concerns over intellectual property and AI training practices.

Vanana's Vision for Data Sovereignty

  • Introduction to Vanana's mission to establish a user-controlled data marketplace using blockchain technology for privacy protection. This allows users to retain ownership and receive compensation for their data used in AI development.
  • Communicates how Vanana aims to break AI’s dependency on tech giants by offering an unprecedented supply of high-quality data.

What is Data Sovereignty?

  • Anna Koslowski defines data sovereignty as the self-sovereign ownership of data by users, giving them control over how their data is used and the potential to earn from it.
  • She likens data sovereignty to historical shifts in land ownership that led to increased productivity, underlining the transformative potential in data management.

Current State of Data Ownership

  • Koslowski asserts that individuals legally own their data even when stored with tech giants, drawing an analogy to parking lot car ownership to clarify misunderstandings.
  • Emphasizes that rights over data are strong, though often obscured by complex terms of service agreements, convenient for large platforms’ current business models.

Transforming Data Sovereignty into Practical AI Applications

  • Discusses how Vanana could catalyze more democratized AI development by aligning user data contributions with developers’ needs for high-quality datasets.
  • Highlights the transformation from individual data contribution to collective power, using the concept of a "data labor union."

The Role of Data Dows

  • Explains the function of data Dows, which organize collective user data, providing contributors with tokens that represent their share in the data assets.
  • Details various existing data Dows within networks like Amazon and Spotify, each with its specific utility in training AI and how users benefit from tokenized data economies.

Journey from Economics to AI and Blockchain

  • Koslowski narrates her journey from programming with graphing calculators to a deep interest in economics, eventually leading to a fascination with decentralized blockchain systems and AI.
  • Her experience at the MIT Bitcoin Club kindled her interest in decentralized systems as a means to empower individuals economically.

Vanana’s Technical Framework

  • Explains the technical processes users undergo on Vanana, from data encryption to decentralized storage solutions, ensuring privacy and control.
  • Outlines Vanana's framework for integrating user data into scalable AI model training through federated AI approaches.

Future Directions and the Grand Vision

  • Koslowski shares the ambitious vision of using decentralized data systems to outpace traditional tech giants by leveraging unique, high-quality data pools to train superior AI models.
  • Discusses potential economic and societal impacts of decentralized data systems transforming AI development.

Takeaways and Future Conversations

  • Recap of key themes: the urgency of data sovereignty, decentralized AI’s potential, and Vanana’s pioneering role.
  • Encourages listeners to engage with Vanana and explore data sovereignty, with actionable insights on participating in data Dows.
  • Invites listeners to contemplate future implications and invites Anna for continued dialogue on building towards a decentralized data paradigm.

External Resources

  • Suggestions for exploring technical papers on AI models discussed, including "Attention is All You Need."
  • Links to Vanana’s platform and related projects for deeper engagement and participation in the data revolution.

Others You May Like