Latent Space
December 31, 2025

[State of Research Funding] Beyond NSF, Slingshots, Open Frontiers — Andy Konwinski, Laude Institute

How the Databricks Motion Saves Open AI Research by Latent Space

Author: Andy Konwinski

Date: October 2023

Quick Insight: This summary is for builders and investors who realize that the most valuable AI breakthroughs are moving from closed labs to open compound systems. It explains how the Laud Institute uses a venture-style picker method to turn academic research into trillion dollar infrastructure.

  • 💡 Why is the Databricks motion: the new gold standard for AI startups?
  • 💡 How does the layer above the model: create more value than pre-training?
  • 💡 Can a nonprofit grant system: outpace the NSF in funding frontier research?

Top 3 Ideas

🏗️ THE DATABRICKS MOTION

"I want big breakthroughs to happen in the open."
  • Founder Resilience: Large research teams that have worked together for years reduce the risk of founder divorce. This stability allows teams to focus on disruptive ideas rather than internal politics.
  • Open Velocity: Shipping research as open source projects creates a funnel for massive commercial success. This path turns academic insights into products used by millions.
  • Meritocratic Funding: Using successful technologists to fund new researchers creates a virtuous cycle of wealth and innovation. This ensures that the next trillion dollar company starts with a public breakthrough.

🏗️ THE ORCHESTRATION LAYER

"The language models themselves are more like compute or GPU a generation ago."
  • Compound Systems: Value is moving from the base model to the management layer that handles memory and tools. This abstraction allows for higher impact decisions in software architecture.
  • Prompt Optimization: Tools like DSPy and Jepa use evolutionary techniques to improve model outputs without retraining. This reduces the need for expensive GPU hours while increasing performance.
  • Continual Learning: Bespoke information can be integrated into systems without massive weight updates. This makes AI systems more responsive to real-time data.

🏗️ THE OPEN FRONTIER

"Western open science has lost the number one spot to China."
  • Research Transparency: Chinese startups are currently out-publishing American labs that have closed their doors. This gap threatens the lead of Western AI development.
  • Venture Picking: Applying Silicon Valley intuition to research grants identifies winners faster than traditional government processes. This high velocity method deploys capital where it has the most impact.
  • Unified Front: Bringing together the most influential open researchers creates a roadmap for the entire ecosystem. This collaboration ensures that open research remains competitive with closed labs.

Actionable Takeaways

  • 🌐 The Macro Trend: The center of gravity in AI is moving from closed-door pre-training to open-source compound systems that prioritize context management.
  • ⚡ The Tactical Edge: Identify research teams with long histories of collaboration and fund them before they incorporate to capture the highest upside.
  • 🎯 The Bottom Line: Open research is the only way to maintain a democratic and competitive AI ecosystem against both closed labs and international rivals.

Podcast Link: Click here to listen

Light and space to fire. Wake up. Light. We're here with Annie Kinsky of Loud Institute. Welcome. Yeah. Thank you so much. It's weird to welcome to you to your own lawn show. We'll talk about that. No, I am welcome here. Yes, everybody that everybody invited is Yes, I'm here. So your intro is very very long but you're a co-founder of data bricks and perplexity and now I think people are less familiar with the law institute and what you're doing it's brand new we covered your terminal bench events recently that was a little bit of a coming out party but maybe just for folks who are out of the loop what is the law institute?

Oh good question LA itself beyond just law institute is an organization that gives resources to researchers at the exact moment that they need them. So we say right resource, right researcher, right time and there's a sort of dotted line in the life cycle of a researcher who makes a company which is when you incorporate. So after that you want to raise venture funding. So law has a venture arm that funds researchers and researchy technologists that aren't coming from PhDs or or professorships to build companies. And that's actually a path that I am extremely passionate about with data bricks perplexity being companies founded by PhDs at paradigm at the heart of a paradigm respectively big data for data bricks and public cloud and then kind of search as genai came around for perplexity.

So that's a a part of a lot that I'm very passionate about just helping researchers build the next data bricks the next trillion dollar or so far hundred billion dollar company and before that dotted line researchers need in an unprecedented way like an existential existential way more resources to participate in frontier AI research people doing research in the open and inspired by data bricks and my history I want big breakthroughs to happen in the open from research things that move humanity forward like computing, you know, silicon or the personal computer or internet or big data with data bricks or the next paradigm of search with perplexity. So I want all that to be happening in the open. So I want to fund the researchers who are doing open research just in a philanthropic motion and across both of those there's so so there's law ventures for the venture fund law institute this nonprofit for the upstream stuff and across both this obsession with shipping in in the case of the venture fund shipping your code as product in the case of the nonprofit shipping your research as open source projects and then there's this ethos to bring meccratic principles which researchers love into the whole thing as a governing philosophy.

So for researchers and by researchers so the venture funding there's 50 professors and PhDs the top names like Jeff Dean and the top faculty of Berkeley and Stanford and my co-founders of data brace and perplexity who've invested in the fund and then give advice and make deal flow introductions and on the institute side it's my money as as sort of a successful technologist plus the tech the money of other billionaires who have done well by their tech companies giving money into the nonprofit to do this no strings attach grant writing that ultimately becomes a funnel for recreating the data bricks motion you know like make the breakthrough in the open go become a billionaire tech founder that translates your research insight into people products in people's hands so that's a lot law ventures law institute just kind of this one la vision a lot of visions I'll I'll dive in I want to talk about slingshots and as as a funding vehicle but you mentioned a a bit of this the data bricks motion and I think that's an interesting analogy because data bricks is very unusual as founding story and I'm wondering if it's replicable.

I thought why do you think it's unusual? Uh eight co-founders the seven eight. Yeah. So basically it what was the model that you think really worked for data bricks and does it transfer to the other law grantees who are not as big who don't have the and I think openai had quite a lot of co-founders. I think VMware had a decent number of early research team turned co-founding team. I think snorkel AI is a unicorn came out of out of Stanford to have a pretty good size founding team. Core research teams wouldn't when they gel at a university or outside at a research lab in an industry after years of of deep scars from working together knowing each other's ins and outs and team like the the dynamism and like the the gestalts like some is greater than the parts of working together as a team. They're great bets.

So you kind of derisked the founder divorce risk with uh that's one of the most big problems that VCs have to watch out for by having built a breakthrough together. You've also proven traction that you know how to come up with a disruptive idea. You still have to prove that you can turn it into a product market fit. But I think that the founding team sizes is one thing that there is some precedent for the path from I cited a few examples of the path from research just massively successful startup. the Google co-founders Larry and Sergey obviously took research NSF funded research by the way and turned it into one of the most iconic companies in the history of humanity. I cited some VMware. There's even examples, you know, more and more examples of researchers besides perplexity coming out of Berkeley like Ella Marina more recently has become the standard for evaluation for all of the the the whole ecosystem and that was a research turned startup product now project now.

So 10 years ago I would have agreed that it was and when we found a data brick sure it was a little bit more rare. Now I think it's becoming the gold standard and the thing that you know all the VCs want in on and being at inside the heart of it all lot institutes sort of bridges those two and is obsessed with that path to impact that so many researchers want to have. So I would say that um I do believe going you know we are we have reached the tipping point already of this being the most high lever path from breakthrough to breakout company to world changing you know hopefully we'll get to that trillion dollar or beyond we'll get some we'll be at 10 trillion dollars in value uh so I think it's been a new norm for and it'll be great for the world to have this path because researchers are they're both amazing founders and they're genuin ly good people in that they tend to want they're very utilitarian.

They want they want to bring all of humanity forward very pragmatic and reasonable and evidence-based. So kind of I think the world needs a little bit more of much of that right now too. Yeah. Amazing. Uh let's talk about some of the research directions that you have funded through your slingshot program. Um so we we already featured Terminal Bench. People can go look at that episode. Uh Arena we already mentioned. We also just did an episode. Um, one thing that we haven't talked about as much is Jeppa, which you have a couple of Jeepa uh co-authors in there. That's right. Um, maybe Jeepa and then any others that you care to pick out. But yeah, I've been asking people what's been buzzing as I've been talking to people at Nurups this year and this idea that the language models themselves are more like compute or GPU a generation ago where what can we build at the layer above and in software systems we've traditionally thought of be a more great example.

You have the operating system and the underlying architecture. you know, how do you make it a layer of abstraction above that empowers higher leverage decisions and choices and more applications that are more profound and powerful. So, I think that's neat to think that there's this layer above which is getting a lot of attention in the last year with context management, rag being the most well-known kind of implementation of this. It's this layer above where we're managing what the how the agents work and how they make decisions, how many memories they can keep track, how and how many memories they keep track of. That's all at this prompt management. And it's not just prompt anymore. It's like tool usage and and memory curation and lots of innovation happening at the earlier above the core stack pre-training, post- training, distillation and and now talking about one of the Delara the PhD student working on some of this this prompt up from his jeep related stuff talking about post post training and that's tying over post. I like that. I like yeah exactly.

So there that's tying over to um there's actually a little bit of bleed over where this layer above now what we talk about as compound systems or prompt optimization where and and context management is kind of getting a new level of buzz and a new level of of enthusiasm and maybe a level up of the number of researchers working hard at that layer. Jeepa I would say is the probably up there for or the most successful example of adoption so far of ESP well it is kind of embedded in adjacent to so kind of like I I don't know what to think of it you know it's like has like a couple years of branding and we're in ESPI the way I think of it is DSP is something closer more akin to a lang chain so it's a a framework for writing agent type codes you give it a task and it like com it it it takes natural language and comp uh it actually takes code and compiles natural language. So it's like a reverse compiler that's DSPI and the optimizers which came kind of were born in the context of this DSPI project they uh take a prompt and make it better.

So that so Jeppa is the part of optimizer and it's using this evolutionary genetic technique like a very old area of research reinvented by Lakshia the mean PhD student on Japa really the way I zoom out and think about it is this layer above the core AI model stuff and now it's broaching into doing model weight updates as well and there's this big debate about is it in the context is it rag plus vector database plus whatever and sort of like lightweight, very few like if I if I tell you I had yogurt for breakfast this morning, you can remember that. It's a very bespoke piece of information that I just told you. Uh you didn't have to like go pre-train your brain weights for another round of like 10,000 GPU hours. You're able waiting that. So that's kind of the parallel with this.

And that also takes us not to get too sprawling here into continual learning. Another buzz area that overlaps a ton with this layer. So really excited about that. that higher layer compound systems is kind of a word that people are using to capture it. Continual learning factors really close to it. Context management, comp optimization, jeepa, better together. Those are some of the buzzwords. Florida in that in that area. A lot of what your source of alpha is is coming from the Berkeley ecosystem. Uh obviously you're not exclusively Berkeley. There's a lot of more and more Stanford. More and more Stanford. I'd say kind of equal partnerships, right? Is there a risk of that you're just doing only west coast universities and no like you know the good ideas come from anywhere right that's true and we have massive focus to beyond Berkeley Stanford I would say it's power law big focus which obviously like wrote let's let's be objective Berkeley and Stanford have produced a lot so yeah more than anything else but walk around here right now and you'll talk to Jackson Clark second year PhD from UIC think second year your first second year from UI in Illinois several UI teams here he's running a thing called S bench we have teams Stanford and MIT uh sorry not Stanford CMU and MIT very well represented Wisconsin PhD students around here Caltech PhD students lots of great projects are of our slingshots I would say the majority are non Berkeley Stanford or something maybe I don't I need to double check that number but lots of great projects and a heavy push one one example is I started this PhD club focus on entrepreneurship at Berkeley when I was a PhD student in 2012 called computer science grad entrepreneurs it transformed in my brain eventually to this venture upon computer science grant ventures CSGV and that was a pro prototype that turned into lot eventually so has this lineage I' been doing this a while PhD yeah yeah 2012 PhD club now we've taken that I've been that and I've went to University of Washington so starting with another west coast school and we started a club called agent there's their CS department is in a building called Allen uh so it's Allen graduate entrepreneurs agent it's a PhD student club on entrepreneurship then we went to Isconin has started one called research to impact.

There's another one at UIC forming right now. There's another one one at Stanford called saplings. So, but a big focus on going to find the PhD students who are interested in shipping their research at CMU, UAC, MIT, Wisconsin, Taltech, all the top 15 universities, Toronto, Waterlue and McGill. Yeah. And then the the like vector and ma so North America bringing Canada into the mix because they've got also equally great universities up there. Yeah. There is a really big focus on bringing creating a fat pipe of bandwidth to the root of where all the action like you can't get around that the root is of all of it's happening in Silicon Valley. Yeah. It's really opening the the like flying them out here more engaging them deeply in the programs giving them money. Uh, let's be honest, getting you as a mentor. That's right. And animal collecting other Braden who did snorkel. He was a PhD of Chris Ray at Samford who went on to do a unicorn company. He joined a lot. So, assembling a partnership like a venture partnership, but for PhDs and researchers turn founder turn unicorn founder.

Yeah. I want to get to open frontiers, but one more question about just the NSF, which I think you you mentioned once for for about data bricks, but also I think you're trying to target the law grants as sort of NSF level prestige and and and and impact. I think one thing I maybe as like a spicy question is well what's broken about the NSF process you're trying to fix? Uh I don't think NSF is broken. I love NSF. It has been probably the best investment the American popular public has ever made. Thousands xx return on their investments. I mentioned Google earlier, data bricks, all these researchers came from NSF funding. Uh and it's just been a paradigm generation. So DARPA was important for at some point. There's been these these paradigm shifts in how open research has been funded. DARPA was a key one. NSF's been a key one. And now NSF is not is not big enough. It was $1 billion a year for computer science and that they're trying to cut that into half of that. But we need $10 to hundred billion dollars to do frontier AI research. So it's not broken. They are trying to break it more is insufficient. You need more NSF.

And and I think in addition to the mechanisms NSF has used to to boy the money which are very effective. We have secret sauce in Silicon Valley of how startups find product market fit. We have really good pickers. We have venture capitalists that that have their own sort of evolutionary uh Hunger Games way of like finding who is great who are really good at at having intuition when they meet a person to pick the next winner. That's a that's a a unique approach that NSF does not use. They do not make a little venture partnership inspired group of research partners who go grant do grant writing. you don't get equity back, but you do uh empower the right project. If you if you got good pickers, you can actually you can actually have much more effective deployment of a way you can actually deploy an order of magnitude less capital and have more impact with this approach and it's very complimentary because you can still have traditional NSF.

INSAP is is brings faculty, many of which are very near and dear friends, into a centralized organization that is in charge of deploying billions and billions of dollars of funding, has an insane track record, and we are going to complement that with a much more Silicon Valley inspired approach of a high velocity, turn the funding around really quickly, have very strong opinions about the type of research we fund and re and be very narrow in the type of research. NSF funds everything from like all types of research from biology to you know like computer to uh and beyond uh English research and anthropology and sociology like I said 1 billion to computer science 10 billion total so one10enth going computer science and NSF has to manage all of that law and law like approaches were focused very tightly on high impact computer science research yeah especially AI but AI systems you know crypto cryptography security in networking and architecture.

Um, so like other areas besides core AI as well. So by being so laser focused, we can go deeper by bringing in researchers who have shipped, researchers who started companies, researchers who had found product market fit. We can actually identify projects that are more likely to become a data bricks or an Apache spar or array or an almarina sooner and and and with more confidence. And so I think it's a very complimentary model, but nobody's ever done it before. and you needed someone to kind of step up and propose a pretty fundamentally different architecture or organization shape like you need to hire people in a different way. It's a nonprofit which is you know got its own challenges but you wanted to behave in a very lean extremely high velocity way like a startup. So that's what we did. It's working amazingly so far. Congrats on on everything. Final question. Uh we have the open frontiers uh announcement and you're also doing the law launch here in Euros. How are you able to get such amazing people? You got Oral Vignyals and Jeff Dean and I saw Dylan Patel was a long long one. Yeah, that's so fun. Uh so so basically like uh what is open frontiers? Where is this all going? Great. Yeah.

So L lounge basically Nurips has needed a VIP lounge. We have VVIPs the gods of computer science and AI is you know obviously AI particularly walking around the halls of that Giants conference hall. It's actually hard to find a chair much less a comfortable couch. You got meeting room 23A on good. Yeah. No, no, no. It's It's not a VIP launch at all. It's just Yeah. Yeah. So, the idea here is open 8 to 10, 8 a.m. to 10 p.m. Identifi and it's not too hard to know who the big names are. The high impact researchers, Yan Stoka, France Shalet, Yen. Yeah, Jen D is over there and I mean Tallwalker and we kind of know the circles already. So, it just took a lot of emails and texts to like, "Hey, if you're going to be in nerves, cross the street. We're just going to throw VIP lounge Wednesday, Thursday, Friday, free food, Starlink, Wi-Fi, open all day, chat with us. You want to jump on a podcast. You want to talk to some talk to Swix." I mean, you're a big draw here, too. Uh, walking through, people like, "Oh, is that Swix?" Yeah. Like, you just walk through and you feel important and you are important if you're hanging out here. So, it's it's working. So it's like the MVP of open frontiers. Okay. So that's that's Law Lounge.

Open Frontier is a project that I've been cooking on for a while in my brain, but really has come together in the last 3 weeks really quickly. It's inspired by the core premise of open research that happen that that is symbolized by Nurups, right? People publishing their ideas. And what's happened in the last year is western open science and research discourse has no has has lost the number one spot to China. So uh we if you ask Stanford and Berkeley PhD students which I have dozens of them and they're hanging around here I'm going to introduce you where where are the best papers the most interesting papers about AI coming they'll say I read twice as many interesting papers by Chinese startups than I did by Americans because really just make an effort. Yep. Moonshot, Kimmy, Deep Seek, they're publishing really interesting stuff. They make an effort to talk about it. Whereas in the United States, since OpenAI closed their doors and stopped publishing, so did all the other labs. And by and large, you go and you don't get to publish. You are actually working on the frontier at those labs, but you're not talking about it.

And so, so you need to open the frontier. The so the openness is the key word. Um, Open AI was called Open AI and they were open for a while and now they're not. We need something that fills in that gap to be a champion and bring together all open researchers to put forward a unified front at the so that we can operate at the frontier. And so that's what the goal of this thing is to bring together every single organization that is leading in open research. So Allen Institute, France law will be around for ARC prize foundation. We have Berkeley, Stanford, Skyab, Berkeley AI research bearer. You got you know the like Yedjin's lab and Marin the project of Percy Leang who's kind of very senior at HAI and runs the center for uh foundation modelation models. He coined the term. Exactly. Yeah. Foundation model. So and Percy was in the lab with a lot of this is just my my network because Percy and I were in the same as lab with mate and all the other data bricks founders back in 2008 back at Berkeley. So it wasn't hard to go find these people. Everyone I've approached said yes.

The idea is like let's team up, get together one day in San Francisco in the next five months and get the hundred most influential open researchers together for a conference and for meetings where we share our road maps and we talk about common goals amongst the entire ecosystem. It's kind of surprising this hasn't happened yet. And the the the thing we're basically doing is making an open frontier lab effort starting with a conference. Yeah. And it's live streamed so they'll actually open so that we actually can actually like embody the democratic principles that have gotten us to where we are today. Get an order of magnitude more people in the world watching the breakthroughs that are happening in these labs. All the continual learning, prompt optimization, BLM SG, breakthroughs in inference, breakthroughs in evaluations, terminal bench and a lot of other I just heard for the first time about this benchmark called impossible bench. It's a genius idea. I haven't talked about it with you until Yeah. just so um that that dissemination of what's actually happening will capture the world's attention and in turn the goal is to with the world's attention help this become to gel into uh the a well enough funded and resourced collaboration team up of the ecosystem that we can once again become number one at open frontier research uh because that should be so hard it shouldn't you talked to like yeah it shouldn't And then I was like, "How about we just put this conference together?" They're like, "That's a good idea." Okay, we're doing it. Well, thanks for kicking it off. I'm excited to support it in any way I can. And I'm see I'm sure you'll be there and I'm sure everybody will be like, "How do we get on with Swix?" Thank you so much. Yeah, my pleasure.

Others You May Like