This episode dissects the intricate AI rivalry between the US and China, revealing how economic models, compute access, and cultural dynamics are shaping the future of AI investment and innovation.
The Sustainability of China's Open-Source Dominance
- Economic Surplus is Key: Alex points out that US tech companies historically have higher gross and net margins, creating a larger economic surplus to fund long-term, speculative R&D. For Chinese companies to sustain their open-source push, they need a similar uplift in profitability.
- Engineering Prowess: Despite economic constraints, Alex is bullish on China's ability to sustain its efforts. He notes that Chinese firms excel at reverse-engineering and rapidly pushing the boundaries of known technologies. He observes, "I already see 20-ish articles that talk in depth of breaking down the OpenAI release on the open source models and this is a very bullish form of commercialization push."
- Commercialization Flywheel: This rapid commercialization cycle, driven by a strong domestic demand for monetization, creates a reinforcing loop. Quick application of open-source models generates revenue, which can then be reinvested into further R&D, potentially driving the necessary economic surplus.
Incentives, Capabilities, and the Shift to Post-Training
- Broad Business Ecosystems: Alex explains that major Chinese tech companies have incredibly wide business spans, with "tentacles" in e-commerce, payments, social media, and more. Open-sourcing a model allows them to kickstart a flywheel where developers build applications on the edge, and the resulting data and improvements feed back into their core products.
- The Post-Training Paradigm Shift: The discussion shifts to the growing importance of post-training, a method of refining models using real-time feedback, often through reinforcement learning. Alex argues this could benefit China. While pre-training requires massive, centralized compute clusters (often using sanctioned NVIDIA chips), post-training relies more on large-scale inference, which can be accomplished with less advanced or domestically produced hardware like Huawei's chips or older NVIDIA A100s.
- Strategic Implication: This shift from a pure capex game (pre-training) to a data and inference game (post-training) could allow Chinese firms to partially circumvent US export controls and remain competitive, a critical trend for investors to monitor.
Cultural Divides: Silicon Valley vs. Chinese Tech
- Execution Velocity: "Under clarity, the execution velocity is 10x," Alex states, highlighting that once a path is clear, Chinese teams execute with phenomenal speed.
- Drive for Commercialization: There is a much stronger and more immediate drive to monetize new technology in China. Small, agile teams rapidly take open-source models and build vertical applications, serving a fragmented, long-tail market.
- The "996" Reality: Alex demystifies the 996 work culture (9 a.m. to 9 p.m., 6 days a week). While the hours are long, he notes that inefficiencies exist within large corporations, with long breaks and less-than-productive time. The real differentiator is not just hours worked, but the intense focus on execution when a commercial goal is in sight.
- Innovation Models: Alex suggests Chinese tech culture excels at "one-to-x" innovation (scaling and improving existing ideas) rather than "zero-to-one" (foundational breakthroughs). The high risk and massive upfront capital required for early AI model development were better suited to the US VC ecosystem.
A Tour of China's AI Labs
- The Startups (Moonshot, MiniMax, Zhipu AI):
- Moonshot AI: Known for a viral consumer product but faced some setbacks. Now attempting a rebound.
- MiniMax: Pivoted to overseas markets, focusing on character/role-playing AI. A significant portion of its revenue comes from outside China, and it is reportedly heading for an IPO in Hong Kong.
- Zhipu AI: Receives strong government support and is expected to IPO on China's STAR Market (the "tech NASDAQ"), likely focusing on government and state-owned enterprise contracts.
- DeepSeek: Backed by a quant trading firm, it has produced strong open-source models. Its future is tied to potential state capital injections, positioning it as a key player in the government's AI ambitions.
- The Incumbents (Alibaba, ByteDance, Tencent):
- Alibaba: Alex's top pick due to its strong engineering on open-source models (like its Qwen series) and its vast e-commerce and payment ecosystem, which facilitates rapid deployment and mindshare.
- ByteDance: The most profitable company in China, with a treasure trove of multimodal user data (video, audio, social). It is well-positioned to create a "super intelligence" flywheel, similar to Meta's ambitions.
- Tencent: While previously lagging, its dominance in social media via WeChat is a massive advantage. Its new AI agent, Yuanbao, is being integrated directly into the app, giving it unparalleled distribution for a personal assistant.
The GPU Constraint Paradox
- Price vs. Access: Alex clarifies that low prices for inference-grade chips suggest an oversupply for current, simpler use cases like image generation. The real constraint is on acquiring the highest-end chips (like NVIDIA's H100/B200) needed for cutting-edge pre-training at scale.
- Procurement Dilemma: Chinese tech giants face a dilemma: they want to buy NVIDIA's best available chips, but the government pressures them to procure locally. However, local suppliers often can't meet demand or performance requirements.
- Gray Market: Despite official controls, a "white glove" gray market exists, particularly in cities like Shenzhen and Guangzhou, allowing determined buyers to acquire sanctioned hardware, albeit without a predictable supply chain.
- Investor Takeaway: China is not completely cut off from high-end compute, but the lack of a stable, predictable supply chain for leading-edge chips remains a significant bottleneck for building massive, next-generation training clusters.
China's Industrial Policy and the "Land Financing" Model
- The "Land Financing" Loop: Alex explains land financing, a model where local governments sell land leases to developers to fund infrastructure and industrial projects (like EV or chip factories). This attracts talent, creates housing demand, and drives up real estate values, which in turn generates more revenue for the government.
- The Break in the Loop: This model breaks down when state-led investment leads to overcapacity and a "rat race" among provinces. As seen in the EV sector, massive subsidies created an oversupply, forcing companies into a price war that erodes profitability. When the real estate market cools, the entire financing loop falters.
- Social Consequences: Pondering Durian adds that this model drove housing-to-income ratios in tier-one cities to unsustainable levels (30-40x), contributing to demographic challenges and social phenomena like "Tangping" (or "lying flat"), where young people opt out of the competitive rat race.
The Taiwan Question and Semiconductor Supply Chains
- Takeover is Not Straightforward: Even if China were to take over Taiwan, controlling TSMC's fabs is not simple. Alex notes that critical lithography machines from ASML can be shut off remotely. Furthermore, the complex supply chain for maintenance and replacement parts (a single EUV machine has ~450,000 components) would be nearly impossible to replicate.
- US Hedging Strategy: The US is already hedging this risk by incentivizing TSMC to build advanced fabs in Arizona and fostering local manufacturing capabilities. The goal is to reduce dependency on Taiwan for the most advanced nodes (sub-5nm).
- Trailing Edge Dynamics: While China is rapidly building capacity in mature "trailing edge" nodes (16nm and above), Alex believes the US sees this as less of a national security threat. The US still dominates in the design and IP for these chips (e.g., Texas Instruments, NXP), creating a durable competitive advantage that is difficult for Chinese firms to copy.
The Future of Compute: Beyond the Chip
- System-Level Innovation: The bottleneck is no longer just about a single chip's performance but about how thousands of chips communicate. This drives innovation in areas like co-packaged optics (CPOs), which integrate optical interconnects directly onto chips to boost bandwidth for rack-to-rack communication.
- Infrastructure as the Moat: The competition is scaling up to the data center level. This includes holistic solutions for power (even small-scale nuclear), cooling, and management to lower the total cost of ownership (TCO).
- Investment Thesis: Alex suggests investors look at companies enabling this system-level buildout. "My bet is that at the communication front, there's going to be key breakthroughs in certain players, IP players especially... Second on the system level to infrastructure buildout there are holistic service providers that will actually give you really good TCOs."
2035: A World of Renewed Competition
- Architectural Shifts: He anticipates at least two major AI architectural breakthroughs in the next decade, moving beyond the current GPU-dependent, auto-regressive models. Future models may be far more efficient and less reliant on advanced manufacturing, neutralizing some of the US's current hardware advantages.
- Geopolitics vs. Innovation: Alex firmly believes that "innovations will not be hindered by geopolitics." The secular trend of improving productivity through technology will drive both nations forward.
- A Leveled Playing Field: As China undergoes painful but necessary economic reforms and new AI architectures emerge, he predicts the competitive ground will level. The race will become truly global again, driven by market-based competition rather than strategic divides.
Conclusion
This episode reveals that the US-China AI race is not a simple hardware arms race but a complex interplay of economic models, software innovation, and strategic adaptation. Investors and researchers must look beyond chip sanctions to understand how shifts in AI architecture and China's internal economic reforms will define the next decade of competition.