NNuggets
BookmarksCollections
  • About Us
  • Terms of use
  • Privacy policy
  • Disclaimer
  • Copyright & Takedown Policy
  • Community Guidelines
  • Cookie Policy
  • Contact

© 2026 Nuggets

NuggetsMarket PulseCollections

On this page

Speakers & Credentials

  • Speakers & Credentials
  • 1. Executive Summary
  • 2. Chronological Table of Contents
  • 3. Detailed Thematic Summary
  • The Reference Vault
  • 4. Data & Figures
  • 5. Core Frameworks & Mental Models
  • 6. Anecdotes
  • 7. References & Recommendations

On this page

  • Speakers & Credentials
  • 1. Executive Summary
  • 2. Chronological Table of Contents
  • 3. Detailed Thematic Summary
  • The Reference Vault
  • 4. Data & Figures
  • 5. Core Frameworks & Mental Models
  • 6. Anecdotes
  • 7. References & Recommendations
Technology/March 26, 2026/11 min read/youtu.be

The 20-year journey to fully autonomous cars with Dmitri Dolgov of Waymo | Stripe

Source
Source
Watch on YouTube ↗

"We've clearly moved past the stage of scientific research and kind of deep core technology development to this new phase of accelerated global scaling and deployment." - Dmitri Dolgov [00:23:47]

"It's deceptively easy to get started but it is super hard to go the full distance and edge... the engineering rule of thumb that every next nine takes 10x more." - Dmitri Dolgov [01:00:24]

References

  1. Original source (youtu.be)

Disclaimer: Orignal content owned by or sourced from third parties. It does not represent the views of 'Nuggets' platform or it's team. AI is used extensively across this platform including for summaries. Accuracy is not guaranteed, there can be mistakes. Any info or content on this platform is not a financial, legal, or investment advice. Do your own research. Refer for complete disclosures:- Terms of Use · Full Disclaimer

Related nuggets

Jun 2, 2026

AI Is Escaping the Screen | 01 Jun 2026 | Coatue

Coatue : AI is entering a new phase: moving beyond digital tools and into fully autonomous systems operating in the physical world. From advanced manufacturing and surgical robotics to robots in the home, the next wave of innovation will b…

Jun 2, 2026

Kalshi Monthly Volume - Politics ($M) | Chart of the Day | Coatue

Coatue: Kalshi's political volume has scaled dramatically, and the American Power Index KPOW is what that scale enables: a single number gauge of the current balance of political power and where markets expect it to move, which Kalshi bill…

Jun 2, 2026

The BlackBerry Problem |18 May 2026 | The Mistakes Series | Malcolm Gladwell's Revisionist History

"My mistake and naivity was to think that people are were with me so you're flying around the world you're trying to get people on side and you think they're on side but they're not mhm mhm and you get blindsight" Jim Balsillie 00:01:34 ht…

Jun 2, 2026

Partnership Perspectives: Network International | 2 Jun 2026 | Brookfield Perspectives

Actions

Reading

Published
March 26, 2026
Read time
11 min read
Progress0%

"The core technology generalizes really well... increasingly we're finding especially now that we're able to hook the Waymo AI to the AI in the digital world, the VLMs, and inherit the general world knowledge... we're seeing really strong results from zero-shot or few-shot learning." - Dmitri Dolgov [00:25:06]

"If I think about the hardest parts of building a fully autonomous rider-only system, they are very different from what you do for a driver assist system. I don't want to say you can't make the jump, but it is a qualitative jump." - Dmitri Dolgov [00:50:04]

"Slow is smooth and smooth is fast... if everybody was a smooth, predictable, and consistent driver, you would still have those traffic jams, but the time constant to clean it out I think would be very different." - Dmitri Dolgov [00:56:32]


Speakers & Credentials

  • Dmitri Dolgov: Co-CEO of Waymo. A former physicist and applied mathematician who joined Google’s self-driving car project in 2009 as one of its foundational engineers. He took over as co-CEO in 2021 and has guided the project from an experimental "moonshot" into an active global commercial transit fleet.
  • John Collison (Host): Co-Founder of Stripe. Conducting the interview from the perspective of enterprise infrastructure and economics.

1. Executive Summary

  • Waymo has officially crossed the chasm from experimental scientific research into a massive phase of commercial scaling, currently executing 500,000 fully autonomous rides per week globally.
  • The architecture driving this capability rejects binary "end-to-end vs. modular" dogmas; instead, it relies on a hybrid design where massive off-board Foundation Models are specialized via simulated "Teachers" and then distilled into highly efficient models for real-time onboard inference.
  • Dolgov definitively counters the industry narrative that advanced autonomy is an extension of Level 2 Driver-Assist features, positing that solving the final "99.999%" edge cases requires a fundamentally distinct technological leap utilizing multi-modal sensor fusion (Cameras, Lidars, Radars).
  • The system operates completely independently of the cloud for safety-critical inference, demonstrating emergent AI spatial reasoning capabilities that go far beyond geometric pixels to interpret the complex social behaviors inherent to driving.

2. Chronological Table of Contents

  • [00:00:01] Intro & Dmitri's Path from Russia to Google
  • [00:02:52] The Waymo Tech Stack: Sensors & Onboard Inference
  • [00:06:24] Architectural Debates: End-to-End vs Modular & The 3 Teachers
  • [00:11:59] Encoding the Physical World & Limits of VLMs
  • [00:19:49] Optimization Variables: Safety, Smoothness, and The Drop-Off Problem
  • [00:24:15] Operating Domains & The Evolution of Waymo Driver Generations (Gen 4 to Gen 6)
  • [00:29:38] Autonomous Economies: Stripe Agent Commerce
  • [00:35:18] Hardware Economics: Lidar vs. Radar & Sensor Fusion
  • [00:40:25] Emergent AI Behaviors & The "Pedestrian Behind the Bus"
  • [00:46:40] Business Metrics, Scaling Velocity & Operations
  • [00:54:53] Second-Order Effects: Traffic Jams & Urban Redesign

3. Detailed Thematic Summary

Dmitri's Background and Soviet Academic Roots [00:00:21]

  • Dmitri Dolgov grew up in the Soviet Union with a physicist father, moving to Japan (Kyoto University) and eventually Berkeley before deciding his next educational step [00:00:21].
  • Despite obtaining a US green card and his family's hesitation, he returned to Russia in 1994 to earn his bachelor's and master's degrees in physics and applied math, drawn exclusively by the extreme rigor of the Russian mathematical and scientific institutions [00:01:00].
  • A diaspora pattern is noted by the host, connecting Dolgov's background to founders like Nikolay Storonsky of Revolut and Alex Gerko of XTX Markets, illustrating a pipeline of Soviet-era math rigor powering modern Western tech infrastructure [00:02:20].

The Anatomy of the Waymo Driver: Sensors and Real-Time Inference [00:02:52]

  • The primary hardware architecture relies on three fused sensing modalities—cameras, lidars (lasers), and radars—providing a continuous 360-degree spatial awareness around the vehicle [00:03:41].
  • The Zero-Cloud Imperative: All active AI generation and driving decision-making is processed locally onboard. Absolutely nothing related to real-time trajectory is sent to the cloud [00:05:25].
  • Cloud queries are strictly fenced off for non-real-time convenience processing, such as analyzing off-board models to detect if a passenger left a phone behind or dirtied the cabin [00:05:41].

Architectural Philosophy: The Foundation Model and The Three Teachers [00:06:24]

  • Waymo avoids the naive "end-to-end vs modular" debate, utilizing a vast off-board Foundation Model that innately maps the physical world and driving semantics [00:07:22].
  • This macro model is specialized into three heavy computational off-board models known as the "Three Teachers": the Waymo Driver, the Simulator, and the Critic [00:07:44].
  • These dense models are too heavy for a car; thus, they are distilled into smaller, highly optimized "student" models capable of ultra-fast inference locally within the vehicle's compute unit [00:07:53].
  • The role of the Critic is crucial—it sifts through immense datasets to find interesting edge events, acting as the opinionated mechanism for what constitutes "good vs. bad" driving behavior in the generation of reward functions [00:09:18].

The Limitations of Pure "End-to-End" Models (Pixels-to-Trajectories) [00:11:59]

  • While it is technically possible to take an off-the-shelf Vision Language Model (VLM) and fine-tune its text decoder to output physical steering trajectories (as Waymo demonstrated in their internal Emma paper), it is practically non-viable for long-tail autonomy [00:14:11].
  • Pure end-to-end setups (pixels in, steering out) work well in safe "nominal" conditions but fail catastrophic edge-cases because they lack multi-agent reasoning. Driving is fundamentally a social negotiation with cascading downstream variables, much like a conversation [00:15:42].
  • To solve this, Waymo leverages Reinforcement Learning Fine-Tuning (RLFT) utilizing "Intermediate Representations." By outputting structured constants (stop signs, objects, semantic road rules) instead of just raw pixels, engineers can build robust, deterministically manipulatable simulations [00:17:16].

Operating Domains, Generalizability, and Platform Generations [00:24:15]

  • Waymo is completely out of the R&D lab phase. The first fully autonomous operations launched in 2020 in Chandler, Arizona via the 4th Generation Driver mounted on Chrysler Pacifica vans [00:27:17].
  • The leap to the 5th Generation (Jaguar I-PACE) was a discontinuous step-change characterized by implementing an "AI as the backbone" architecture replacing fragmented subsystems, unlocking simultaneous expansion into radically different topographies like San Francisco and Phoenix [00:28:10].
  • While foundation VLMs offer excellent "zero-shot" learning for general logic, physics still matters. Extreme variables like winter ice change the physical stack inherently, demanding hardware heaters, cleaners, and entirely distinct mechanical control algorithms that AI alone cannot code away [00:25:46].

Autonomous Economies & Commerce [00:29:38]

  • As autonomy permeates the physical world, it is driving second-order changes in commerce where agents (software or physical) transact without human intervention.
  • Stripe is facilitating this infrastructure via its Agent Commerce Suite, allowing AI agents to navigate and initiate economic transactions across the web [00:30:06].

The 6th Generation Vehicle and Sensor Economics [00:30:17]

  • The upcoming 6th Generation platform is completely bespoke—designed around the passenger, omitting standard driver ergonomics, featuring sliding doors and a completely flat floor while maintaining a footprint barely larger than the current Jaguar [00:32:04].
  • This AI stack will also port to commercial platforms like the Hyundai Ioniq 5 [00:36:09].
  • Sensor Deflation: Waymo's Gen 6 hardware stack costs a fraction of legacy hardware, tracking close to premium commercial ADAS systems. Automotive Radars have plummeted to tens of dollars per unit [00:35:33].
  • Physics of Fusion: Radars are immune to severe fog, blinding sun, and dense rain where cameras fail, while Lidar provides the millions of physical depth-pulses per second necessary for pinpoint geometric confidence [00:38:09].

Emergent AI Behavior & Scaling Metrics [00:40:25]

  • The exponential scale of AI training leads to magical emergent behavior. In a profound incident, a Waymo vehicle stopped for a pedestrian completely hidden behind a city bus by fusing faint peripheral Lidar pulses that bounced under the bus chassis off the pedestrian's feet [00:44:16].
  • Waymo's commercial metrics are accelerating drastically: managing a fleet of roughly 3,000 cars facilitating 500,000 fully autonomous rides per week [00:46:40].
  • The vehicles log 4,000,000 miles per week across 11 U.S. cities (10 commercial, 1 ghost/test city in Nashville) [00:46:50].
  • Highlighting the velocity of AI compounding: Waymo recently unlocked service in 4 new cities in a single day, an expansion footprint that previously took them 8 grueling years of development [00:47:32].

Operational Automation and Second-Order Effects [00:50:53]

  • Depot management is highly orchestrated. Autonomous units identify their own cleaning or charging needs and auto-route to designated fleet stalls. If a vehicle identifies a mess, it requests a human via emojis on its sensor dome [00:52:37].
  • The societal impacts will be profound. Widespread autonomy will eliminate the "standing wave" effect of chaotic human traffic jams, maintaining systemic velocity [00:56:32].
  • Fundamentally, municipalities will be able to completely redesign urban grids by reclaiming massive parking garages and lots, as the robotaxi fleets will operate dynamically rather than remaining parked 90% of the time [00:57:13].

The Reference Vault

4. Data & Figures

Data PointValueContextTimestamp
Year Dmitri Returned to Russia1994Left the US to get his BS/MS in Russia due to respect for Russian math/physics rigor.[00:01:00]
Primary Sensing Modalities3Cameras, Lidars, Radars providing dense 360-degree coverage.[00:03:41]
Off-board AI Teachers3The Waymo Driver, the Simulator, and the Critic models used for foundation training.[00:07:44]
Gen 4 Commercial Launch Year2020Launch of fully autonomous commercial operations in Chandler, AZ (Chrysler Pacificas).[00:27:17]

5. Core Frameworks & Mental Models

  • The "Three Teachers" Distillation Framework [00:07:44]:
    • Application: A software engineering paradigm moving from a massive generalized Foundation Model into three specialized off-board high-capacity models (Driver, Simulator, Critic). These generate and evaluate scenarios, compiling their intelligence into a lightweight "student" model constrained strictly for ultra-fast, local onboard inference.
  • Intermediate Representation over Pure End-to-End [00:17:16]:
    • Application: Rejecting a simplistic "pixels in, steering out" VLM architecture that cannot mathematically guarantee "superhuman safety," Waymo forces models to output intermediate representations (bounding boxes, stop signs, pedestrians). This structured intermediate code allows for deterministic testing loops, Reinforcement Learning Fine-Tuning (RLFT), and exact safety-reward parameterization.
  • Operating Domain Expansion (The 5th Gen Leap) [00:28:10]:
    • Application: The conceptual framework of scaling AI software not by city borders (e.g., San Francisco vs. Phoenix) but by elemental domains (highways, fog, dense traffic). Changing from fragmented subsystems (Gen 4) to an integrated "AI-backbone" (Gen 5) permitted generalizable "zero-shot" learning, drastically accelerating multi-city launches.
  • The Structural Divergence of ADAS vs. Full Autonomy (L2 vs. L5) [00:49:29]:
    • Application: Dolgov challenges the popular narrative that standard Driver Assist Systems (Level 2) will linearly evolve into Level 5 Robotaxis over time. He treats them as completely separate qualitative engineering branches. Securing the final 0.001% safety barrier requires fusion architectures, off-board simulation machinery, and predictive social models that basic ADAS iterations simply cannot brute-force.

6. Anecdotes

  • The Pedestrian Behind the Bus [00:44:16]: In an incredible example of emergent AI behavior, a Waymo in San Francisco unexpectedly halted for a pedestrian entirely hidden behind a transit bus. The engineering team originally suspected a false positive or sensor glitch. Upon deep log inspection, they realized that the vehicle’s peripheral Lidars were bouncing pulses underneath the bus chassis. Detecting just the faint, noisy motion of the pedestrian's feet, the multi-modal AI correctly inferred the geometry, trajectory, and intent of the invisible human body, safely halting the vehicle.
  • Returning to a Fallen Empire for Math [00:01:00]: Despite the Soviet Union having recently collapsed, and despite possessing a coveted US Green Card, a young Dmitri moved out of the United States back to Russia in 1994 against his family's warnings. He was motivated purely by his absolute reverence for the demanding and globally unmatched rigor of Russian academic physics and mathematics, a foundation that ultimately led him to the peak of Western AI infrastructure.

7. References & Recommendations

  • Companies & Ecosystems: Waymo, Google / Alphabet, Stripe, Revolut, XTX Markets, Chrysler (Pacifica platform), Jaguar (I-PACE platform), Hyundai (Ioniq 5 platform).
  • Tools & Software: Stripe Agent Commerce Suite (infrastructure for AI agents to initiate transactions natively).
  • People: Dmitri Dolgov (co-CEO, Waymo), Nikolay Storonsky (Revolut), Alex Gerko (XTX Markets), Larry Page & Sergey Brin.
  • Concepts & Papers: "Emma" (Waymo's internal VLM paper), RLFT (Reinforcement Learning Fine-Tuning), Transformers, VLMs (Vision-Language Models), ImageNet, ADAS (Advanced Driver Assistance Systems).

"Brookfield's the largest infrastructure owner in the world... We drew a pipeline and we showed all the different components of the payments ecosystem on a pipeline and said it's like a pipe that moves any commodity except what it's moving…

Active Autonomous Fleet Size~3,000 unitsTotal fully autonomous vehicles currently deployed globally by Waymo.[00:46:40]
Autonomous Rides per Week500,000Number of rider trips served routinely by the Waymo fleet.[00:46:40]
Autonomous Miles per Week4,000,000+Total fully autonomous mileage accumulated on a weekly basis.[00:46:50]
Operating Cities (Total)11Cities Waymo operates in (10 with public riders, 1 mapping/ghost testing city).[00:47:08]
Same-Day City Launches4Number of new cities simultaneously opened to public riders on a recent launch day.[00:47:32]
Development Time for first 4 cities8 YearsTime duration from initial rider-only ops to opening access across 4 operational cities.[00:47:32]